Senior Software Engineer – Cloud Infrastructure Reliability & Automation - Full-time
Oracle
About the role
About Us
Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life‑saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.
True innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling 1‑888‑404‑2494 in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
Job Description
Join Oracle's Health Data Intelligence (HDI) team as a Software Engineer 3 and contribute to the reliability, scalability, and performance of our world‑class analytics platform. In this role, you will develop, maintain, and optimize the infrastructure and data pipelines that power healthcare analytics globally. You will work within a collaborative team to implement robust solutions for business intelligence and reporting, ensuring our platform handles massive datasets with precision and speed.
U.S. citizenship is required for this position, as the successful candidate will be required to obtain (and maintain) a U.S. government security clearance after hire.
Required Skills
- Infrastructure & Reliability: Experience implementing and maintaining high‑availability systems with a focus on performance monitoring and fault tolerance.
- Data Technologies: Proficiency in Data Warehousing platforms (e.g., Vertica, Snowflake) and ETL frameworks; understanding of columnar storage and large‑scale data processing.
- BI & Reporting: Practical experience integrating or supporting Business Intelligence tools (e.g., Tableau, Power BI, Oracle Analytics) to surface data‑driven insights.
- DevOps/SRE Practices: Competency in CI/CD pipelines (Jenkins, Kubernetes), Infrastructure as Code (Terraform), and observability tools (Prometheus, Grafana).
- Cloud Ecosystems: Working knowledge of public cloud environments (OCI, AWS, or Azure) with an emphasis on deployment and resource management.
- Problem‑Solving: Strong ability to troubleshoot complex production issues, perform root‑cause analysis, and document technical findings.
- Programming & Tools: Solid foundation in Python, Java, or Go, along with containerization (Docker) and shell scripting.
Responsibilities
Work with the Site Reliability Engineering (SRE) team on the shared full‑stack ownership of a collection of services and/or technology areas. Understand the end‑to‑end configuration, technical dependencies, and overall behavioral characteristics of production services. Take responsibility for the design and delivery of the mission‑critical stack, with a focus on security, resiliency, scale, and performance. Hold authority for end‑to‑end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate a clear understanding of automation and orchestration principles. Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Use a deep understanding of service topology and its dependencies to troubleshoot issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Bring professional curiosity and a desire to develop a deep understanding of services and technologies.
Key Responsibilities
- Develop & Maintain: Implement and tune infrastructure components for the Oracle HDI Analytics Platform to ensure system stability and uptime.
- Data Pipeline Execution: Build and refine scalable data pipelines, leveraging Vertica and ETL processes to ensure efficient data ingestion and transformation.
- BI Support: Assist in the integration and optimization of BI and reporting tools to ensure seamless data visualization for healthcare leaders.
- Operational Excellence: Apply DevOps and SRE principles to automate routine tasks, manage deployments via CI/CD, and monitor system health using Prometheus/Grafana.
- Cloud Integration: Support platform‑agnostic initiatives across Oracle Cloud and AWS, ensuring cost‑efficient and compliant resource usage.
- Incident Response: Participate in on‑call rotations or troubleshooting sessions to resolve production issues and implement preventative fixes.
- Collaboration: Work closely with senior engineers to execute technical roadmaps and provide peer reviews for code and infrastructure changes.
What You Bring
- 8+ years of software engineering experience, with 5+ years focused on cloud infrastructure, SRE, or DevOps
- Proven ownership of production system reliability and uptime in cloud environments
- Strong expertise in:
  - Cloud infrastructure design and automation
  - Distributed systems and performance optimization
  - Data warehousing and ETL frameworks
  - Columnar databases (e.g., Vertica)
- Hands‑on experience with:
  - Infrastructure as Code (Terraform)
  - Containerization (Docker) and orchestration (Kubernetes)
  - Observability stacks (Prometheus, Grafana)
- Experience integrating BI/reporting tools (Tableau, Power BI, Oracle Analytics, etc.)
- Proficiency in Python, Java, or Go
- Strong problem‑solving skills with a track record of improving system reliability, automation, and scalability
Preferred Qualifications
- Experience in healthcare or regulated environments (HIPAA, compliance frameworks)
- Familiarity with Oracle HDI or large‑scale analytics platforms
- Experience working in environments requiring security clearance
Why Join Oracle HDI?
- Own and shape cloud reliability and automation strategy for a mission‑critical platform
- Work on large‑scale, data‑intensive systems in healthcare
- Be part of Oracle’s investment in AI‑driven healthcare innovation
- Collaborate with top‑tier engineers solving complex, real‑world problems
Compensation & Benefits
- US Hiring Range: $79,100 to $158,200 per annum (may be eligible for bonus and equity)
- Oracle maintains broad salary ranges to account for variations in knowledge, skills, experience, market conditions, locations, and product lines. Candidates are placed into the range based on these factors and internal peer equity.
Benefits Package
- Medical, dental, and vision insurance, including expert medical opinion
- Short‑term disability and long‑term disability
- Life insurance and AD&D
- Supplemental life insurance (Employee/Spouse/Child)
- Health care and dependent care Flexible Spending Accounts
- Pre‑tax commuter and parking benefits
- 401(k) Savings and Investment Plan with company match
- Paid time off:
  - Flexible Vacation for salaried (non‑overtime eligible) employees
  - Accrued Vacation for other employees (13 days annually for first three years, 18 days thereafter; prorated for part‑time)
- 11 paid holidays
- Paid sick leave: 72 hours upon hire, refreshed each calendar year, carry‑over up to 112 hours
- Paid parental leave
- Adoption assistance
- Employee Stock Purchase Plan
- Financial planning and group legal services
- Voluntary benefits including auto, homeowner, and pet insurance
Disclaimer
- Certain US customer or client‑facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.
- Range and benefit information provided in this posting are specific to the stated locations only.
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level
- IC3