Senior Software Engineer – Cloud Infrastructure Reliability & Automation - Full-time

Oracle

Washington · On-site · Full-time · Senior · $79k – $158k/yr

About the role

About Us

Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life‑saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.

True innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling 1‑888‑404‑2494 in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

Job Description

Join Oracle's Health Data Intelligence (HDI) team as a Software Engineer 3 and contribute to the reliability, scalability, and performance of our world‑class analytics platform. In this role, you will develop, maintain, and optimize the infrastructure and data pipelines that power healthcare analytics globally. You will work within a collaborative team to implement robust solutions for business intelligence and reporting, ensuring our platform handles massive datasets with precision and speed.

U.S. citizenship is required for this position, as the successful candidate will be required to obtain (and maintain) a U.S. government security clearance after hire.

Required Skills

Infrastructure & Reliability: Experience implementing and maintaining high‑availability systems with a focus on performance monitoring and fault tolerance.
Data Technologies: Proficiency in Data Warehousing platforms (e.g., Vertica, Snowflake) and ETL frameworks; understanding of columnar storage and large‑scale data processing.
BI & Reporting: Practical experience integrating or supporting Business Intelligence tools (e.g., Tableau, Power BI, Oracle Analytics) to surface data‑driven insights.
DevOps/SRE Practices: Competency in CI/CD pipelines (Jenkins, Kubernetes), Infrastructure as Code (Terraform), and observability tools (Prometheus, Grafana).
Cloud Ecosystems: Working knowledge of public cloud environments (OCI, AWS, or Azure) with an emphasis on deployment and resource management.
Problem‑Solving: Strong ability to troubleshoot complex production issues, perform root‑cause analysis, and document technical findings.
Programming & Tools: Solid foundation in Python, Java, or Go, along with containerization (Docker) and shell scripting.

Responsibilities

Work with the Site Reliability Engineering (SRE) team on the shared full‑stack ownership of a collection of services and/or technology areas.
Understand the end‑to‑end configuration, technical dependencies, and overall behavioral characteristics of production services.
Design and deliver the mission‑critical stack, with a focus on security, resiliency, scale, and performance.
Hold authority for end‑to‑end performance and operability.
Partner with development teams in defining and implementing improvements in service architecture.
Articulate the technical characteristics of services and technology areas, and guide development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio.
Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack.
Demonstrate a clear understanding of automation and orchestration principles.
Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs).
Use a deep understanding of service topologies and their dependencies to troubleshoot issues and define mitigations.
Understand and explain the effect of product architecture decisions on distributed systems.
Bring professional curiosity and a desire to develop a deep understanding of services and technologies.

Key Responsibilities

Develop & Maintain: Implement and tune infrastructure components for the Oracle HDI Analytics Platform to ensure system stability and uptime.
Data Pipeline Execution: Build and refine scalable data pipelines, leveraging Vertica and ETL processes to ensure efficient data ingestion and transformation.
BI Support: Assist in the integration and optimization of BI and reporting tools to ensure seamless data visualization for healthcare leaders.
Operational Excellence: Apply DevOps and SRE principles to automate routine tasks, manage deployments via CI/CD, and monitor system health using Prometheus/Grafana.
Cloud Integration: Support platform‑agnostic initiatives across Oracle Cloud and AWS, ensuring cost‑efficient and compliant resource usage.
Incident Response: Participate in on‑call rotations or troubleshooting sessions to resolve production issues and implement preventative fixes.
Collaboration: Work closely with senior engineers to execute technical roadmaps and provide peer reviews for code and infrastructure changes.

What You Bring

8+ years of software engineering experience, with 5+ years focused on cloud infrastructure, SRE, or DevOps
Proven ownership of production system reliability and uptime in cloud environments
Strong expertise in:
- Cloud infrastructure design and automation
- Distributed systems and performance optimization
- Data warehousing and ETL frameworks
- Columnar databases (e.g., Vertica)
Hands‑on experience with:
- Infrastructure as Code (Terraform)
- Containerization (Docker) and orchestration (Kubernetes)
- Observability stacks (Prometheus, Grafana)
Experience integrating BI/reporting tools (Tableau, Power BI, Oracle Analytics, etc.)
Proficiency in Python, Java, or Go
Strong problem‑solving skills with a track record of improving system reliability, automation, and scalability

Preferred Qualifications

Experience in healthcare or regulated environments (HIPAA, compliance frameworks)
Familiarity with Oracle HDI or large‑scale analytics platforms
Experience working in environments requiring security clearance

Why Join Oracle HDI?

Own and shape cloud reliability and automation strategy for a mission‑critical platform
Work on large‑scale, data‑intensive systems in healthcare
Be part of Oracle’s investment in AI‑driven healthcare innovation
Collaborate with top‑tier engineers solving complex, real‑world problems

Compensation & Benefits

US Hiring Range: $79,100 to $158,200 per annum (may be eligible for bonus and equity)
Oracle maintains broad salary ranges to account for variations in knowledge, skills, experience, market conditions, locations, and product lines. Candidates are placed into the range based on these factors and internal peer equity.

Benefits Package

Medical, dental, and vision insurance, including expert medical opinion
Short‑term disability and long‑term disability
Life insurance and AD&D
Supplemental life insurance (Employee/Spouse/Child)
Health care and dependent care Flexible Spending Accounts
Pre‑tax commuter and parking benefits
401(k) Savings and Investment Plan with company match
Paid time off:
- Flexible Vacation for salaried (non‑overtime eligible) employees
- Accrued Vacation for other employees (13 days annually for first three years, 18 days thereafter; prorated for part‑time)
11 paid holidays
Paid sick leave: 72 hours upon hire, refreshed each calendar year, carry‑over up to 112 hours
Paid parental leave
Adoption assistance
Employee Stock Purchase Plan
Financial planning and group legal services
Voluntary benefits including auto, homeowner, and pet insurance

Disclaimer

Certain US customer or client‑facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.
Range and benefit information provided in this posting is specific to the stated locations only.

The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.

Skills

High‑availability system design
Performance monitoring and fault tolerance
Data warehousing (Vertica, Snowflake)
ETL framework development
Columnar storage concepts
Business Intelligence integration (Tableau, Power BI, Oracle Analytics)
CI/CD pipelines (Jenkins, Kubernetes)
Infrastructure as Code (Terraform)
Observability tools (Prometheus, Grafana)
Public cloud platforms (OCI, AWS, Azure)
Python
Java
Go
Docker
Shell scripting
Troubleshooting and root‑cause analysis
Automation and orchestration
