[Remote] Principal Site Reliability Developer - USC Required
Oracle
About the role
About Us
Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life‑saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.
True innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling 1‑888‑404‑2494 in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
Job Description
Come and join us! Building on our cloud momentum, Oracle has formed a new organization: Oracle Health. This team focuses on product deployment, sustainability, troubleshooting, and product strategy while building a modern, automated healthcare platform. This is a net‑new line of business with an entrepreneurial spirit, offering a unique opportunity to help build a world‑class engineering organization centered on excellence, innovation, and real‑world impact.
As a Site Reliability DevOps Engineer, you will play a critical role in operating and scaling a Clinical AI Assistant platform used by healthcare professionals worldwide. This system is designed to improve the quality, safety, and efficiency of care delivery for billions of patients globally. Your work will directly influence the reliability and performance of AI‑driven systems that clinicians depend on in high‑stakes environments.
This role goes beyond traditional SRE responsibilities—you will have the opportunity to leverage AI/ML techniques and develop AIOps solutions to proactively manage system reliability, detect anomalies, automate remediation, and continuously improve service performance. You will help define how reliability engineering evolves in the context of intelligent, AI‑powered healthcare systems.
You will be responsible for architecture, production operations, capacity planning, performance management, deployment, and release engineering, working across cross‑functional teams to deliver highly reliable, scalable, and secure services.
Responsibilities
- Own the architecture, design, implementation, and production operations of core platform and AI‑driven system services
- Ensure the reliability, availability, and performance of the Clinical AI Assistant platform used in real‑world healthcare settings
- Build and operate AIOps‑driven capabilities (e.g., intelligent alerting, anomaly detection, automated remediation, predictive scaling)
- Continuously improve systems through automation, self‑healing mechanisms, and real‑time observability
- Design and develop software to enhance system scalability, efficiency, and resilience
- Partner with cross‑functional teams to prototype and deliver new platform services
- Lead efforts in capacity planning, demand forecasting, performance tuning, and cost optimization
- Solve complex distributed systems challenges in cloud‑native environments and prevent recurrence through engineering rigor
- Contribute to platform engineering best practices, including infrastructure as code, CI/CD, and service reliability standards
- Stay current with emerging technologies in cloud, distributed systems, and AI/ML‑driven operations
Key Requirements / Experience
Must‑have
- Ability to obtain and maintain a federal security clearance (US citizenship required)
- 8+ years of experience in Site Reliability Engineering, DevOps, or related roles
- Proven experience operating large‑scale, distributed, production systems with high availability requirements
- Strong experience with container orchestration (Kubernetes, Docker, or similar)
- Infrastructure as Code expertise (Terraform, Ansible, Chef, Puppet, Packer, etc.)
- Experience building and operating CI/CD pipelines (Git, Jenkins, GitLab, Rundeck, etc.)
- Proficiency in scripting and automation (Bash, Python, PowerShell, etc.)
- Experience with at least one major cloud provider (OCI, AWS, Azure, etc.)
- Strong Linux systems expertise
- Experience with observability tooling (monitoring, logging, tracing) and performance optimization
Nice‑to‑have
- Experience supporting or operating AI/ML or LLM‑based systems in production
- Exposure to AIOps, intelligent automation, or ML‑driven observability
- Experience in healthcare or other regulated environments (HIPAA, security, compliance)
- Background in high‑throughput, low‑latency systems supporting mission‑critical workloads
- Software engineering experience in Java, Python, C++, or similar languages
Disclaimer
- Certain US customer or client‑facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.
- Range and benefit information provided in this posting is specific to the stated locations only.
Salary Range
US: Hiring range in USD from $86,400 to $199,500 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business. Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Benefits
- Medical, dental, and vision insurance, including expert medical opinion
- Short‑term disability and long‑term disability
- Life insurance and AD&D
- Supplemental life insurance (Employee/Spouse/Child)
- Health care and dependent care Flexible Spending Accounts
- Pre‑tax commuter and parking benefits
- 401(k) Savings and Investment Plan with company match
- Paid time off: Flexible Vacation for salaried (non‑overtime eligible) employees; accrued vacation for other eligible employees (13 days annually for the first three years, 18 days thereafter; prorated for part‑time)
- 11 paid holidays
- Paid sick leave: 72 hours upon hire, refreshed each calendar year, carrying over up to a maximum of 112 hours
- Paid parental leave
- Adoption assistance
- Employee Stock Purchase Plan
- Financial planning and group legal services
- Voluntary benefits including auto, homeowner, and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level
- IC4