Principal Site Reliability Engineer (SRE)

Oracle

Washington · On-site · Full-time · Lead · Today

About the Role

We are seeking a Principal Site Reliability Engineer (SRE) to provide technical leadership for the core data platforms behind Oracle Health's Data & Analytics Platform. In this role, you will own mission-critical systems that support multiple products and teams, playing a key part in how Oracle Health products revolutionize the healthcare industry.

Your Impact

  • Contribute to technology that positively impacts millions of lives.
  • Develop innovative solutions that set new standards in the healthcare sector.
  • Make a significant and immediate impact on technology development.
  • Experience unlimited career growth in an inspiring work environment.
  • Collaborate with top industry experts.
  • Thrive in an open and diverse atmosphere focused on productivity.

Responsibilities

You will lead the design and operation of large-scale, stateful distributed platforms, including Hadoop ecosystem components (HDFS, YARN, HBase), Kafka, and Storm. These platforms are managed through Ansible- and Terraform-based automation, and running them requires architectural expertise to handle their scale, ongoing change, and broad impact across systems.

Required Experience

  • 8+ years of experience operating large-scale, customer-facing distributed platforms.
  • Extensive knowledge of HDFS, YARN, HBase, Kafka, and Storm.
  • Strong troubleshooting skills in Linux, networking, and distributed systems.
  • Proficient with Infrastructure-as-Code tools such as Ansible and Terraform.
  • Expertise in scripting and automation using Python, Ruby, and Bash.
  • Experience managing Kerberized environments.
  • Ability to document and design technical architecture for complex systems.
  • Demonstrated ownership of shared platforms with broad impact.
  • Experience in designing observability and capacity models for distributed systems.

Qualifications

  • U.S. Citizenship and eligibility for a Federal Security Clearance.
  • 10+ years of relevant technical experience.
  • Strong communication skills and ability to build effective relationships with team members.
  • BS or MS in Computer Science or equivalent.

Required Skills

  • Proficiency in Python programming.

About Us

At Oracle, we integrate data, infrastructure, applications, and expertise to drive innovation across industries and improve lives. With AI embedded in our offerings, we empower our customers to achieve their goals. Join us and be part of a company committed to enabling a workforce that values diversity and inclusion, supported by competitive benefits and opportunities for community engagement.

Oracle is dedicated to providing equal employment opportunities to all qualified applicants without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, or veteran status. We encourage individuals with arrest and conviction records to apply in accordance with applicable law.

Skills

Ansible, Bash, HBase, HDFS, Hadoop, Kafka, Linux, Python, Ruby, Storm, Terraform, YARN
