Principal Engineer
Qlay
About the role
Qlay Technologies, Inc. is a global technology solutions provider with offices in San Francisco and Tokyo. We are currently seeking a • *Principal Engineer**
to join one of our clients. • *** • *As a Principal Engineer**
, you will: • Own and evolve the architecture of cloud and on-prem infrastructure • Lead design and implementation of highly available, scalable, and secure systems • Set infrastructure standards for reliability, observability, performance, and cost efficiency • Drive company-wide initiatives around availability, disaster recovery, and capacity planning • Act as a technical authority for infrastructure, SRE, and platform-related decisions • Review and guide complex infrastructure designs, migrations, and incident responses • Identify systemic risks and proactively reduce operational and security debt • Mentor senior engineers and raise the bar for infrastructure engineering practices • Partner with product, security, and leadership to align infra strategy with business goals • *** • Preferably graduated from a top Vietnamese university (HCMUT, HCMUS, HUST, UIT, etc.) and/ or top university around the world. • 6+ years of experience in infrastructure, systems, or platform engineering • Deep expertise in cloud platforms (AWS, GCP, or Azure) and cloud-native architecture • Strong background in Linux, networking (TCP/IP, DNS, load balancing), and distributed systems • Extensive experience with Infrastructure as Code (Terraform, CloudFormation, Pulumi, etc.) • Proven experience designing and operating high-availability, production systems • Strong incident management and root cause analysis experience • Excellent communication and technical leadership skills
Technical Skills • Cloud services: compute, networking, storage, IAM • Containers and orchestration (Docker, Kubernetes) • CI/CD systems and deployment automation • Observability: monitoring, logging, tracing, alerting • Security best practices (identity, secrets, encryption, least privilege) • Cost optimization and capacity planning
Nice to Have • Experience with SRE practices and SLIs/SLOs • Experience running infrastructure at scale (high traffic, global systems) • Multi-region and disaster recovery architecture • Experience with compliance-heavy environments (SOC2, ISO, HIPAA, PCI) • Background in platform or developer experience teams • *** • Paid Vacations • *** • This will be • *structured as contractor work.** • **Devices: You will be expected to use your own computer to perform the work.**
\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_ If you are interested in this position, please • Email hr@qlay.ai if you have any questions • Send your CV through the application form: https://sheets.qlay.ai/dashboard/#/nc/form/e24f9aba-77d0-4348-9bf9-c4bc8ff7d53a
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free