Skip to content
mimi

Cloud Operations Engineer

CMK Resources, Inc.

Farmington · Hybrid Full-time Mid Level Today

About the role

CMK Resources is seeking a hands-on Cloud Operations Engineer with strong experience in Datadog monitoring and observability to support and improve a production cloud environment.

This role is focused on real operational ownership — improving monitoring, reducing alert noise, strengthening incident response, and helping standardize infrastructure across AWS environments.

If you enjoy working on production systems, solving reliability challenges, and building better monitoring practices, this is a great opportunity.

Responsibilities

  • Design, implement, and improve monitoring and observability solutions using Datadog
  • Configure alerts, dashboards, and logging to improve system visibility
  • Reduce alert noise and improve incident response workflows
  • Support on-call rotations, alert escalation, and root cause analysis
  • Work with engineering teams to improve system reliability and performance
  • Help standardize infrastructure using Terraform or Infrastructure-as-Code tools
  • Monitor and support AWS cloud environments (compute, networking, storage)
  • Improve operational processes and automation across cloud systems

Qualifications

  • Hands-on experience with Datadog is required (must be able to configure monitors, dashboards, and alerts)
  • 4–8+ years of experience in cloud operations, SRE, or DevOps roles
  • Experience with incident response and on-call environments
  • Strong experience with AWS cloud infrastructure
  • Hands-on experience with Terraform, CloudFormation, or Ansible
  • Experience supporting Linux and/or Windows systems
  • Familiarity with monitoring and alerting best practices

Nice to Have

  • Experience with PagerDuty, OpsGenie, or similar incident management tools
  • Exposure to Prometheus, Grafana, or other observability tools
  • Experience with CI/CD pipelines (Jenkins, GitHub Actions, etc.)
  • Understanding of SRE principles (SLOs, SLIs, MTTR improvement)
  • Exposure to cloud cost optimization (FinOps)

Why This Role

  • Hands-on role with real impact on production systems
  • Opportunity to improve and shape monitoring and observability practices
  • Hybrid flexibility (3 days onsite)
  • Work with modern cloud and automation tools

Skills

AnsibleAWSCloudFormationDatadogDevOpsInfrastructure-as-CodeLinuxSRETerraformWindows

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free