Skip to content
mimi

CloudOps Lead Engineer

Noblesoft Technologies

Plano · On-site Full-time Lead 1mo ago

About the role

Role Overview

We are looking for a motivated and hands-on Cloud Ops Engineer to join our team and play a critical role in deploying, operating, and maintaining our cloud-based products. This role is essential to ensuring smooth deployments, high system reliability, and strong operational performance across our environments.

Key Responsibilities

Cloud Operations & Deployments

  • Support and manage cloud deployments to ensure reliable, repeatable, and high-quality releases.
  • Own day-to-day cloud operational health, including uptime, performance, and stability.
  • Work closely with engineering teams to support product deployments and operational readiness.

Automation & Scripting

  • Design, develop, and maintain automation scripts to improve deployment quality, efficiency, and consistency.
  • Continuously enhance CI/CD and operational workflows through scripting and tooling.
  • Reduce manual effort and operational risk through automation-first approaches.

Monitoring & System Health

  • Implement and maintain monitoring, alerting, and logging to ensure system health and performance.
  • Proactively identify issues, perform root cause analysis, and drive permanent fixes.
  • Ensure systems are scalable, resilient, and performant.

Architecture & Ecosystem Understanding

  • Develop a deep understanding of the platform architecture, cloud ecosystem, and dependencies.
  • Contribute to operational best practices, standards, and continuous improvement initiatives.
  • Act as a self-starter who can independently identify gaps and propose solutions.

LLM & AI Enablement

  • Apply Large Language Models (LLMs) in cloud operations use cases such as automation, observability, diagnostics, or operational intelligence.
  • Stay current with advancements in LLMs and AI-driven tooling and apply them pragmatically within the Cloud Ops domain.
  • Collaborate with engineering teams to integrate LLM-based capabilities into operational workflows.

Technical Skills

Required Skills & Qualifications

  • Hands-on experience with cloud platforms (Azure.
  • Strong scripting skills (e.g., Python, Bash, PowerShell, or similar).
  • Experience with deployment pipelines, automation, and monitoring tools.
  • Solid understanding of cloud infrastructure, networking, and application operations.

LLM & AI Experience

  • Practical experience working with Large Language Models (LLMs).
  • Familiarity with applying LLMs to engineering or operational workflows is required.

Professional Attributes

  • Strong desire to learn and deeply understand complex systems.
  • Self-starter with the ability to take ownership and drive initiatives independently.
  • Demonstrates leadership, accountability, and problem-solving mindset.
  • Strong collaboration and communication skills

Skills

AzureBashCI/CDCloud OperationsDockerKubernetesLLMMonitoringNetworkingObservabilityPowerShellPythonScripting

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free