Senior ML Engineer (LLM Systems)

Cynnovative

Arlington · On-site Full-time Senior 2mo ago

About the role

Company Overview

At Cynnovative, we leverage machine learning, computer science, and software engineering to address high-impact problems in the cyber domain, specifically those which are critical to U.S. national security. We primarily extend fundamental research to invent, design, develop, and deploy prototype solutions that support persistent problems in this domain.

Job Overview

As a Senior ML Engineer (LLM Systems) at Cynnovative, you will be responsible for developing and managing tools that facilitate LLM experimentation and deployment. This role is crucial in ensuring seamless integration and operation of machine learning models in various environments, supporting U.S. national security efforts.

NOTE: This role requires an active TS/SCI security clearance and is located on-site in Northern Virginia.

Responsibilities

Design and build scalable LLM systems for high-throughput experimentation and inference
Optimize inference performance (latency, throughput)
Batching, caching, and request scheduling
Efficient GPU/CPU utilization and memory management
Design and deploy containerized ML services (e.g., Docker, Kubernetes)
Lead development of experimentation infrastructure
Build frameworks for large-scale experiment sweeps and parallel execution
Support distributed execution across compute environments
Ensure fault tolerance, retry logic, and reproducibility
Ensure production readiness and operational reliability of LLM systems
Implement testing strategies and validation pipelines
Design APIs and model serving systems
Support deployment across dev/staging/prod environments
Maintain observability (logging, monitoring, tracing)
Debug and resolve issues in production systems
Deploy systems in secure or constrained environments
Collaborate cross-functionally
Work closely with applied mathematicians and research engineers
Provide technical leadership and mentorship
Establish engineering best practices

Requirements

B.S. in Computer Science, Software Engineering, or related field (M.S. or Ph.D. preferred)
Strong communication skills and cross-functional collaboration
Deep understanding of transformer architectures and LLM inference workflows
Hands-on experience building scalable ML systems
Proficiency in Python and ML frameworks (e.g., PyTorch, Hugging Face)
Experience with distributed systems or large-scale compute environments
Experience with containerization and cloud platforms (Docker, Kubernetes, AWS/GCP/Azure)
Familiarity with CI/CD workflows for ML systems
Experience with version control systems (Git)
U.S. Citizenship and active TS/SCI security clearance

Desired Skills

Experience with high-performance LLM inference frameworks (vLLM, SGLang, etc.)
Experience deploying real-time inference APIs
Understanding of ML system tradeoffs (latency vs throughput vs cost)
Experience bridging research and production systems
Familiarity with cyber-related data, tools, and techniques

Skills

AWSAzureDockerGCPGitHugging FaceKubernetesLLMMachine LearningPyTorchPythonTransformer Architectures

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free

Senior ML Engineer (LLM Systems)

About the role

Company Overview

Job Overview

Responsibilities

Requirements

Desired Skills

Skills

Similar roles

MCP Engineer / AI Backend Engineer

Senior Database Engineer

Team Leads

Don't send a generic resume