DevOps / DataOps Engineer (Private AI / AI Workbench) with Security Clearance
RighIT Solutions LLC
About the role
Role: DevOps / DataOps Engineer (Private AI / AI Workbench)
Location: Arlington / Ashburn, VA (Onsite)
Duration: Long-term Project

Job Description:
The DevOps/DataOps Engineer builds and operates the deployment and data foundations for AI Workbench. This role automates infrastructure provisioning, manages Kubernetes/container operations, builds CI/CD pipelines, and implements secure data pipelines and observability so AI-driven solutions run reliably in private or hybrid environments.

Key Responsibilities
• Provision and manage environments using Infrastructure as Code (IaC) for compute, networking, storage, and security controls.
• Administer Kubernetes/container platforms to host AI Workbench services, agent runtimes, model endpoints, and supporting components.
• Build and maintain CI/CD pipelines for application/services, agent configurations, and infrastructure updates with automated checks.
• Implement DataOps pipelines for RAG ingestion: secure connectors, preprocessing jobs, scheduling, data quality checks, and lineage tracking.
• Implement observability: logs, metrics, traces, dashboards, alerting, and SLO/SLI monitoring across platform and workloads.
• Harden environments: secrets management, vulnerability scanning, image signing, policy-as-code, and least-privilege access.
• Support release management, incident response, and operational handover including runbooks and knowledge transfer.
• Optimize performance and cost: resource sizing, autoscaling policies, GPU scheduling, and storage optimization.

Required Qualifications
• 5+ years in DevOps/SRE and/or DataOps roles supporting enterprise platforms.
• Hands-on experience with Kubernetes and container tooling.
• Experience building CI/CD pipelines and IaC automation.
• Strong knowledge of security practices in platform operations.
• Experience with data pipelines and ETL/ELT concepts.

Preferred Qualifications
• Experience supporting AI/ML platforms, GPU workloads, or model inference services.
• Experience with policy-as-code (OPA/Gatekeeper-like concepts) and compliance-driven operations.
• Familiarity with vector databases and search indexing operations.
• Experience with hybrid connectivity and network security patterns.

Key Skills
• Kubernetes, containers, Helm/GitOps concepts
• IaC (Terraform/Ansible), CI/CD
• Data pipelines, scheduling, data quality
• Security hardening, secrets, scanning
• Monitoring/observability (metrics, logs, tracing)
Skills: Ansible, Artificial Intelligence (AI), Automation, Autoscaling, Computer Networks, Computer Security, Continuous Deployment/Delivery, Continuous Integration, Cost Control, Data Management, Database Extract Transform and Load (ETL), DevOps, Environmental Management, GPU (Graphics Processing Unit), Incident Response, Information/Data Security (InfoSec), Knowledge Transfer, Machine Tool, Maintain Compliance, Metrics, Multiplatform/Cross-Platform, Network Connectivity, Network Security, Operations Security (OPSEC), Performance Tuning/Optimization, Release Management/Engineering, Reporting Dashboards, Security Clearance, Security Infrastructure, Vulnerability Scanners
About the Company: RighIT Solutions LLC