
AI/ML Engineer

CCS INC

Plano · On-site · Full-time · 2w ago

About the role

Benefits

  • Bonus based on performance
  • Dental insurance
  • Health insurance
  • Vision insurance

Qualifications

  • Bachelor’s Degree
  • 6+ years of cloud architecture experience
  • 3+ years building production GenAI/LLM systems on AWS.
  • Strong Python and AWS expertise, including Lambda, ECS/EKS, S3, SageMaker, Docker and Kubernetes.
  • Production experience with vector databases and designing ingestion + embedding pipelines for both batch and streaming workloads.
  • Hands-on with prompt design, evaluation, LLM orchestration, and RAG implementation patterns.
  • Experience deploying and operating model-serving or MCP-like server infrastructure (self-hosted or managed).
  • Proficient with IaC and delivery tooling, including Terraform/CloudFormation, GitOps, and CI pipelines.
  • Experience with model-serving infrastructure, such as Amazon SageMaker, NVIDIA Triton, Ray Serve, or similar platforms.
  • Hands-on experience with GenAI libraries and frameworks, including LangChain, LlamaIndex, Hugging Face, and OpenAI APIs.
  • Deep operational expertise with vector databases, such as Pinecone, Milvus, Weaviate, or Qdrant.
  • AWS Solutions Architect, AWS DevOps Engineer, or equivalent industry certifications.

Responsibilities

  • Cloud Architecture & Infrastructure: Design scalable, secure AWS architectures
  • LLM & GenAI Platforms: Lead integration of API-based and self-hosted LLMs; implement RAG solutions
  • Prompting & Evaluation: Develop prompt engineering strategies, reusable templates, and evaluation frameworks
  • Vector Databases & Retrieval Pipelines: Implement and maintain vector stores (OpenSearch, Pinecone, Milvus, Qdrant)
  • Data Ingestion & Processing Pipelines
  • Microservices & Serverless Systems
  • Python Development & AI Tooling
  • Security, Governance & Cross-Functional Leadership

Skills

AWS, AWS CloudFormation, AWS Lambda, AWS SageMaker, Docker, EKS, ECS, GitOps, Hugging Face, Kubernetes, LangChain, LlamaIndex, LLM, Milvus, NVIDIA Triton, OpenAI APIs, OpenSearch, Pinecone, Python, Qdrant, RAG, Ray Serve, S3, Terraform, Weaviate
