AI/ML Engineer – LLM & RAG Solutions

Incedo Inc.

Fort Mill · Hybrid · Full-time · Senior · Posted 1mo ago

About the role

Role Overview

We are seeking a skilled AI/ML Engineer with hands-on experience building and deploying LLM-powered applications in production environments. The ideal candidate will have expertise in prompt engineering, model versioning, output validation, fallback mechanisms, and Retrieval-Augmented Generation (RAG) frameworks.

The role requires experience designing secure, scalable AI solutions with policy-based input/output controls that ensure reliability, compliance, and high-quality AI responses.

Key Responsibilities

  • Design, develop, and deploy LLM-based applications and AI features in production environments
  • Build and optimize RAG (Retrieval-Augmented Generation) pipelines for enterprise use cases
  • Implement:
    • prompt engineering strategies
    • prompt and model version management
    • response validation
    • fallback and retry mechanisms
  • Develop controlled generation workflows with:
    • input filtering
    • output moderation
    • policy enforcement
  • Integrate AI models with APIs, vector databases, and enterprise applications
  • Collaborate with product, engineering, and data teams to deliver scalable AI solutions
  • Monitor model performance, hallucinations, latency, and response quality
  • Improve reliability, observability, and governance of AI systems
  • Contribute to AI architecture, experimentation, and optimization initiatives

Required Skills & Qualifications

  • 3+ years of experience building and deploying AI/ML solutions
  • Hands-on experience shipping LLM features into production
  • Strong expertise in:
    • Prompt Engineering
    • Prompt/Model Versioning
    • Output Validation
    • Fallback Handling
  • Experience with RAG architectures and semantic retrieval systems
  • Knowledge of policy-based AI controls for:
    • input validation
    • output filtering
    • safe AI responses
  • Strong programming skills in Python
  • Experience with:
    • LangChain / LlamaIndex
    • OpenAI / Anthropic / Gemini APIs
    • Vector databases (Pinecone, Weaviate, Chroma, pgvector)
  • Familiarity with cloud platforms such as AWS, Azure, or GCP

Preferred Skills

  • Experience with AI observability and evaluation frameworks
  • Exposure to guardrails, hallucination mitigation, and AI governance
  • Knowledge of CI/CD and MLOps practices
  • Experience working in enterprise or regulated environments

Skills

AWS, Azure, Chroma, Gemini API, GCP, LangChain, LlamaIndex, OpenAI API, Pinecone, Python, RAG, Weaviate
