Skip to content
mimi

AI Data Engineer

Insight Global

Anand · On-site Full-time 3w ago

About the role

Role Summary

We are hiring an AI Data Engineer specializing in AWS or Azure to build vector stores, embedding pipelines, RAG architectures, and document-processing systems that power enterprise AI applications.

Responsibilities

• Build scalable ETL/ELT pipelines in AWS or Azure to support unstructured data ingestion • Use services such as Azure Document Intelligence, AWS Textract, or other OCR tools to extract content at scale • Design and manage embedding pipelines, chunking strategies, and vector database integrations (Azure AI Search, etc.) • Develop retrieval and orchestration pipelines for RAG use cases • Implement resilient CICD workflows and production-grade logging/error-handling • Collaborate with platform and application teams to integrate LLM-powered features into enterprise apps • Deploy services using functions, containers, event-driven workflows (AWS Lambda, Azure Functions, etc) • Ensure solutions meet security, compliance, and performance requirements Required Qualifications • 3+ years in data engineering or AI infrastructure roles • Expertise in AWS or Azure (not required to know both) • Hands-on experience with vector stores and embedding pipelines • Strong Python development experience • Experience with OCR/document intelligence tools • Strong familiarity with LLMs, RAG architectures, embeddings, and retrieval techniques • Experience with CICD and software engineering best practices

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free