Skip to content
mimi

Machine Learning Engineer - Generative AI

Qualcomm

San Diego · On-site Full-time Executive 6d ago

About the role

About the position

We are seeking an experienced Machine Learning Engineer specializing in Generative AI to join our core AI team. The ideal candidate will be responsible for designing, developing, and deploying cutting-edge generative AI solutions, with a focus on Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and Intelligent agent systems.

Responsibilities • Design and implement RAG-based solutions to enhance LLM capabilities with external knowledge sources • Develop and optimize LLM fine-tuning strategies for specific use cases and domain adaptation • Create robust evaluation frameworks for measuring and improving model performance • Build and maintain agentic workflows for autonomous AI systems • Collaborate with cross-functional teams to identify opportunities and implement AI solutions

Requirements • Bachelor's or Master's degree in Computer Science, or related technical field • 3+ years of experience in Machine Learning/AI engineering • Strong programming skills in Python and experience with ML frameworks (PyTorch, TensorFlow) • Practical experience with LLM deployments and fine-tuning • Experience with vector databases and embedding models • Familiarity with modern AI/ML infrastructure and cloud platforms (AWS, GCP, Azure) • Strong understanding of RAG architectures and implementation

Nice-to-haves • Experience with popular LLM frameworks (Langchain, LlamaIndex, Transformers) • Knowledge of prompt engineering and chain-of-thought techniques • Experience with containerization and microservices architecture • Background in Reinforcement Learning • Contributions to open-source AI projects • Experience with ML ops and model deployment pipelines

Requirements

  • Bachelor's or Master's degree in Computer Science, or related technical field
  • 3+ years of experience in Machine Learning/AI engineering
  • Strong programming skills in Python and experience with ML frameworks (PyTorch, TensorFlow)
  • Practical experience with LLM deployments and fine-tuning
  • Experience with vector databases and embedding models
  • Familiarity with modern AI/ML infrastructure and cloud platforms (AWS, GCP, Azure)
  • Strong understanding of RAG architectures and implementation
  • Experience with popular LLM frameworks (Langchain, LlamaIndex, Transformers)
  • Knowledge of prompt engineering and chain-of-thought techniques
  • Experience with containerization and microservices architecture
  • Background in Reinforcement Learning
  • Contributions to open-source AI projects
  • Experience with ML ops and model deployment pipelines

Responsibilities

  • The ideal candidate will be responsible for designing, developing, and deploying cutting-edge generative AI solutions, with a focus on Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and Intelligent agent systems
  • Design and implement RAG-based solutions to enhance LLM capabilities with external knowledge sources
  • Develop and optimize LLM fine-tuning strategies for specific use cases and domain adaptation
  • Create robust evaluation frameworks for measuring and improving model performance
  • Build and maintain agentic workflows for autonomous AI systems
  • Collaborate with cross-functional teams to identify opportunities and implement AI solutions

Skills

Machine LearningAIPythonPyTorchTensorFlowLLMRAGIntelligent agent systemsCloud platformsAWSGCPAzure

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free