Machine Learning Engineer - Generative AI
Qualcomm
About the role
About the position
We are seeking an experienced Machine Learning Engineer specializing in Generative AI to join our core AI team. The ideal candidate will be responsible for designing, developing, and deploying cutting-edge generative AI solutions, with a focus on Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and Intelligent agent systems.
Responsibilities • Design and implement RAG-based solutions to enhance LLM capabilities with external knowledge sources • Develop and optimize LLM fine-tuning strategies for specific use cases and domain adaptation • Create robust evaluation frameworks for measuring and improving model performance • Build and maintain agentic workflows for autonomous AI systems • Collaborate with cross-functional teams to identify opportunities and implement AI solutions
Requirements • Bachelor's or Master's degree in Computer Science, or related technical field • 3+ years of experience in Machine Learning/AI engineering • Strong programming skills in Python and experience with ML frameworks (PyTorch, TensorFlow) • Practical experience with LLM deployments and fine-tuning • Experience with vector databases and embedding models • Familiarity with modern AI/ML infrastructure and cloud platforms (AWS, GCP, Azure) • Strong understanding of RAG architectures and implementation
Nice-to-haves • Experience with popular LLM frameworks (Langchain, LlamaIndex, Transformers) • Knowledge of prompt engineering and chain-of-thought techniques • Experience with containerization and microservices architecture • Background in Reinforcement Learning • Contributions to open-source AI projects • Experience with ML ops and model deployment pipelines
Requirements
- Bachelor's or Master's degree in Computer Science, or related technical field
- 3+ years of experience in Machine Learning/AI engineering
- Strong programming skills in Python and experience with ML frameworks (PyTorch, TensorFlow)
- Practical experience with LLM deployments and fine-tuning
- Experience with vector databases and embedding models
- Familiarity with modern AI/ML infrastructure and cloud platforms (AWS, GCP, Azure)
- Strong understanding of RAG architectures and implementation
- Experience with popular LLM frameworks (Langchain, LlamaIndex, Transformers)
- Knowledge of prompt engineering and chain-of-thought techniques
- Experience with containerization and microservices architecture
- Background in Reinforcement Learning
- Contributions to open-source AI projects
- Experience with ML ops and model deployment pipelines
Responsibilities
- The ideal candidate will be responsible for designing, developing, and deploying cutting-edge generative AI solutions, with a focus on Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and Intelligent agent systems
- Design and implement RAG-based solutions to enhance LLM capabilities with external knowledge sources
- Develop and optimize LLM fine-tuning strategies for specific use cases and domain adaptation
- Create robust evaluation frameworks for measuring and improving model performance
- Build and maintain agentic workflows for autonomous AI systems
- Collaborate with cross-functional teams to identify opportunities and implement AI solutions
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free