All jobs · Machine Learning Engineer jobs

Deep Learning Model Optimization Engineer

Jobgether

La Broquerie · flexible Full-time 3mo ago

About the role

About

Become a Deep Learning Model Optimization Engineer and elevate AI capabilities through advanced model training techniques. This role focuses on optimizing pre-training systems within a remote-capable, research-intensive environment.

You will be instrumental in next-generation AI model development, working hands-on with distributed GPU infrastructures. Your focus will be on designing and optimizing efficient training processes for large-scale models, ensuring maximum performance across thousands of NVIDIA GPUs. Collaborate with leading researchers and engineers to drive forward-thinking solutions in AI.

Key Responsibilities:

Conduct large-scale pre-training for AI models
Design and innovate model architectures for better performance
Analyze and refine experimental methodologies
Identify and solve performance bottlenecks
Collaborate to build production-ready training systems

Requirements:

PhD or equivalent in AI, ML, or related field
Experience with large-scale LLM training on GPUs
Deep understanding of transformer architectures
Proficiency in debugging and optimizing AI systems
Strong collaborative skills in research-focused teams

Successfully impact the AI landscape by optimizing training pipelines and enhancing model efficiencies in partnership with expert researchers.

Skills

AILLMMachine LearningNVIDIA GPUsTransformer Architectures

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free

Deep Learning Model Optimization Engineer

About the role

About

Key Responsibilities:

Requirements:

Skills

Similar roles

AOSP Solution Architect (m/w/d)

Manager, Expansion Sales - SMB

Data Scientist

Don't send a generic resume