Skip to content
mimi

Lead Training Engineer for Large Scale AI Models Development

Firstprinciples

Remote · Canada Full-time Lead Today

About the role

About

Drive the future of physics research as a Training Engineer. Oversee end-to-end pre-training of large language models, leveraging your exceptional engineering skills in a remote role.
This position requires a technical expert with over 7 years of experience in large-scale model training. You will design pre-training experiments, optimize data pipelines, and develop distributed training strategies. Collaborate with teams to push the boundaries of fundamental physics research through innovative AI applications.
Contribute to groundbreaking advancements in physics through the development of intelligent systems and state-of-the-art training methods.

Key Responsibilities

  • Design large-scale pre-training experiments for diverse architectures
  • Optimize learning strategies, stabilizing training at scale
  • Build robust data pipelines for high-throughput processing
  • Operate and manage distributed training infrastructure
  • Debug complex issues and manage multi-node GPU jobs

Requirements

  • 7-12+ years in model training and optimization
  • Proficient in PyTorch; experience with distributed frameworks
  • Strong background in applied mathematics
  • Excellent cross-functional collaboration and communication skills
  • Passion for physics and commitment to impactful research

#J-18808-Ljbffr

Requirements

  • Proficient in PyTorch
  • experience with distributed frameworks
  • Strong background in applied mathematics
  • Excellent cross-functional collaboration and communication skills
  • Passion for physics and commitment to impactful research

Responsibilities

  • Design large-scale pre-training experiments for diverse architectures
  • Optimize learning strategies, stabilizing training at scale
  • Build robust data pipelines for high-throughput processing
  • Operate and manage distributed training infrastructure
  • Debug complex issues and manage multi-node GPU jobs

Skills

PyTorch

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free