Skip to content
mimi

Engineering Manager, ML Inference

Cerebras

Canada · On-site Full-time Lead Today

About the role

About

Lead advanced ML runtime development as an Engineering Manager specializing in inference systems. Enhance AI implementation speed and efficiency by directing scalable solutions for multimodal models.

This leadership role within the Inference ML Engineering team involves designing and deploying systems that handle high-demand AI tasks efficiently. With a blend of technical strength and management expertise, you will enable high-quality implementations of complex models while fostering a culture of innovation and teamwork essential for success in AI endeavors.

Key Responsibilities

  • Design and evolve ML inference runtime architecture
  • Build and lead a talented infrastructure engineering team
  • Drive execution of large-scale ML projects across teams
  • Ensure high performance and reliability standards in releases
  • Collaborate on continuous improvement strategies for AI workloads

Requirements

  • Minimum 8 years in software engineering with ML focus
  • 2+ years of experience in engineering management
  • Proficient in Python and C++ for challenging systems
  • Demonstrated ability in large-scale inference and cloud operations
  • Preferred knowledge in ML runtime optimization techniques

Shape the future of AI by driving state-of-the-art inferencing capabilities in a transformative environment, ensuring your team delivers unmatched performance across applications.

Skills

C++Python

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free