Engineering Manager, ML Inference

Cerebras

Canada · On-site Full-time Lead 3mo ago

About the role

About

Lead advanced ML runtime development as an Engineering Manager specializing in inference systems. Enhance AI implementation speed and efficiency by directing scalable solutions for multimodal models.

This leadership role within the Inference ML Engineering team involves designing and deploying systems that handle high-demand AI tasks efficiently. With a blend of technical strength and management expertise, you will enable high-quality implementations of complex models while fostering a culture of innovation and teamwork essential for success in AI endeavors.

Key Responsibilities

Design and evolve ML inference runtime architecture
Build and lead a talented infrastructure engineering team
Drive execution of large-scale ML projects across teams
Ensure high performance and reliability standards in releases
Collaborate on continuous improvement strategies for AI workloads

Requirements

Minimum 8 years in software engineering with ML focus
2+ years of experience in engineering management
Proficient in Python and C++ for challenging systems
Demonstrated ability in large-scale inference and cloud operations
Preferred knowledge in ML runtime optimization techniques

Shape the future of AI by driving state-of-the-art inferencing capabilities in a transformative environment, ensuring your team delivers unmatched performance across applications.

Skills

C++Python

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free

Engineering Manager, ML Inference

About the role

About

Key Responsibilities

Requirements

Skills

Similar roles

backend developer

Software Architect

Senior Android Platform Developer

Don't send a generic resume