Engineering Manager, ML Inference
Cerebras
About the role
About
Lead advanced ML runtime development as an Engineering Manager specializing in inference systems. Enhance AI implementation speed and efficiency by directing scalable solutions for multimodal models.
This leadership role within the Inference ML Engineering team involves designing and deploying systems that handle high-demand AI tasks efficiently. With a blend of technical strength and management expertise, you will enable high-quality implementations of complex models while fostering a culture of innovation and teamwork essential for success in AI endeavors.
Key Responsibilities
- Design and evolve ML inference runtime architecture
- Build and lead a talented infrastructure engineering team
- Drive execution of large-scale ML projects across teams
- Ensure high performance and reliability standards in releases
- Collaborate on continuous improvement strategies for AI workloads
Requirements
- Minimum 8 years in software engineering with ML focus
- 2+ years of experience in engineering management
- Proficient in Python and C++ for challenging systems
- Demonstrated ability in large-scale inference and cloud operations
- Preferred knowledge in ML runtime optimization techniques
Shape the future of AI by driving state-of-the-art inferencing capabilities in a transformative environment, ensuring your team delivers unmatched performance across applications.
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free