J&
Member of Technical Staff: AI Systems Engineer
Jack & Jill
London · On-site Full-time Mid Level 3w ago
About the role
Job Description
As a core member of the technical staff, you will architect and optimize high-throughput inference systems for large-scale generative models. You will tackle deep technical challenges in distributed systems and hardware-software co-design, directly impacting the latency and scalability of production-grade AI services for a global developer ecosystem.
Location
London, UK
Why this role is remarkable
- Work at the intersection of systems engineering and cutting-edge machine learning research to define the future of model deployment.
- Join an elite technical team backed by top-tier venture capital firms during a period of rapid infrastructure scaling.
- Influence the foundational layer of AI applications by building systems that make massive models commercially viable and performant.
What You Will Do
- Design and implement low-level optimizations for model inference to maximize GPU utilization and minimize token latency.
- Build robust, distributed systems capable of serving frontier models with high reliability and cost-efficiency.
- Collaborate with research teams to integrate novel architectures into production-ready inference engines and serving stacks.
The ideal candidate
- Demonstrates deep expertise in systems programming and optimizing performance-critical software in C++ or Rust.
- Has a proven track record of working with deep learning frameworks and low-level GPU acceleration libraries.
- Possesses a strong understanding of distributed systems and the mechanics of modern large language model architectures.
Skills
C++Rustgenerative modelsGPUlarge language modelsmachine learningmodel inferenceserving systems
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free