Skip to content
mimi

Large Machine Learning Model Optimization Engineer

Apple

Seattle · On-site Full-time Lead $140k – $258k/yr 4w ago

About the role

About the Team

Our team is an applied research and engineering team responsible for developing real-time on-device Language, Computer Vision, and Machine Perception technologies across Apple products. We focus on technology research and development to deliver Apple quality, state-of-the-art experiences. Our team prides itself on innovating through the full stack, and partnering with HW, SW and ML teams to influence the sensor and silicon roadmap that brings our vision to life.

We are directly responsible for the on-device optimization and deployment of the Apple Intelligence LLM and diffusion models.

Role Overview

As a Machine Learning Engineer, you will have the opportunity to be at the forefront of technological advancements and contribute to the successful shipping and delivery of Apple intelligence. You will be responsible for implementing and delivering various optimization techniques that improve the performance of large language and diffusion models on devices. Additionally, you will collaborate with a diverse range of organizations within Apple. Your innovations will significantly impact the entire ML model lifecycle of Apple intelligence.

Description

We’re looking for strong Machine Learning software engineers/leaders to drive the development of the on-device Apple Intelligence LLM and diffusion model developments. This includes defining and leading the execution of model compression, distillation, and integrating to the full Apple Intelligence user experiences. We expect you to have strong, efficient ML model development experiences and a passion for shipping machine learning models on device. We also encourage publishing novel research at top ML conferences.

Preferred Qualifications

  • Familiar with model compression algorithms including quantization, pruning, distillations, and experience on optimizing large diffusion models or language models
  • MS or PhD degree in Computer Science, or equivalent industry research experience
  • Experience with hardware architecture, software & hardware co-design
  • Leadership experience in driving large‑scale projects in the industry
  • Strong communication skills; phenomenal work ethic and collaboration
  • ML compiler
  • High performance kernel implementation
  • Distributed inference

Minimum Qualifications

  • Software engineering skills in Python
  • Experience in developing large computer vision and machine learning models, particularly on the hardware‑aware model optimizations
  • BS and a minimum of 3 years relevant industry experience

Equal Opportunity

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.

Pay & Benefits

  • Base pay range for this role: $139,500 – $258,100 (determined by skills, qualifications, experience, and location)
  • Opportunity to become an Apple shareholder through discretionary employee stock programs, including restricted stock unit awards and the Employee Stock Purchase Plan (discounted stock purchase)
  • Comprehensive medical and dental coverage
  • Retirement benefits
  • Discounted Apple products and free services
  • Reimbursement for certain educational expenses, including tuition, for formal education related to advancing your career at Apple
  • Potential eligibility for discretionary bonuses or commission payments, as well as relocation assistance

Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Requirements

  • Software engineering skills in Python
  • Experience in developing large computer vision and machine learning models, particularly on the hardware-aware model optimizations
  • Familiar with model compression algorithms including quantization, pruning, distillations, and experience on optimizing large diffusion models or language models
  • Experience with hardware architecture, software & hardware co-design
  • ML compiler
  • High performance kernel implementation
  • Distributed inference

Responsibilities

  • Implementing and delivering various optimization techniques that improve the performance of large language and diffusion models on devices.
  • Collaborating with a diverse range of organizations within Apple.
  • Defining and leading the execution of model compression, distillation, and integrating to the full Apple Intelligence user experiences.

Benefits

medical coveragedental coverageretirement benefitsdiscounted productsfree servicestuition reimbursementdiscretionary bonusescommission paymentsrelocationemployee stock programsrestricted stock unit awardsEmployee Stock Purchase Plan

Skills

Pythoncomputer visiondiffusion modelslarge language modelsmachine learningpruningquantization

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free