Machine Learning Engineer, Anonymization
Mercor
About the role
About Mercor
Mercor is defining the future of work. We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development.
Our vast talent network trains frontier AI models in the same way teachers teach students: by sharing knowledge, experience, and context that can't be captured in code alone. Today, more than 30,000 experts in our network collectively earn over $2 million a day.
Mercor is creating a new category of work where expertise powers AI advancement. Achieving this requires an ambitious, fast-paced and deeply committed team. You’ll work alongside researchers, operators, and AI companies at the forefront of shaping the systems that are redefining society.
Mercor is a profitable Series C company valued at $10 billion. We work in-person five days a week in our San Francisco, NYC, or London offices.
About the Role
As a Machine Learning Engineer focused on Anonymization at Mercor, you will be critical in designing and implementing our industry-best data privacy pipeline. You'll operate at the intersection of advanced ML techniques, sensitive data handling, and robust backend engineering. Your primary focus will be shipping production systems that employ state-of-the-art anonymization and de-identification methods, maximizing the realism and utility of our vast data network for model training while maintaining the highest standards of regulatory compliance. This role requires bringing deep statistical and modeling rigor to challenging problems in privacy-preserving data access.
What You’ll Do
- Own the anonymization platform for the market leader in human data, building systems that enable proprietary data access at enterprise scale without compromising trust.
- Design and ship production ML systems for anonymization and de-identification across multiple data modalities (text, documents, images, audio, video).
- Build and maintain the core backend infrastructure and APIs to securely process and serve anonymized data.
- Benchmark our anonymization pipeline against state-of-the-art-models, industry best practices, and regulatory standards . Define quality metrics and evaluation techniques that generalize to unseen data, continuously running experiments to improve both privacy guarantees and data utility.
- Collaborate cross-functionally with vendors, account stakeholders, Legal, Security, and Engineering teams to translate compliance requirements into robust, model-driven solutions.
- Act as the subject matter expert on data anonymization, flexing between applied ML, complex data pipeline engineering, and driving architectural decisions for data privacy.
What We’re Looking For
- Strong backend engineering skills (ex. Python/Django or similar) plus a solid foundation in applied ML and statistics.
- Proven experience shipping production systems or ML-driven products end-to-end.
- High ownership and comfort operating in ambiguous, fast-changing environments.
- Demonstrated knowledge of industry best practices and common frameworks for data privacy and security.
Why Mercor
- Impact: Your work powers how the world’s leading AI labs train and test their models.
- Learning: Get early insights into frontier model capabilities months before the market.
- Growth: Work on both infrastructure and research-adjacent projects with fast paths to ownership.
- Team: Work on complex, unsolved AI and ML problems alongside a small, high-caliber team of engineers.
Benefits
- Generous equity grant vested over 4 years
- A $10K housing bonus (if you live within 0.5 miles of our office)
- A $1.5K monthly stipend for meals
- Free Equinox membership
- Health insurance
Requirements
- Strong backend engineering skills (ex. Python/Django or similar) plus a solid foundation in applied ML and statistics.
- Proven experience shipping production systems or ML-driven products end-to-end.
- High ownership and comfort operating in ambiguous, fast-changing environments.
- Demonstrated knowledge of industry best practices and common frameworks for data privacy and security.
Responsibilities
- Own the anonymization platform for the market leader in human data, building systems that enable proprietary data access at enterprise scale without compromising trust.
- Design and ship production ML systems for anonymization and de-identification across multiple data modalities (text, documents, images, audio, video).
- Build and maintain the core backend infrastructure and APIs to securely process and serve anonymized data.
- Benchmark our anonymization pipeline against state-of-the-art-models, industry best practices, and regulatory standards .
- Define quality metrics and evaluation techniques that generalize to unseen data, continuously running experiments to improve both privacy guarantees and data utility.
- Collaborate cross-functionally with vendors, account stakeholders, Legal, Security, and Engineering teams to translate compliance requirements into robust, model-driven solutions.
- Act as the subject matter expert on data anonymization, flexing between applied ML, complex data pipeline engineering, and driving architectural decisions for data privacy.
Benefits
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free