Senior Research Engineer
Cohere
Remote · France · Contract · Senior · Posted 1mo ago
About the role
Evaluation is critical to making progress in scaling intelligence. As models continue to become superhuman in many real-world use cases, we must continue to develop new techniques to accurately measure our models’ performance on frontier capabilities.
In this role, you are responsible for creating next-generation evaluation methods and scalable infrastructure to measure LLM progress.
Responsibilities
- Develop evaluation benchmarks, datasets, and environments for measuring the bleeding edge of model capabilities
- Conduct research to push the state-of-the-art in LLM evaluation methods, including training LLM judges; improving evaluation efficiency; and scalably building high-quality datasets
- Build scalable tools for investigating and understanding evaluation results that are used by all members of technical staff at Cohere, as well as leadership and our CEO
- Learn from and work with the best researchers and engineers in the field
Benefits
- Six weeks’ paid vacation
- Equity / stock options
- RRSP, 401(k), and Pension Scheme contributions
- Coverage for 100% of your insurance premiums across health, dental, vision, and travel
- Additional coverage for accessing mental health providers/services
- Six months of fully paid parental leave, including adoption and surrogacy
- Financial support for egg freezing and IVF in Canada and the UK
- A monthly fitness and wellness allowance
- Globally dispersed company that supports a remote work culture
- A $2,000 annual education benefit for professional development
- A weekly stipend for meals when working remotely and catered lunch when working from one of our global offices
- A monthly arts and culture allowance
- A monthly quality time allowance