AI Model Evaluator

Mercor

Canada · On-site · Contract · $50 – $75/hr · 3mo ago

About the role

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position Details

Type: Contract
Compensation: $50–$75/hour
Commitment: 20 hours/week

Role Responsibilities

Write realistic prompts that reflect how professionals and consumers seek domain-specific guidance.
Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness.
Identify fabricated claims, incorrect references, or misleading reasoning across model outputs.
Score and rank multiple model responses using structured rubrics that cover several evaluation dimensions.
Provide written justifications with specific evidence for each evaluation.

Qualifications

Must-Have

Master’s degree or higher in Finance, Accounting, or a relevant professional field.
Professional experience applying domain expertise in a practitioner or advisory capacity.
Familiarity with industry-specific standards, regulations, or clinical guidelines.
Strong written communication and critical reasoning skills.

Application Process (Takes 20–30 mins to complete)

Submit your resume to begin.
Complete the Model Response Evaluation assessment.

Resources & Support

For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
For any help or support, reach out to: support@mercor.com

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
