Skip to content
mimi

Search Evaluation Specialist - Expert | Upto $30/hr Hourly

Mercor

Remote · Canada Part-time $10 – $30/hr Yesterday

About the role

About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position

Search Generalist Expert

Type

Contract

Compensation

$10–$30/hour

Location

Remote

Role Responsibilities

  • Evaluate AI-generated search responses for factual accuracy, helpfulness, clarity, completeness, and overall quality.
  • Assess whether models use search appropriately and whether search queries are well-formed and effective.
  • Compare model responses side by side and provide concise, defensible rationales.
  • Write and refine prompts, golden answers, rubric criteria, and edge cases for search-related evaluations.
  • Apply project guidelines consistently across ambiguous, multi-step, and real-world search tasks.
  • Identify recurring failure modes and escalate unclear cases or rubric gaps to project leads.

Qualifications

Must-Have

  • Excellent written English and strong online research skills.
  • Strong judgment when synthesizing information from multiple sources.
  • Ability to distinguish factual accuracy from fluency, confidence, or style.
  • High attention to detail and comfort following structured guidelines.
  • Reliable, self-directed, and responsive in an asynchronous remote environment.

Preferred

  • Experience in search quality, fact-checking, content evaluation, trust and safety, annotation, QA, or prompt/rubric writing.
  • Familiarity with search evaluation concepts such as factuality, helpfulness, severity, side-by-side comparisons, or tool-use assessment.
  • Experience working with LLM evaluation workflows or human data projects.
  • Multilingual skills are a plus.
  • Bachelor’s degree preferred; advanced degree or strong professional background is a plus.

Application Process (Takes 20–30 mins to complete)

  • Upload resume
  • AI interview based on your resume
  • Submit form

Resources & Support

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

Requirements

  • Excellent written English and strong online research skills.
  • Strong judgment when synthesizing information from multiple sources.
  • Ability to distinguish factual accuracy from fluency, confidence, or style.
  • High attention to detail and comfort following structured guidelines.
  • Reliable, self-directed, and responsive in an asynchronous remote environment.

Responsibilities

  • Evaluate AI-generated search responses for factual accuracy, helpfulness, clarity, completeness, and overall quality.
  • Assess whether models use search appropriately and whether search queries are well-formed and effective.
  • Compare model responses side by side and provide concise, defensible rationales.
  • Write and refine prompts, golden answers, rubric criteria, and edge cases for search-related evaluations.
  • Apply project guidelines consistently across ambiguous, multi-step, and real-world search tasks.
  • Identify recurring failure modes and escalate unclear cases or rubric gaps to project leads.

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free