Skip to content
mimi

Agentic AI Engineer (Harness & Systems Focus)

Systematic hedge fund

US · On-site Full-time Senior $300k – $500k/yr 1mo ago

About the role

About the Firm

We are a highly reputable mid-sized investment management firm applying advanced AI to improve how investment decisions are researched and executed. Our work centers on building systems that perform reliably in noisy, dynamic, real-world environments—not controlled benchmarks.

The Opportunity

Most AI systems break down outside clean demos. This role focuses on the harder layer underneath:

How do we engineer, validate, and stress-test agentic systems so they behave reliably over long horizons in uncertain environments?

You’ll focus on harness engineering—the infrastructure, evaluation systems, and runtime scaffolding that make autonomous agents actually usable in production. This is less about isolated models and more about building the systems that prove they work.

What You’ll Do

  • Design and build evaluation harnesses for long-running agent workflows (hours to days)
  • Develop infrastructure to simulate complex, noisy environments for agent testing
  • Create metrics and validation frameworks where success is ambiguous or delayed
  • Engineer agent runtime systems: orchestration, memory, retries, and failure handling
  • Build tools for observability, debugging, and introspection of agent behavior
  • Partner with researchers to translate prototypes into reliable, testable systems
  • Develop multi-agent test environments to evaluate coordination and failure modes
  • Continuously stress-test systems against edge cases, drift, and adversarial conditions

What We’re Looking For

  • Senior/staff-level experience in AI/ML systems, infrastructure, or applied research
  • Strong background in systems engineering for AI, not just modeling
  • Experience with evaluation harnesses, benchmarking, or simulation frameworks
  • Familiarity with long-running, stateful agent systems and their failure modes
  • Ability to design robust testing strategies for non-deterministic systems
  • Track record of delivering in ambiguous, high-impact environments
  • Deep understanding of how models behave in production (not just in theory)
  • Comfort owning loosely defined problems end-to-end

Why This Role

  • Work on the hardest unsolved layer of agentic AI: making it reliable
  • Define how systems are evaluated—not just how they’re built
  • Build infrastructure that directly impacts real-world decision-making
  • Operate in a focused, low-bureaucracy environment with rapid iteration
  • Tackle problems where correctness isn’t obvious—and that’s the point

Compensation

Competitive with senior/staff-level roles at leading technology companies, with performance-based upside.

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free