About the Role
About The Job
Mercor
connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include
Benchmark
,
General Catalyst
,
Peter Thiel
,
Adam D'Angelo
,
Larry Summers
, and
Jack Dorsey
.
Position:
General Chat Behavior Evaluator
Type:
Full-time or Part-time Contract Work
Compensation:
$36/hour
Location:
Geography restricted to Taiwan, Malaysia, USA
Role Responsibilities
- Evaluate LLM-generated responses for effectiveness in answering user queries.
- Conduct fact-checking using trusted public sources and external tools.
- Generate high-quality human evaluation data by annotating response strengths, areas for improvement, and factual inaccuracies.
- Assess reasoning quality, clarity, tone, and completeness of responses.
- Ensure model responses align with expected conversational behavior and system guidelines.
Ready to Apply?
Submit your application today and take the next step in your career journey with Mercor.
Apply Now