Full-time

AI Evaluation & Data Engineer for LLM Metrics

Posted by Net2Source (N2S) • región centro, jalisco, Mexico

📍 región centro, jalisco 🕒 May 26, 2026

Apply for this Job Similar Jobs

About the Role

We are looking for AI Evaluation & Data Engineering Specialists  to design, curate, and operationalize datasets and evaluation frameworks for AI product performance assessment. 
This role involves working with large language models (LLMs), human raters, and automation tools to measure model accuracy, correctness, and usability. 
Key Responsibilities Develop and apply data labeling and scoring guidelines  based on Google’s evaluation framework. 
Implement LLM-judge calibration workflows  to align automated and human evaluations. 
Perform error analysis, drift detection , and regression testing of AI model outputs. 
Collaborate with automation engineers to integrate datasets into evaluation pipelines. 
Support rater training , inter-rater reliability checks, and dataset validation reviews. 
Manage data quality assurance  and documentation for contributions to Google-maintained repos...
                

Job Details

Location región centro, jalisco
Job Type Full-time
Category Bases de datos, analítica y BI
Posted May 26, 2026
Deadline July 05, 2026

Ready to Apply?

Submit your application today and take the next step in your career journey with Net2Source (N2S).

Apply Now