About the Role
Task Description:
We are seeking high-level JavaScript and TypeScript experts for a part-time to full-time role designing and executing model-breaking evaluation tasks for state-of-the-art LLMs. Unlike standard development work, this role's objective is to create complex, non-verifiable prompts that challenge a model's natural-language reasoning, technical judgment, and architectural depth.
Each task involves:
Prompt Engineering: Designing hard-difficulty coding problems that cause a model to fail.
Rubric Creation: Developing a strictly binary, reference-free evaluation framework.
Technical Explanation: Drafting a high-quality Reference Response and providing proof of the model's logical failure.
Skills Required:
Advanced JS/TS Proficiency: Deep understanding of asynchronous patterns, memory management, and complex type systems in TypeScript.
Technical Reasoning: Ability to explain the why behind code behavior and identify subtle logical f...
Ready to Apply?
Submit your application today and take the next step in your career journey with Nivox.