We are hiring an AI Model Evaluation Specialist to rigorously test, benchmark, and evaluate our internal AI tools. This role focuses on ensuring model quality, consistency, and alignment with real-world user behavior.
Responsibilities
• Build evaluation datasets and testing frameworks.
• Measure model accuracy, drift, and failure modes.
• Work with research teams to refine model outputs.
• Document findings and improvements.
Qualifications
Background in AI/ML, statistics, or data science with experience evaluating large-scale models.