Machine Learning Engineer - Model Evaluations, Public Sector
10 days ago
San Francisco
We build evaluation frameworks that ensure these models operate reliably, safely, and effectively under real-world constraints. Develop and maintain automated evaluation pipelines for ML models across functional, performance, robustness, and safety metrics, including LLM-judge-based evaluations. Con