Machine Learning Engineer - Model Evaluations, Public Sector
16 hours ago
New York
Develop and maintain automated evaluation pipelines for ML models across functional, performance, robustness, and safety metrics, including LLM-judge–based evaluations. Conduct comparative analyses of model architectures, training procedures, and evaluation outcomes. Cloud experience (AWS, GCP) an