Remote Senior Software Engineer – LLM Evaluation (US-based)
hace 24 horas
Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them. Design verification mechanisms that can automatically verify a solution