Mid-Senior AI Engineer
7 days ago
Experience with evaluation and monitoring frameworks for LLM systems, such as RAGAS, OpenEvals, DeepEval, LangSmith, OpenTelemetry, or LLM-as-a-Judge evaluation. Build and optimise retrieval pipelines, including embeddings, chunking, reranking, memory, and vector search. Design, build, and