Software Engineer - ML Performance
2 months ago
New York
Deep dive into the underlying codebases of TensorRT, PyTorch, TensorRT-LLM, vLLM, SGLang, CUDA, and other libraries to debug ML performance issues. Proficiency in enhancing the performance of software systems, particularly in the context of large language models (LLMs). Patreon, Writer, and Robust In...