AI Inference Engineer | GPU-Scale Rust/Python | Equity
2 months ago
Our stack is Rust, Python, CUDA, and CuTe DSL. You will develop our internal Rust-based inference server to address the pain points of Python and keep up with rapidly growing traffic. You should be comfortable working across languages and layers: Rust for the serving runtime, Python for model code, and CUDA/CuTe DSL for kernels.