1.12 Senior AI Software Engineer — Edge Model Optimization & Deployment
8 days ago
San Francisco
Apply model compression techniques such as quantization, pruning, distillation, and weight sharing to achieve efficient real-time inference under strict constraints on power, bandwidth, and latency. Ensure the reliability, robustness, and stability of deployed models operating in challenging, res...