Senior Networking Solution Test Engineer – AI Cluster Debugging
5 days ago
Collaborate closely with development teams to debug NCCL, RoCE/RDMA and related networking components using logs, code inspection and targeted experiments. Strong Linux networking and debugging skills (for example perf, tcpdump, ethtool, iproute2). Strong knowledge of AI networking libraries (such a