Distinguished Software Architect - Deep Learning and HPC Communications
hace 15 días
Expert in following areas: HPC, parallel programming models (MPI, SHMEM), at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC), computer and system architecture, GPU architecture and CUDA. Our GPU communication libraries are crucial for scaling Deep Learning and HPC applicati