Senior Staff Engineer
4 days ago
Cambridge
About Graphcore

Graphcore is one of the world’s leading innovators in Artificial Intelligence compute.

It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.

As part of the SoftBank Group, Graphcore is a member of an elite family of companies responsible for some of the world’s most transformative technologies. Together, they share a bold vision: to enable Artificial Super Intelligence and ensure its benefits are accessible to everyone.

Graphcore’s teams are drawn from diverse backgrounds and bring a broad range of skills and perspectives. A melting pot of AI research specialists, silicon designers, software engineers and systems architects, Graphcore enjoys a culture of continuous learning and constant innovation.

Job Summary

Applicants for this role should have prior experience in software development, as the primary focus will be testing a Machine Learning (ML) software development stack. The position requires a strong interest in reliability and performance testing, as well as a genuine passion for debugging and problem-solving. Ideal candidates will be self-motivated, enthusiastic, and able to work independently, while also being effective communicators who can collaborate with team members. They should also take initiative and get things done quickly and efficiently. This opportunity is particularly well suited to experienced software engineers seeking hands-on exposure to the exciting fields of Artificial Intelligence (AI), ML, and AI accelerator hardware.

The Team

The Software QA team is composed of highly skilled and dedicated software engineers with a strong passion for automation and data analysis. They work extensively with industry-standard machine learning frameworks and models, and contribute to upstream open-source projects.
Operating in a dynamic and fast-paced environment, the team consistently delivers high-quality results, ensuring exceptional customer experiences and impactful advancements in the product.

Responsibilities and Duties

• Create representative ML workloads and gain insights from their execution (numerical accuracy, performance analysis and benchmarking).
• Build Continuous Integration/Delivery (CI/CD) against hardware simulators/emulators and physical hardware.
• Take responsibility for designing and writing tests for projects within the Software team and ML frameworks.
• Work closely with all Software development teams, supporting a culture of quality, security and maintainability.
• Guide the team in industry best practices and improve the team's code quality and technical capabilities.

• At least 6 years of experience in production-quality software development
• Bachelor's, Master's or PhD degree, or equivalent experience, in Computer Science, Maths, Machine Learning, Data Science, or a related field
• Strong proficiency in Python
• Proficiency in C++
• Experience with Linux operating systems
• Exposure to machine learning frameworks such as PyTorch, JAX, Triton, TensorFlow
• Experience with distributed workload management systems such as Kubernetes, VLLM, Keras or MLOps pipelines
• Experience with hardware architectures or compilers
• Experience with test automation in a CI/CD environment
• Exposure to data analytic methodologies