• Develop and optimize AI models tailored to Graphcore’s specialized hardware as part of the Applied AI team.
• Advance AI compute technology by scaling performance-critical models across large-scale systems.
• Act as a proxy for customers to ensure hardware and software work seamlessly with the AI ecosystem.
• Drive innovation in efficient compute, model scaling, and distributed training for multiple modalities.
📋 Job Requirements
• Hold a Bachelor/Master/PhD in Machine Learning, Computer Science, Maths, or a related field.
• Maintain proficiency in deep learning frameworks such as PyTorch or JAX.
• Demonstrate strong software development skills in Python or C++.
• Exhibit expertise in distributed training or inference of ML models across 64+ accelerators.
• Demonstrate a deep understanding of performance bottlenecks and effective strategies to overcome them.
• Possess the ability to design, execute, and report results from complex ML experiments.
🌟 Nice-to-have
• Experience with MLOps for Kubernetes-based clusters or building production systems with LLMs.
• Expertise in efficient computing based on low-precision arithmetic.
• Experience writing C++, Triton, or CUDA kernels for performance optimization.
• Familiarity with HPC systems and networking technologies like Infiniband, NVLink, or RoCE.
• Proven track record of contributing to open-source projects or publishing research papers.
🎯 Responsibilities
• Implement latest ML models and optimize them for performance, scaling to thousands of accelerators.
• Test and evaluate new internal software releases and provide critical feedback to engineering teams.
• Benchmark models and ML techniques to identify and resolve performance bottlenecks.
• Collaborate with Research and Product teams to define and build next-generation AI hardware.
• Engage with the AI community to stay current with the latest global developments.
• Conduct code reviews and provide necessary fixes to ensure software efficiency and reliability.
About Graphcore
😃 What Graphcore offers
• Access a flexible working environment and a generous annual leave policy.
• Receive private medical insurance, a health cash plan, and a dental plan.
• Benefit from a company pension with matching up to 5%.
• Utilize life assurance, income protection, and an employee assistance programme.
• Access a generous parental leave policy and mental wellbeing support.
• Enjoy healthy food, snacks, and a barista bar at the central office.
💖 What makes Graphcore unique
Graphcore, a SoftBank Group company, is building the future of AI compute through a complete stack of silicon, software, and infrastructure. Backed by significant long-term investment, the team solves the toughest problems in semiconductor and AI technology. Graphcore is committed to building an inclusive environment where the brightest minds solve challenges at a global scale.