Skip to content
mimi

Principal Staff Software Developer – AI/ML Performance Validation

Advanced Micro Devices, Inc

Markham · On-site Full-time Lead 1mo ago

About the role

About AMD

At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

The Role

A senior technical contributor that drives end-to-end delivery of software solutions, directly contributing to, and coordinating implementation and optimization across multiple teams for inference and training of machine learning models. The position will involve interfacing with software and hardware engineering teams and AMD partners to plan, develop and optimize use cases. This is an exciting opportunity to work on the cutting edge of GPU Computing for Machine Learning.

The Person

You are a subject matter expert and strong technical contributor with machine learning and GPU programming experience. You excel as part of a team where communication and team skills are highly valued.

Key Responsibilities

  • Work within and coordinate with a small team of Validation architects on defining comprehensive system-level test plans
  • Collaborate with Business Unit, Development, and other Validation teams on defining dependencies and proper compatibility test suites
  • Use modern tools and instruments in day-day operations and educate other team members

Preferred Experience

  • Relevant experience in Machine Learning and/or GPU programming
  • Experience in using and managing Datacenters in cluster environment.
  • Knowledge of GPU and CPU architecture, and experience in GPGPU programming technologies
  • Familiar with AMD ROCM stack or competition products
  • Excellent communication and collaboration skills

Academic Credentials

  • Bachelor’s or Master’s degree in a related discipline

Skills

GPGPUGPUMachine LearningROCm

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free