ML Performance Engineer

Job Description

Job Title: Sr. ML Performance Engineer, AWS Neuron, Annapurna Labs
Location: Toronto, Canada

You will work across the compiler, runtime, and framework layers to ensure AWS accelerators deliver top performance for customers. The role offers the opportunity to contribute to high-performance computing, distributed systems, and deep learning optimization at scale.

Duties:

  • Analyze and optimize performance of large-scale machine learning models across frameworks, compilers, and runtimes.
  • Conduct in-depth profiling to identify and resolve performance bottlenecks in ML workloads.
  • Work directly with customers to optimize models for AWS accelerators, addressing specific technical requirements.
  • Design and implement compiler optimizations to automate performance improvements.
  • Collaborate with engineering teams to enhance the AWS Neuron SDK’s efficiency and scalability.
  • Participate in design reviews, code discussions, and cross-functional decision-making.
  • Develop high-impact solutions that support global ML workloads in a fast-paced, startup-like environment.

Qualifications:

  • 5+ years of professional software development experience.
  • Strong background in system-level performance optimization and distributed systems.
  • Proficiency in one or more programming languages such as Python, C++, or Java.
  • Experience leading software architecture, design, and scaling projects.
  • Knowledge of ML frameworks such as PyTorch, TensorFlow, or JAX.
  • Experience with the full software development life cycle, including testing and operations.
  • Bachelor’s degree in Computer Science or equivalent experience.
  • Background in FPGA programming or hardware acceleration is a plus.

Benefits and Instructions:

  • Competitive salary and benefits package, including equity and flexible work options.
  • Career growth opportunities through mentorship and internal learning programs.
  • Inclusive team culture that values diversity and work-life balance.
  • Access to cutting-edge AI hardware and software infrastructure.

JOB TYPE

Full-time

COMPENSATION

$115k +

Important: To avoid application spam, include this statement at the end of your resume or application: 'I found this position on ( Quantum Jobs List ) .' Applications without it will be disqualified.
