Software Architect - Deep Learning, HPC

Job Description

Job Title: Senior Software Architect - Deep Learning and HPC Communications. Location: Remote – Germany, Switzerland, Poland, UK

The work involves co-designing hardware and software solutions to accelerate data transfer between GPUs in massive clusters; The position offers the chance to shape next-generation computing platforms that drive breakthroughs in artificial intelligence and scientific research.

Duties:

  • Analyze communication performance across GPU clusters and identify system bottlenecks.
  • Design and develop next-generation communication technologies that enhance AI and HPC workload performance;
  • Collaborate on co-design initiatives involving GPU hardware, networking, and software architecture.
  • Build proofs-of-concept, run simulations, and conduct quantitative modeling for scalability testing;
  • Use large-scale simulation environments to evaluate performance of GPU clusters consisting of hundreds of thousands of GPUs.
  • Partner with cross-functional teams across different time zones to refine designs and align system-level performance goals.

Qualifications:

  • Master’s or PhD in Computer Science, Computer Engineering, or related discipline.
  • Over five years of experience in software architecture or HPC system design.
  • Proficiency in C/C++ development, debugging, and optimization;
  • Strong understanding of operating systems, network architecture, and communication scaling for AI and HPC.
  • Experience with parallel programming models such as MPI or SHMEM and at least one communication runtime like NCCL, NVSHMEM, OpenSHMEM, UCX, or UCC.
  • Deep knowledge of Linux systems and distributed computing;
  • Strong communication skills for working in a global collaborative environment.

Preferred:

  • Expertise in CUDA programming and NVIDIA GPUs.
  • Familiarity with deep learning frameworks such as PyTorch or TensorFlow;
  • Understanding of high-performance networks like NVLink, InfiniBand, or RoCE;
  • Experience optimizing deep learning parallelisms and integrating them with communication subsystems.
  • Proven record of innovation and teamwork in multi-disciplinary projects.

Benefits and Instructions:

  • Competitive salary and full benefits package.
  • Opportunity to work on world-class research and innovation projects in AI and HPC.
  • Flexible and inclusive remote work environment;
  • Encourages professional growth, collaboration, and diversity;
  • To apply, upload your English resume and complete the online application.

JOB TYPE

Full-time

COMPENSATION

€95k +

SHARE

Important: To avoid application spam, include this statement at the end of your resume or application: 'I found this position on ( Quantum Jobs List ) .' Applications without it will be disqualified.

Back to all Quantum jobs
arrow mark