Advanced

C++ for ML Infrastructure

The course for engineers building the infrastructure that ML runs on. Covers distributed training communication (NCCL, MPI), custom CUDA kernels for attention and matmul, model parallelism strategies, and building a parameter server in C++. Used by engineers at ML infrastructure teams.

What's included

  • NCCL & MPI
  • Custom CUDA kernels
  • Attention implementation
  • Model parallelism
  • Parameter servers
  • Certificate of completion
  • Lifetime access to materials
  • Priority support

Duration

20 hours

Students

480+

Rating

⭐ 5.0

30-day guarantee
Certificate included
Lifetime access

Practical, rigorous, and immediately applicable. Velmio courses are genuinely different.

Mark T.

Head of Data, NHS Digital

Order Summary

C++ for ML Infrastructure

$1,199 one-time
Total due$1,199
256-bit SSL · Payments secured by MoonPay

By enrolling you agree to our Terms and Privacy Policy.