Advanced

C++ for ML Infrastructure

The course for engineers building the infrastructure that ML runs on. Covers distributed training communication (NCCL, MPI), custom CUDA kernels for attention and matmul, model parallelism strategies, and building a parameter server in C++. Used by engineers at ML infrastructure teams.

What's included

NCCL & MPI
Custom CUDA kernels
Attention implementation
Model parallelism
Parameter servers
Certificate of completion
Lifetime access to materials
Priority support

Duration

20 hours

Students

480+

Rating

⭐ 4.9 (9)

Certificate included

Lifetime access

Secure checkout

After enrolment you will receive access instructions by email within 24 hours. Course materials are delivered via our online learning portal.

“Practical, rigorous, and immediately applicable. Velmio courses are genuinely different.”

Mark T.

Head of Data

Order Summary

C++ for ML Infrastructure

$1,199 one-time

Total due$1,199

By enrolling you agree to our Terms & Conditions and Privacy Policy.