Bridge the gap between Python ML models and production C++ runtimes. You'll integrate LibTorch (the C++ API for PyTorch), write custom ONNX Runtime operators, and build a C++ inference server that serves predictions at sub-millisecond latency.
Duration: 10 hours
Students: 270+
Rating: ⭐ 4.8 (19)
After enrolment you will receive access instructions by email within 24 hours. Course materials are delivered via our online learning portal.
“Practical, rigorous, and immediately applicable. Velmio courses are genuinely different.”
Mark T., Head of Data
Order Summary
High-Performance C++ for AI Systems
By enrolling you agree to our Terms & Conditions and Privacy Policy.