Bridge the gap between Python ML models and production C++ runtimes. You'll integrate libtorch (C++ API for PyTorch), write custom ONNX Runtime operators, and build a C++ inference server that serves predictions at sub-millisecond latency.
Duration
10 hours
Students
1,100+
Rating
⭐ 4.9
“Practical, rigorous, and immediately applicable. Velmio courses are genuinely different.”
Mark T.
Head of Data, NHS Digital
Order Summary
High-Performance C++ for AI Systems
By enrolling you agree to our Terms and Privacy Policy.