Advanced

C++ and AI Inference Engines

Build a production AI inference engine from scratch in C++. The course covers operator implementation, memory layout, quantisation (INT8/FP16), batching strategies, CUDA kernel integration, and deployment as a shared library consumed by Python runtimes.

What's included

  • Operator implementation
  • Quantisation (INT8/FP16)
  • CUDA integration
  • Batching & scheduling
  • Python bindings
  • Certificate of completion
  • Lifetime access to materials
  • Priority support

Duration: 14 hours
Students: 180+
Rating: ⭐ 4.6 (14)

After enrolment you will receive access instructions by email within 24 hours. Course materials are delivered via our online learning portal.

Practical, rigorous, and immediately applicable. Velmio courses are genuinely different.

Mark T., Head of Data

Order Summary

C++ and AI Inference Engines

$799 one-time
Total due: $799
256-bit SSL · Payments secured by MoonPay

By enrolling you agree to our Terms & Conditions and Privacy Policy.