Advanced

C++ and AI Inference Engines

Build a production AI inference engine from scratch in C++. Covers operator implementation, memory layout, quantisation (INT8/FP16), batching strategies, CUDA kernel integration, and deployment as a shared library consumed by Python runtimes.

What's included

  • Operator implementation
  • Quantisation (INT8/FP16)
  • CUDA integration
  • Batching & scheduling
  • Python bindings
  • Certificate of completion
  • Lifetime access to materials
  • Priority support

Duration

14 hours

Students

720+

Rating

⭐ 4.9

30-day guarantee
Certificate included
Lifetime access

Practical, rigorous, and immediately applicable. Velmio courses are genuinely different.

Mark T.

Head of Data, NHS Digital

Order Summary

C++ and AI Inference Engines

$799 one-time
Total due: $799
256-bit SSL · Payments secured by MoonPay

By enrolling, you agree to our Terms and Privacy Policy.