Latest
Lecture #15 provides an in-depth conceptual explanation of the tensor layout algebra in NVIDIA's CUTLASS library, focusing on how it handles shapes, strides, and tiling operations for efficient GPU tensor computations.
GPU MODE Lecture 14: Practitioners Guide to Triton
Lecture #14 provides a practical introduction to writing and optimizing GPU kernels using Triton, featuring comparisons with CUDA and hands-on examples like tensor copying, image processing, and matrix multiplication.
Quantizing timm Image Classifiers with ONNX Runtime and TensorRT in Ubuntu
Learn how to quantize timm image classification models with ONNX Runtime and TensorRT for int8 inference.
Quantizing YOLOX with ONNX Runtime and TensorRT in Ubuntu
Learn how to quantize YOLOX models with ONNX Runtime and TensorRT for int8 inference.
GPU MODE Lecture 13: Ring Attention
Lecture #13 explores ring attention, a distributed computing technique for training long-context transformers, discussing its motivation and underlying mechanisms.
Tutorials
Step-by-step tutorials for setting up essential tools and platforms, designed to provide a solid foundation for a diverse range of projects.
Fine-Tuning Image Classifiers with PyTorch and the timm library for Beginners
Learn how to fine-tune image classification models with PyTorch and the timm library by creating a hand gesture recognizer in this easy-to-follow guide for beginners.
Training YOLOX Models for Real-Time Object Detection in PyTorch
Learn how to train YOLOX models for real-time object detection in PyTorch by creating a hand gesture detection model.
ONNX Runtime in Unity
Tutorials for integrating ONNX Runtime into the Unity game engine.
TensorFlow.js in Unity
In this tutorial series, we explore how to create TensorFlow.js plugins for the Unity game engine.
Notes
My notes from various books.
GPU MODE Lecture Notes
My notes from the GPU MODE (formerly CUDA MODE) reading group lectures run by Andreas Kopf and Mark Saroufim.
Education
My notes from resources on education.
History
My notes from resources on history.
Mastering LLMs Course Notes
My notes from the course Mastering LLMs: A Conference For Developers & Data Scientists by Hamel Husain and Dan Becker.
About Me
I’m Christian Mills, a deep learning consultant specializing in practical AI implementations. I help clients leverage cutting-edge AI technologies to solve real-world problems.
I combine hands-on experience with technical expertise and clear communication to guide projects from conception to deployment.
My Expertise
- Custom AI solution development
- Automated synthetic data pipelines
- Real-time object detection and tracking systems
- LLM integration and fine-tuning
- AI strategy consulting
Interested in working together? Fill out my Quick AI Project Assessment form or learn more about me.