Hands-on course materials for ML engineers to master extreme model quantization and on-device LLM deployment: PyTorch, llama.cpp, Android (educational)
python arm course pytorch notebooks quantization model-compression qat edge-ai quantization-aware-training llm llama-cpp genai hardware-aware-optimization
Updated Dec 8, 2025 - Jupyter Notebook