
Edge AI with ONNX Runtime: Running Models On-Device

6 min read

Last year, I deployed an AI model to a mobile device. The first attempt failed: the model was too large, inference was too slow, and battery drain was unacceptable. After optimizing 15+ models for edge deployment using ONNX Runtime, I've learned what works. Here's the complete guide to running AI models on-device with ONNX Runtime.
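
To give a sense of what on-device inference with ONNX Runtime looks like before diving into the details, here is a minimal sketch using the Python API. The model file `model.onnx` and the input shape are placeholders, not the model from this post, and the CPU provider is used as the safe default available on every build.

```python
# Minimal sketch of on-device style inference with ONNX Runtime.
# Assumes a placeholder model "model.onnx" with a single image-like input.
import numpy as np
import onnxruntime as ort

# Create an inference session; available execution providers differ across
# edge builds, so the CPU provider is a reliable fallback.
session = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])

# Query the model's expected input name and feed a dummy tensor.
input_name = session.get_inputs()[0].name
dummy_input = np.random.rand(1, 3, 224, 224).astype(np.float32)  # placeholder shape

# Run one inference; passing None returns all model outputs.
outputs = session.run(None, {input_name: dummy_input})
print(outputs[0].shape)
```

The same session-based pattern carries over to the C, C++, Java, and Objective-C bindings that ONNX Runtime ships for mobile targets.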