OwLite

OwLite is the easiest AI compression toolkit.

Why OwLite?

Solve AI compression with a toolkit

Just a Few Lines of Code Away

Simply add a few lines of OwLite code into your existing PyTorch training scripts to unlock the full potential of your AI with model compression.
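As an illustration of how little code this typically takes, here is a minimal sketch of an existing PyTorch script instrumented with OwLite. The project and baseline names are placeholders, and the function names and signatures shown (owlite.init, owl.convert, owl.export, owl.benchmark) are assumed for this sketch; check the official OwLite documentation for the exact current API.

```python
import torch
import owlite  # OwLite SDK
from torchvision.models import resnet18

# Assumed usage: exact function names and signatures may differ.
owl = owlite.init(project="my_project", baseline="resnet18_baseline")

# Your existing PyTorch model stays exactly as it is.
model = resnet18(weights="DEFAULT").eval()

# Wrap the model so OwLite can trace and compress it.
model = owl.convert(model, torch.randn(1, 3, 224, 224))

# ... run calibration or training with the converted model as usual ...

# Export the compressed model and benchmark it as a TensorRT engine.
owl.export(model)
owl.benchmark()
```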

Start with Powerful Recommended Settings

No expertise in model compression? No problem.

With SqueezeBits' specialized algorithms, you can easily apply advanced compression techniques to your models. Start with just one click and achieve expert-level compression effortlessly.

Worried About Security & Privacy?

Securing your team’s data is our foremost priority. Your training data and model weights remain entirely on your own server. OwLite uses only the ONNX model’s structural information throughout the entire process.

Never Compromise on Performance

Fully customizable Quantization-Aware Training, only on OwLite

Performance Recovery via Quantization-Aware Training

Our unique compiler technology enables seamless back-and-forth conversion between the PyTorch model and the TensorRT engine, making Quantization-Aware Training incredibly easy.

With QAT enabled, performance recovery for lightweight models becomes much easier.
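As a rough sketch of what that recovery looks like in practice: once the converted model carries quantization operators, QAT is simply a standard PyTorch fine-tuning loop, so existing training code can be reused largely unchanged. The converted_model, train_loader, and hyperparameters below are placeholders, not OwLite-specific API.

```python
import torch

# Hypothetical fine-tuning loop for Quantization-Aware Training (QAT).
# `converted_model` stands for a model that already contains quantization
# nodes (e.g., the output of the conversion step above); gradients flow
# through the quantizers, so ordinary PyTorch training recovers accuracy.
def qat_finetune(converted_model, train_loader, epochs=2, lr=1e-5):
    criterion = torch.nn.CrossEntropyLoss()
    optimizer = torch.optim.SGD(converted_model.parameters(), lr=lr, momentum=0.9)

    converted_model.train()
    for _ in range(epochs):
        for images, labels in train_loader:
            optimizer.zero_grad()
            loss = criterion(converted_model(images), labels)
            loss.backward()  # gradients pass through the quantization nodes
            optimizer.step()
    return converted_model
```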

Advanced matching across PyTorch, ONNX, and TensorRT

Coverage

OwLite provides the following coverage:

All Types of Models

Developed with PyTorch and deployed using TensorRT

Ready to Support You
No Stress to Compress

Made for Seamless Integration into Existing Code