Publié le 01/07/26
Using the Windows Package Manager is the quickest way to trigger the setup.
Follow the step-by-step instructions below.
The loader auto-caches the model archive (several GBs included).
Your resources are automatically evaluated to lock in the premium configuration.
olmOCR-2-7B-1025-FP8 delivers stateâofâtheâart optical character recognition with a massive 7âbillion parameter base, enabling unprecedented accuracy on complex document layouts. Built on the FP8 quantization scheme, it achieves a balanced tradeâoff between inference speed and memory footprint, making it suitable for both cloud and edge deployments. The architecture incorporates a refined vision encoder that processes highâresolution scans up to 1025âŻĂâŻ1025 pixels, preserving fine glyphs and contextual spacing. A dedicated language model head leverages multilingual tokenizers, supporting over 100 languages while maintaining a low error rate on cursive and printed text. Benchmark results show a 3.2âŻ% absolute gain over the previous generation on the PubLayNet dataset, and the model is openly released under an permissive license for research and commercial use.
| Model | olmOCR-2-7B-1025-FP8 |
| Parameters | 7âŻB |
| Input Resolution | 1025âŻĂâŻ1025 |
| Quantization | FP8 |
| Supported Languages | 100+ |
| License | Permissive (Apache 2.0) |