v0.12.2
Web search
A new web search API is now available in Ollama. Ollama provides a generous free tier of web searches for individuals to use, and higher rate limits are available via Ollama’s cloud. This web search capability can augment models with the latest information from the web to reduce hallucinations and improve accuracy.
What's Changed
- Models with Qwen3's architecture including MoE now run in Ollama's new engine
- Fixed issue where built-in tools for gpt-oss were not being rendered correctly
- Support multi-regex pretokenizers in Ollama's new engine
- Ollama's new engine can now load tensors by matching a prefix or suffix
Full Changelog: v0.12.1...v0.12.2