Deprecated: Function get_magic_quotes_gpc() is deprecated in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 99

Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 619

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1169

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176
8000 Releases Β· ollama/ollama Β· GitHub
Nothing Special   »   [go: up one dir, main page]

Skip to content

Releases: ollama/ollama

v0.12.6

15 Oct 23:02
1813ff8

Choose a tag to compare

What's Changed

  • Ollama's app now supports searching when running DeepSeek-V3.1, Qwen3 and other models that support tool calling.
  • Flash attention is now enabled by default for Gemma 3, improving performance and memory utilization
  • Fixed issue where Ollama would hang while generating responses
  • Fixed issue where qwen3-coder would act in raw mode when using /api/generate or ollama run qwen3-coder <prompt>
  • Fixed qwen3-embedding providing invalid results
  • Ollama will now evict models correctly when num_gpu is set
  • Fixed issue where tool_index with a value of 0 would not be sent to the model

Experimental Vulkan Support

Experimental support for Vulkan is now available when you build locally from source. This will enable additional GPUs from AMD, and Intel which are not currently supported by Ollama. To build locally, install the Vulkan SDK and set VULKAN_SDK in your environment, then follow the developer instructions. In a future release, Vulkan support will be included in the binary release as well. Please file issues if you run into any problems.

New Contributors

Full Changelog: v0.12.5...v0.12.6

v0.12.5

10 Oct 16:30
3d32249

Choose a tag to compare

What's Changed

  • Thinking models now support structured outputs when using the /api/chat API
  • Ollama's app will now wait until Ollama is running to allow for a conversation to be started
  • Fixed issue where "think": false would show an error instead of being silently ignored
  • Fixed deepseek-r1 output issues
  • macOS 12 Monterey and macOS 13 Ventura are no longer supported
  • AMD gfx900 and gfx906 (MI50, MI60, etc) GPUs are no longer supported via ROCm. We're working to support these GPUs via Vulkan in a future release.

New Contributors

Full Changelog: v0.12.4...v0.12.5-rc0

v0.12.4

03 Oct 16:38
15e3611

Choose a tag to compare

v0.12.4 Pre-release
Pre-release

What's Changed

  • Flash attention is now enabled by default for Qwen 3 and Qwen 3 Coder
  • Fixed minor memory estimation issues when scheduling models on NVIDIA GPUs
  • Fixed an issue where keep_alive in the API would accept different values for the /api/chat and /api/generate endpoints
  • Fixed tool calling rendering with qwen3-coder
  • More reliable and accurate VRAM detection
  • OLLAMA_FLASH_ATTENTION can now be overridden to 0 for models that have flash attention enabled by default
  • macOS 12 Monterey and macOS 13 Ventura are no longer supported
  • Fixed crash where templates were not correctly defined
  • Fix memory calculations on NVIDIA iGPUs
  • AMD gfx900 and gfx906 (MI50, MI60, etc) GPUs are no longer supported via ROCm. We're working to support these GPUs via Vulkan in a future release.

New Contributors

Full Changelog: v0.12.3...v0.12.4-rc3

v0.12.3

26 Sep 05:08
b04e46d

Choose a tag to compare

New models

  • DeepSeek-V3.1-Terminus: DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode. It delivers more stable & reliable outputs across benchmarks compared to the previous version:

    Run on Ollama's cloud:

    ollama run deepseek-v3.1:671b-cloud
    

    Run locally (requires 500GB+ of VRAM)

    ollama run deepseek-v3.1
    
  • Kimi-K2-Instruct-0905: Kimi K2-Instruct-0905 is the latest, most capable version of Kimi K2. It is a state-of-the-art mixture-of-experts (MoE) language model, featuring 32 billion activated parameters and a total of 1 trillion parameters.

    ollama run kimi-k2:1t-cloud
    

What's Changed

  • Fixed issue where tool calls provided as stringified JSON would not be parsed correctly
  • ollama push will now provide a URL to follow to sign in
  • Fixed issues where qwen3-coder would output unicode characters incorrectly
  • Fix issue where loading a model with /load would crash

New Contributors

Full Changelog: v0.12.2...v0.12.3

v0.12.2

24 Sep 21:19
2e74254

Choose a tag to compare

Web search

ollama_web_search

A new web search API is now available in Ollama. Ollama provides a generous free tier of web searches for individuals to use, and higher rate limits are available via Ollama’s cloud. This web search capability can augment models with the latest information from the web to reduce hallucinations and improve accuracy.

What's Changed

  • Models with Qwen3's architecture including MoE now run in Ollama's new engine
  • Fixed issue where built-in tools for gpt-oss were not being rendered correctly
  • Support multi-regex pretokenizers in Ollama's new engine
  • Ollama's new engine can now load tensors by matching a prefix or suffix

Full Changelog: v0.12.1...v0.12.2

v0.12.1

21 Sep 23:19
64883e3

Choose a tag to compare

New models

What's Changed

  • Qwen3-Coder now supports tool calling
  • Ollama's app will now longer show "connection lost" in error when connecting to cloud models
  • Fixed issue where Gemma3 QAT models would not output correct tokens
  • Fix issue where & characters in Qwen3-Coder would not be parsed correctly when function calling
  • Fixed issues where ollama signin would not work properly on Linux

Full Changelog: v0.12.0...v0.12.1

v0.12.0

18 Sep 17:29
9f3a37f

Choose a tag to compare

Cloud models

Ollama_cloud_background

Cloud models are now available in preview, allowing you to run a group of larger models with fast, datacenter-grade hardware.

To run a cloud model, use:

ollama run qwen3-coder:480b-cloud

What's Changed

  • Models with the Bert architecture now run on Ollama's engine
  • Models with the Qwen 3 architecture now run on Ollama's engine
  • Fix issue where older NVIDIA GPUs would not be detected if newer drivers were installed
  • Fixed issue where models would not be imported correctly with ollama create
  • Ollama will skip parsing the initial <think> if provided in the prompt for /api/generate by @rick-github

New Contributors

Full Changelog: v0.11.11...v0.12.0

v0.11.11

11 Sep 21:02

Choose a tag to compare

What's Changed

  • Support for CUDA 13
  • Improved memory usage when using gpt-oss in Ollama's app
  • Better scrolling better in Ollama's app when submitting long prompts
  • Cmd +/- will now zoom and shrink text in Ollama's app
  • Assistant messages can now by copied in Ollama's app
  • Fixed error that would occur when attempting to import satefensor files by @rick-github in #12176
  • Improved memory estimates for hybrid and recurrent models by @gabe-l-hart in #12186
  • Fixed error that would occur when when batch size was greater than context length
  • Flash attention & KV cache quantization validation fixes by @jessegross in #12231
  • Add dimensions field to embed requests by @mxyng in #12242
  • Enable new memory estimates in Ollama's new engine by default by @jessegross in #12252
  • Ollama will no longer load split vision models in the Ollama engine by @jessegross in #12241

New Contributors

Full Changelog: v0.11.10...v0.11.11

v0.11.10

04 Sep 17:27
5994e8e

Choose a tag to compare

New models

  • EmbeddingGemma a new open embedding model that delivers best-in-class performance for its size

What's Changed

  • Support for EmbeddingGemma

Full Changelog: v0.11.9...v0.11.10

v0.11.9

02 Sep 20:14
fb92b61

Choose a tag to compare

What's Changed

  • Improved performance via overlapping GPU and CPU computations
  • Fixed issues where unrecognized AMD GPU would cause an error
  • Reduce crashes due to unhandled errors in some Mac and Linux installations of Ollama

New Contributors

Full Changelog: v0.11.8...v0.11.9-rc0

0