The latest release of Ollama, v0.1.16, adds support for Mixtral and other models built on the Mixture of Experts (MoE) architecture. The update introduces new models including Mixtral, a high-quality mixture-of-experts model, and Dolphin Mixtral, an uncensored model optimized for coding tasks. Note that these models require at least 48GB of memory. The release also fixes an issue with the `load_duration` field in the `/api/generate` response. For full details, visit the GitHub release page.
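
Since the release touches the `load_duration` field, here is a minimal sketch of calling the local `/api/generate` endpoint and reading that timing metadata. It assumes the Ollama server is running on its default port (11434), that the `mixtral` model has already been pulled, and uses an illustrative prompt; it is not taken from the release notes themselves.

```python
import requests

# Call the local Ollama /api/generate endpoint with streaming disabled so a
# single JSON object (including timing fields) is returned.
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "mixtral",  # assumes `ollama pull mixtral` has been run
        "prompt": "Explain mixture-of-experts models in one sentence.",
        "stream": False,
    },
)
resp.raise_for_status()
data = resp.json()

print(data["response"])
# Durations are reported in nanoseconds; load_duration is the field whose
# reporting this release fixes.
print("load_duration (ns):", data.get("load_duration"))
print("total_duration (ns):", data.get("total_duration"))
```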