Model Update2026-06-02Hugging Face Blog

JetBrains Introduces Mellum2: 12B MoE Model

JetBrains has launched Mellum2, a 12-billion-parameter Mixture-of-Experts (MoE) model designed to deliver efficient and scalable AI performance for enterprise and developer applications. This release marks a significant step forward in making powerful AI more accessible while optimizing computational resources. The Mixture-of-Experts architecture is key to Mellum2's efficiency. Unlike traditional dense models that activate all parameters for every input, MoE models selectively activate only the most relevant "expert" subnetworks for each task. This means Mellum2 can achieve performance comparable to much larger models while using significantly less computational power and memory. With 12 billion total parameters, Mellum2 strikes a balance between capability and efficiency. It is large enough to handle complex tasks like code generation, natural language understanding, and data analysis, yet optimized enough to run on more modest hardware than many competing models. This makes it particularly attractive for organizations that want to deploy AI without massive infrastructure investments. JetBrains, known primarily for its developer tools like IntelliJ IDEA and PyCharm, is positioning Mellum2 as a natural extension of its developer-focused ecosystem. The model is designed to integrate seamlessly with JetBrains IDEs, providing intelligent code completion, bug detection, and documentation generation directly within the development environment. For enterprise applications, Mellum2 offers strong performance on benchmarks while maintaining cost-effectiveness. The MoE architecture allows organizations to scale AI usage without proportionally scaling compute costs, making it suitable for high-volume production deployments. The launch of Mellum2 reflects a broader industry trend toward more efficient AI architectures. As the limitations of simply scaling up model sizes become apparent, approaches like mixture-of-experts offer a path to better performance without exponential increases in computational requirements. JetBrains has also emphasized the model's transparency and controllability, providing developers with tools to understand and fine-tune its behavior. This focus on developer experience aligns with the company's long-standing commitment to making powerful tools accessible and usable for software professionals.

Related news