-
1
Laguna M.1
Poolside
Empower your coding with unmatched reasoning and efficiency.
Laguna M.1 is Poolside's flagship model for agentic coding, built in-house to support software development workflows. It is a Mixture of Experts model with 225 billion total parameters, of which 23 billion are activated, and it was trained on 30 trillion tokens using 6,144 NVIDIA H200 GPUs. Poolside developed Laguna M.1 from the ground up, using proprietary data, its own training codebase, and asynchronous on-policy reinforcement learning within its agent framework, all aimed at agentic coding. The model is designed to perform best inside Poolside's coding agent, where it reasons through programming tasks, calls tools, edits code, runs tests, and sustains long autonomous development sessions. It targets developers and teams working on complex coding problems, and it offers stronger reasoning, architectural understanding, terminal use, and multi-step execution than lighter models.
-
2
Hy3
Tencent
Unleash intelligent reasoning with cutting-edge context capabilities.
Hy3 preview is the latest and most capable model in Tencent's Hy series. It is a Mixture-of-Experts model with 295 billion total parameters, of which 21 billion are activated, plus a 3.8 billion parameter MTP layer, and it supports a context window of up to 256,000 tokens. It is built on Tencent Hy's newly upgraded infrastructure, which is intended to improve practical performance in complex reasoning, instruction following, in-context learning, coding, and inference. The model combines fast and deliberate modes of thinking, so it can answer simple questions directly while working through complex mathematical, programming, and logical problems in detail. It is designed for long-context understanding, accurate instruction following, tool use, and agent workflows, and it is evaluated both on traditional benchmarks and in realistic business and development scenarios.
-
3
CodeGen
Salesforce
Revolutionize coding with powerful, efficient, open-source synthesis.
CodeGen is an open-source family of models for program synthesis, trained on TPU-v4. It is positioned as a competitor to OpenAI Codex for code generation.
-
4
StarCoder
BigCode
Transforming coding challenges into seamless solutions with innovation.
StarCoder and StarCoderBase are Large Language Models for code (Code LLMs) trained on permissively licensed data from GitHub, covering more than 80 programming languages as well as Git commits, GitHub issues, and Jupyter notebooks. Similar to LLaMA, the base model has roughly 15 billion parameters and was trained on 1 trillion tokens. StarCoderBase was then fine-tuned on 35 billion Python tokens to produce StarCoder.
BigCode's evaluations found that StarCoderBase outperforms other open Code LLMs on popular programming benchmarks and matches or exceeds closed models such as OpenAI's code-cushman-001, the original Codex model that powered early versions of GitHub Copilot. With a context length of more than 8,000 tokens, the StarCoder models could process more input than any other open LLM at the time of release, which opens the door to a range of new applications. For example, prompting the StarCoder models with a series of dialogues turns them into technical assistants that can help with a wide range of programming questions.
-
5
Llama 2
Meta
Revolutionizing AI collaboration with powerful, open-source language models.
Llama 2 is the next generation of Meta's open-source large language model. The release includes model weights and starting code for pretrained and fine-tuned Llama language models ranging from 7 billion to 70 billion parameters. The pretrained models were trained on 2 trillion tokens and have double the context length of Llama 1, and the fine-tuned models were trained on over 1 million human annotations. Llama 2 outperforms other open-source language models on many external benchmarks, including reasoning, coding, proficiency, and knowledge tests. It was pretrained on publicly available online data sources; the fine-tuned model, Llama-2-chat, uses publicly available instruction datasets along with the human annotations mentioned above. The project is backed by a broad set of global supporters of Meta's open approach to AI, including companies that gave early feedback and plan to build with Llama 2.
-
6
Code Llama
Meta
Transforming coding challenges into seamless solutions for everyone.
Code Llama is a large language model that generates code from text prompts and is among the strongest publicly available models for coding tasks. It can speed up workflows for experienced developers and lower the barrier for people learning to program, serving as both a productivity tool and an educational aid for writing more efficient, well-documented software. It can generate code, and natural language about code, from either code or natural language prompts. Code Llama is free for research and commercial use, is built on top of Llama 2, and comes in three versions: Code Llama, the foundational code model; Code Llama - Python, specialized for Python; and Code Llama - Instruct, fine-tuned to understand natural language instructions.
-
7
Qwen3.6
Alibaba
Unlock powerful AI solutions for coding and reasoning.
Qwen3.6 is a large language model from Alibaba that offers advanced reasoning, coding, and multimodal capabilities. It builds on the Qwen3.5 series with an emphasis on stability, efficiency, and real-world usability. The model accepts text, images, and video, and it can reason over documents, visuals, and structured data together. It is built for agentic use, carrying out multi-step tasks with more autonomy inside workflows, and it is particularly optimized for coding, handling engineering tasks at the repository level rather than only individual functions. It uses a mixture-of-experts architecture in which only a subset of its billions of parameters is activated for each inference, which improves efficiency. Qwen3.6 is available in both open-weight and proprietary versions and can be integrated into enterprise systems, APIs, and cloud environments for production use, with applications ranging from software development to data analysis and automation.
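The idea that only a subset of parameters is activated per inference can be illustrated with a toy top-k router. This is a minimal sketch of generic mixture-of-experts routing, not Qwen3.6's actual implementation; the expert count, the router scores, the value of k, and the function names are all hypothetical.

```python
import math

def softmax(scores):
    """Convert raw router scores into probabilities."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def route_top_k(router_scores, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    probs = softmax(router_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)  # renormalize over the selected experts
    return [(i, probs[i] / norm) for i in top]

def moe_forward(x, experts, router_scores, k=2):
    """Weighted sum of the outputs of the selected experts only."""
    return sum(w * experts[i](x) for i, w in route_top_k(router_scores, k))

# Hypothetical toy experts: each one just scales its input.
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]
scores = [0.1, 2.0, 0.3, 2.0]  # made-up router scores for one token

selected = route_top_k(scores, k=2)
output = moe_forward(1.0, experts, scores, k=2)
```

Here only two of the four experts run for the input, which is the source of the efficiency gain: compute scales with the active experts rather than with the total parameter count.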
-
8
PaLM 2
Google
Revolutionizing AI with advanced reasoning and ethical practices.
PaLM 2 marks a significant advancement in the realm of large language models, furthering Google's legacy of leading innovations in machine learning and ethical AI initiatives.
The model performs well on advanced reasoning tasks, including coding, mathematics, classification, question answering, multilingual translation, and natural language generation, and it outperforms earlier models, including its predecessor, PaLM. These gains come from compute-optimal scaling, an improved dataset mixture, and improvements to the model architecture.
PaLM 2 also reflects Google’s approach to responsible AI: it was evaluated rigorously for potential harms and biases, as well as for its capabilities and downstream uses in research and commercial applications. It serves as the basis for other models such as Med-PaLM 2 and Sec-PaLM, and it powers AI features and tools at Google, such as Bard and the PaLM API.
-
9
Amazon Nova
Amazon
Revolutionary foundation models for unmatched intelligence and performance.
Amazon Nova is a generation of foundation models (FMs) that offers advanced intelligence and strong price performance, available exclusively through Amazon Bedrock.
The series includes Amazon Nova Micro, Amazon Nova Lite, and Amazon Nova Pro, which accept text, image, or video inputs and generate text output, with different trade-offs among capability, accuracy, speed, and cost.
Amazon Nova Micro is a text-only model that delivers quick responses at a very low price.
Amazon Nova Lite is a low-cost multimodal model that is fast at processing image, video, and text inputs.
Amazon Nova Pro is a highly capable multimodal model that offers the best combination of accuracy, speed, and cost for a wide range of tasks, such as video summarization, question answering, and mathematical problem solving.
This range lets users pick the model that fits their needs, from simple text analysis to complex multimodal interactions.
-
10
MiMo-V2.5-Pro
Xiaomi Technology
Revolutionizing AI with unparalleled efficiency and advanced reasoning.
Xiaomi MiMo-V2.5-Pro is a cutting-edge open-source AI model built to handle complex reasoning, coding, and long-horizon tasks with high efficiency. It features a Mixture-of-Experts architecture with over one trillion total parameters and a large active parameter set for optimized performance. The model supports an extended context window of up to one million tokens, enabling it to process large amounts of information in a single workflow. It is designed for advanced agentic capabilities, allowing it to autonomously complete multi-step tasks over extended periods. MiMo-V2.5-Pro has demonstrated strong results in benchmarks related to software engineering, reasoning, and general AI performance. It is capable of building complete applications, optimizing engineering systems, and solving complex technical challenges. The model uses hybrid attention mechanisms to balance performance and efficiency across long contexts. It is also optimized for token efficiency, reducing resource usage while maintaining high-quality outputs. The model can integrate with development tools and frameworks to support real-world use cases. Xiaomi has open-sourced MiMo-V2.5-Pro, providing developers with access to its architecture, weights, and deployment tools. This allows organizations to customize and scale the model for their specific needs. Its ability to handle long workflows makes it suitable for tasks that require sustained reasoning and coordination. By combining scalability, efficiency, and advanced intelligence, MiMo-V2.5-Pro represents a significant advancement in open-source AI technology.
-
11
MiMo-V2.5
Xiaomi Technology
Revolutionizing AI with unmatched multimodal understanding and efficiency.
Xiaomi MiMo-V2.5 is a powerful open-source AI model designed to deliver advanced agentic capabilities alongside native multimodal understanding. It can process and reason across text, images, and audio within a unified system, enabling more complex and realistic interactions. The model is built using a sparse Mixture-of-Experts architecture with hundreds of billions of parameters, allowing it to scale efficiently while maintaining strong performance. It supports an extended context window of up to one million tokens, making it suitable for long-horizon tasks and detailed workflows. MiMo-V2.5 incorporates dedicated visual and audio encoders that enhance its ability to interpret and analyze multimodal inputs. It is capable of performing a wide range of tasks, including coding, reasoning, document analysis, and multimedia understanding. The model demonstrates strong benchmark performance across coding, reasoning, and multimodal evaluation tests. It is optimized for token efficiency, reducing computational cost while maintaining high-quality outputs. MiMo-V2.5 is designed to integrate with development tools and frameworks for real-world use cases. Xiaomi has released the model as open source, providing access to its weights, tokenizer, and architecture. This allows developers to customize and deploy the model for specific applications. Its ability to combine perception and reasoning makes it suitable for advanced AI workflows. By unifying multimodality and agentic intelligence, MiMo-V2.5 represents a significant advancement in open-source AI technology.
-
12
North Mini Code
Cohere
Empower your coding with compact, efficient agentic capabilities.
North Mini Code is Cohere's first agentic coding model built for developers and the first release in its next generation of models. It is a compact, open-source model aimed at the independent developer community, offering strong software development capabilities without requiring extensive hardware. It uses a mixture-of-experts architecture with 30 billion total parameters, 3 billion of them active. The model is optimized for code generation, agentic software engineering, and terminal operations, with a 256K context length and a maximum generation length of 64K. It is designed around real-world developer practices, including managing sub-agents, architecture mapping, and code review, and it supports coding agents working on complex software problems.
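As a rough illustration of what the parameter figures above imply, the share of active parameters can be computed directly. This is only a back-of-the-envelope sketch; the helper name `active_fraction` is hypothetical, and the inputs are the 30 billion total and 3 billion active figures quoted in the description.

```python
def active_fraction(total_params_b, active_params_b):
    """Share of parameters that are active, given counts in billions."""
    if total_params_b <= 0 or not 0 <= active_params_b <= total_params_b:
        raise ValueError("expected 0 <= active <= total and total > 0")
    return active_params_b / total_params_b

# Figures quoted in the description above: 30B total, 3B active.
fraction = active_fraction(30, 3)
print(f"{fraction:.0%} of parameters active")
```

The same helper can be applied to any other mixture-of-experts entry that states both a total and an active parameter count.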
-
13
LTM-1
Magic AI
Revolutionizing coding assistance with unparalleled context and accuracy.
Magic’s LTM-1 enables context windows 50 times larger than those of standard transformer models. With it, Magic has built a Large Language Model (LLM) that can take in very large amounts of context when generating suggestions, so its coding assistant can see an entire code repository. Larger context windows let a model reference more explicit, factual information and its own action history, which improves the accuracy and coherence of its responses. Magic sees this research as a path to smarter, more intuitive coding assistance tools.