List of Hugging Face Integrations
This is a list of platforms and tools that integrate with Hugging Face. This list is updated as of February 2026.
1
GLM-4.5
Z.ai
Unleashing powerful reasoning and coding for every challenge.
Z.ai's flagship model, GLM-4.5, has 355 billion total parameters (32 billion active) and is accompanied by the GLM-4.5-Air variant, which has 106 billion parameters (12 billion active). Both are built for reasoning, coding, and agentic tasks within a single framework. The model can switch between a "thinking" mode, suited to complex multi-step reasoning and tool use, and a "non-thinking" mode for quick responses. It supports a context length of up to 128K tokens and native function calling. GLM-4.5 is available through the Z.ai chat platform and API, with open weights on Hugging Face and ModelScope. It handles general problem solving, common-sense reasoning, writing code from scratch or extending existing projects, and orchestrating longer workflows such as web browsing and slide creation. The architecture is a Mixture-of-Experts design with loss-free balance routing, grouped-query attention, and an MTP layer that supports speculative decoding.
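As a quick illustration of what the "total versus active" parameter counts above imply, the following sketch computes the share of parameters that is active in each variant. It uses only the figures quoted in this listing; the helper name `active_fraction` is made up for this example.

```python
# Parameter counts in billions, as quoted in the listing above.
MODELS = {
    "GLM-4.5": {"total": 355, "active": 32},
    "GLM-4.5-Air": {"total": 106, "active": 12},
}


def active_fraction(total: float, active: float) -> float:
    """Return the share of parameters that is active (hypothetical helper)."""
    return active / total


fractions = {
    name: active_fraction(p["total"], p["active"]) for name, p in MODELS.items()
}

for name, frac in fractions.items():
    print(f"{name}: {frac:.1%} of parameters active")
```

In both variants roughly a tenth of the weights is active, which is the usual motivation for a Mixture-of-Experts design: capacity grows with total parameters while the compute spent scales with the active subset.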
2
Command A Reasoning
Cohere AI
Elevate reasoning capabilities with scalable, enterprise-ready performance.
Command A Reasoning is Cohere's advanced language model for complex reasoning tasks and for use inside AI agent frameworks. It is designed to be efficient and controllable, scales across various GPU setups, and handles context windows of up to 256,000 tokens, which is useful for large documents and intricate tasks. A configurable token budget lets businesses trade accuracy against speed, so a single model can serve both detailed and high-volume workloads. The model is the core component of Cohere's North platform and supports 23 languages. For corporate environments, it pairs its capabilities with safeguards against harmful content. It can also be deployed privately on a single H100 or A100 GPU.
3
Command A Translate
Cohere AI
Unmatched translation quality, secure, customizable, and enterprise-ready.
Cohere's Command A Translate is a machine translation model for businesses that delivers secure, high-quality translation across 23 languages. It is a 111-billion-parameter model with an 8K-token input and 8K-token output context window, and the listing describes it as outperforming rivals such as GPT-5, DeepSeek-V3, DeepL Pro, and Google Translate in various assessments. Organizations that handle sensitive data can deploy it privately and keep full control over their information. Its "Deep Translation" workflow applies multi-step refinement to improve accuracy in complex cases, and RWS Group has validated its performance on challenging translation tasks. Researchers can access the model weights on Hugging Face under a CC-BY-NC license for customization and fine-tuning.
4
PyMuPDF
Artifex
Effortlessly manipulate PDFs and Office documents with precision.
PyMuPDF is a Python library for reading, extracting, and manipulating PDF files. It gives developers access to the elements of a PDF document, including text, images, fonts, annotations, and metadata, and supports operations such as content extraction, object editing, page rendering, text search, and modification of page content. Users can also manage links and annotations; split, merge, insert, or remove pages; draw shapes; and work with color spaces. The library is designed to be lightweight, with low memory use and high performance. PyMuPDF Pro adds reading and writing of Microsoft Office formats and further integration options for workflows built on Large Language Models and Retrieval Augmented Generation.
5
Amazon Quick Suite
Amazon
Unlock insights effortlessly with powerful data automation tools.
Amazon Quick Suite is a platform that combines generative AI with analytics to help business professionals, data analysts, and subject matter experts turn data, workflows, and internal knowledge into insights and automation. Its functionality includes interactive dashboards and visualizations backed by the QuickSight service, natural language queries, generative business intelligence, workflow automation, data exploration, research support, and integration with enterprise systems and SaaS applications. Users can connect data sources such as spreadsheets, cloud data warehouses, third-party platforms, and local databases, then ask questions in plain language, design dashboards, schedule reports, or start automated tasks. Non-technical users can automate routine work such as report generation, notifications, and data integration through automated workflows.
6
Luminal
Luminal
Accelerate AI inference with unmatched speed, efficiency, flexibility.
Luminal is a machine-learning framework that prioritizes performance, ease of use, and modularity, using static graphs and compiler-based optimization to run complex neural networks efficiently. It lowers models to a minimal set of 12 primitive operations ("primops"); compiler passes then replace these with optimized kernels for a particular device, enabling high-performance execution on GPUs and other hardware. Networks are built from modules with a standardized forward API, and the GraphTensor interface lets users define typed tensors and graphs that are checked at compile time. The core is kept deliberately small, and external compilers add support for further datatypes, devices, training methods, and quantization strategies. A quick-start guide walks through cloning the repository, building a "Hello World" model, and running larger models such as LLaMA 3 with GPU support.
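To make the idea of a compiler pass that swaps primitive operations for an optimized kernel more concrete, here is a deliberately tiny sketch. It is not Luminal's API (Luminal is a separate framework with its own graph representation): the op names, the flat list used in place of a graph, and the function `fuse_mul_add` are all assumptions made for illustration.

```python
def fuse_mul_add(ops: list[str]) -> list[str]:
    """Toy rewrite pass: replace each adjacent 'mul', 'add' pair
    with a single hypothetical 'fused_mul_add' kernel."""
    fused = []
    i = 0
    while i < len(ops):
        # Pattern match on two consecutive primitive ops.
        if i + 1 < len(ops) and ops[i] == "mul" and ops[i + 1] == "add":
            fused.append("fused_mul_add")
            i += 2  # both ops are consumed by the fused kernel
        else:
            fused.append(ops[i])
            i += 1
    return fused


# A made-up program expressed as a flat sequence of primitive ops.
program = ["mul", "add", "exp2", "mul", "add", "recip"]
optimized = fuse_mul_add(program)
print(optimized)
```

A real compiler would match patterns over a dataflow graph and check that the intermediate result has no other consumers before fusing; the linear scan here only shows the shape of the transformation.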
7
HunyuanOCR
Tencent
Transforming creativity through advanced multimodal AI capabilities.
Tencent Hunyuan is a suite of multimodal AI models developed by Tencent that spans text, images, video, and 3D data and targets general-purpose applications such as content generation, visual reasoning, and business automation. The collection includes versions built for specific tasks: interpreting natural language, understanding and combining visual and textual information, generating images from text prompts, creating videos, and producing 3D visualizations. The models use a mixture-of-experts approach and techniques such as hybrid "mamba-transformer" architectures for reasoning, long-context understanding, cross-modal interaction, and efficient inference. One example is Hunyuan-Vision-1.5, which supports "thinking-on-image" for multimodal comprehension and reasoning over images, video clips, diagrams, and spatial data.
8
AWS EC2 Trn3 Instances
Amazon
Unleash unparalleled AI performance with cutting-edge computing power.
Amazon EC2 Trn3 UltraServers are AWS's latest accelerated computing systems, built on the company's Trainium3 AI chips for deep-learning training and inference. They come in two configurations: "Gen1," with 64 Trainium3 chips, and "Gen2," with up to 144 Trainium3 chips per server. The Gen2 configuration delivers 362 petaFLOPS of dense MXFP8 compute, 20 TB of HBM memory, and 706 TB/s of total memory bandwidth. A "NeuronSwitch-v1" fabric connects the chips and supports the all-to-all communication patterns needed for training large models, running mixture-of-experts architectures, and scaling distributed training.
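The aggregate Gen2 figures above can be broken down per chip with simple division. The sketch below uses only the numbers quoted in this listing and assumes a fully populated 144-chip server and decimal units (1 TB = 1000 GB); the results are derived estimates, not published per-chip specifications.

```python
# Aggregate Gen2 figures as quoted in the listing.
GEN2_CHIPS = 144
TOTAL_PFLOPS_MXFP8 = 362.0   # dense MXFP8 compute, petaFLOPS
TOTAL_HBM_TB = 20.0          # HBM capacity, terabytes
TOTAL_BANDWIDTH_TBPS = 706.0 # memory bandwidth, TB/s


def per_chip(total: float, chips: int = GEN2_CHIPS) -> float:
    """Divide an aggregate figure evenly across all chips."""
    return total / chips


pflops_per_chip = per_chip(TOTAL_PFLOPS_MXFP8)
hbm_gb_per_chip = per_chip(TOTAL_HBM_TB) * 1000  # assumes 1 TB = 1000 GB
bandwidth_per_chip = per_chip(TOTAL_BANDWIDTH_TBPS)

print(f"compute:   {pflops_per_chip:.2f} PFLOPS per chip")
print(f"HBM:       {hbm_gb_per_chip:.0f} GB per chip")
print(f"bandwidth: {bandwidth_per_chip:.1f} TB/s per chip")
```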
9
trail
trail
The AI Governance Copilot
Trail ML is a copilot platform for AI governance that helps organizations build dependable, compliant, and transparent AI systems by automating governance and documentation work. It combines AI registry management, policy development, risk evaluation, automated documentation, development oversight, audit trails, and compliance workflows in one system. Teams can organize and oversee all their AI applications, trace decisions from early data and model development through to final results, and reduce manual documentation effort. Trail ML includes governance frameworks and templates, supports custom AI policies, and helps teams identify and mitigate risks while preparing for audits against standards such as ISO 42001 and regulations such as the EU AI Act. Using curated knowledge, risk libraries, and AI-powered automation, the platform turns regulatory requirements into actionable steps and supports collaboration among stakeholders.
10
voyage-4-large
Voyage AI
Revolutionizing semantic embeddings for optimized accuracy and efficiency.
The Voyage 4 model family from Voyage AI is a series of text embedding models that share a single embedding space, so embeddings produced by any model in the series are compatible with the others. Developers can therefore mix models for document and query embedding, balancing accuracy against latency and cost. The lineup has four members: voyage-4-large, the flagship, which uses a mixture-of-experts architecture to reach state-of-the-art retrieval accuracy at nearly 40% lower serving cost than comparable dense models; voyage-4, which balances quality and performance; voyage-4-lite, which provides high-quality embeddings with fewer parameters and lower compute requirements; and voyage-4-nano, an open-weight model for local development and prototyping, distributed under an Apache 2.0 license. Because all four models operate in the same space, their embeddings are interchangeable, which enables asymmetric retrieval: documents can be embedded with one model and queries with another.
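The following sketch illustrates what asymmetric retrieval in a shared embedding space looks like mechanically. It does not call any Voyage AI API: the three-dimensional vectors are invented stand-ins for embeddings that would, under this assumption, come from a larger model for documents and a smaller model for the query. The only requirement the code relies on is that both sets of vectors live in the same space, so cosine similarity between them is meaningful.

```python
import math


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Made-up document vectors, standing in for embeddings computed once,
# offline, with a larger model.
doc_vectors = {
    "refund policy": [0.9, 0.1, 0.0],
    "shipping times": [0.1, 0.9, 0.1],
    "api rate limits": [0.0, 0.2, 0.9],
}

# Made-up query vector, standing in for an embedding computed at
# request time with a smaller, cheaper model in the same space.
query_vector = [0.8, 0.2, 0.1]


def rank(query: list[float], docs: dict[str, list[float]]) -> list[str]:
    """Return document names ordered from most to least similar."""
    return sorted(docs, key=lambda name: cosine(query, docs[name]), reverse=True)


ranking = rank(query_vector, doc_vectors)
print(ranking)
```

The design point is that the expensive model runs once per document at indexing time, while the cheap model runs on every query, where latency and cost matter most.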
11
Texel.ai
Texel.ai
Transform your GPU tasks: accelerate, optimize, and save!
Texel.ai aims to speed up GPU workloads such as AI model training and video editing by up to tenfold, while potentially cutting costs by nearly 90%. The result is better utilization of GPU resources and a more productive workflow.
12
Cleanlab
Cleanlab
Elevate data quality and streamline your AI processes effortlessly.
Cleanlab Studio is a platform for managing data quality and running data-centric AI workflows, suitable for both analytics and machine learning projects. Its automated pipeline handles data preprocessing, fine-tuning of foundation models, hyperparameter optimization, and model selection. The platform uses machine learning to detect data issues, and users can retrain their models on the improved dataset with one click. A heatmap shows suggested corrections for each class in the dataset. These insights are available at no cost immediately after data upload. Cleanlab Studio also ships with demo datasets and projects that users can try as soon as they log in.
13
Unremot
Unremot
Accelerate AI development effortlessly with ready-to-use APIs.
Unremot is a platform for building AI products that offers more than 120 ready-to-use APIs, which it says allow AI solutions to be created and launched at twice the speed and one-third of the usual expense. Even complex AI product APIs can be activated in a few minutes with little or no coding. Users choose from the AI APIs available on Unremot and incorporate them into their own products. To let Unremot access an API, you enter your private API key; you then link your product to the API through Unremot's dedicated URL, a process that takes minutes rather than days or weeks.
14
Tune AI
NimbleBox
Unlock limitless opportunities with secure, cutting-edge AI solutions.
Tune AI offers an enterprise generative AI framework built around specialized models, letting organizations hand routine tasks to AI assistants. Organizations that prioritize data security can customize and deploy generative AI solutions in their own private cloud environment, keeping data confidential throughout the process.
15
ChainForge
ChainForge
Empower your prompt engineering with innovative visual programming solutions.
ChainForge is an open-source visual programming platform for prompt engineering and the evaluation of large language models. It lets users test prompts and text-generation models systematically rather than relying on anecdotal evaluation. Users can try multiple prompt ideas and their variations across several LLMs at once, compare the quality of responses across prompts, models, and settings, and identify the best setup for a given application. They can define evaluation metrics and visualize results across prompts, parameters, models, and configurations. The platform also supports running multiple conversations concurrently, templating follow-up messages, and inspecting outputs at each turn. ChainForge works with a range of model providers, including OpenAI, HuggingFace, Anthropic, Google PaLM2, and Azure OpenAI endpoints, as well as locally hosted models such as Alpaca and Llama. Model settings are adjustable, and visualization nodes help users explore results.
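The core evaluation pattern described above, running every prompt variant against every model and scoring the responses, can be sketched in a few lines. This is not ChainForge code: the two "models" are local stub functions, and the length-based score is a placeholder metric chosen only so the example runs without network access.

```python
from itertools import product


# Stub "models": plain functions standing in for real LLM calls.
def model_upper(prompt: str) -> str:
    return prompt.upper()


def model_echo(prompt: str) -> str:
    return prompt


models = {"stub-upper": model_upper, "stub-echo": model_echo}
templates = ["Summarize: {text}", "TL;DR {text}"]


def score(response: str) -> int:
    """Placeholder metric: response length (lower is treated as better)."""
    return len(response)


def run_grid(text: str) -> list[dict]:
    """Evaluate every (template, model) combination and record its score."""
    results = []
    for template, (name, call) in product(templates, models.items()):
        response = call(template.format(text=text))
        results.append(
            {"template": template, "model": name, "score": score(response)}
        )
    return results


results = run_grid("cats sleep a lot")
best = min(results, key=lambda r: r["score"])
print(len(results), "combinations; best template:", best["template"])
```

In practice the stubs would be replaced by calls to real providers and the metric by something task-specific, such as an exact-match check or a rubric-based grader.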
16
Chainlit
Chainlit
Accelerate conversational AI development with seamless, secure integration.
Chainlit is an open-source Python library for building production-ready conversational AI applications. Developers can create a chat interface in minutes instead of weeks. It integrates with popular AI tools and frameworks, including OpenAI, LangChain, and LlamaIndex. Chainlit supports multimodal input, so users can work with images, PDFs, and other media formats. It also provides authentication compatible with providers such as Okta, Azure AD, and Google. The Prompt Playground lets developers adjust prompts in context, tuning templates, variables, and LLM settings. For oversight, Chainlit offers real-time visibility into prompts, completions, and usage analytics.
17
Hunyuan Motion 1.0
Tencent Hunyuan
Value for Users, Tech for Good
Hunyuan Motion, also known as HY-Motion 1.0, is an AI system that converts text into 3D motion, using a billion-parameter Diffusion Transformer with flow matching to produce skeleton-based animations in seconds. The model understands detailed descriptions in English and Chinese and generates smooth, lifelike motion sequences. These can be exported in formats such as SMPL, SMPLH, FBX, or BVH for use in standard 3D animation pipelines and tools such as Blender, Unity, Unreal Engine, and Maya. Training follows a three-phase pipeline: large-scale pre-training on thousands of hours of motion data, fine-tuning on selected sequences, and reinforcement learning from human feedback, which improves the model's ability to follow complex instructions and produce realistic, temporally consistent motion. The model adapts to a range of animation styles and project needs in gaming and film.
18
Molmo 2
Ai2
Breakthrough AI to solve the world's biggest problems
Molmo 2 is a collection of open vision-language models released with fully accessible weights, training data, and code. It extends the grounded image understanding of the original Molmo series to video and multi-image inputs. This enables video analysis tasks such as pointing, tracking, dense captioning, and question answering, with spatial and temporal reasoning across multiple frames. The suite comprises three models: an 8-billion-parameter model for video grounding and QA, a 4-billion-parameter model that emphasizes efficiency, and a 7-billion-parameter model built on Olmo, which makes the whole stack, including the core language model, open end to end. The new models outperform their predecessors on key benchmarks and set a new standard for open models in image and video understanding. They are also frequently competitive with much larger proprietary systems while being trained on significantly less data than comparable closed models.