List of NVIDIA DGX Cloud Serverless Inference Integrations

This is a list of platforms and tools that integrate with NVIDIA DGX Cloud Serverless Inference. This list is updated as of August 2025.

  • 1
    Leader badge
    Google Cloud Platform Reviews & Ratings

    Google Cloud Platform

    Google

    Empower your business with scalable, secure cloud solutions.
    More Information
    Company Website
    Company Website
    Google Cloud serves as an online platform where users can develop anything from basic websites to intricate business applications, catering to organizations of all sizes. New users are welcomed with a generous offer of $300 in credits, enabling them to experiment, deploy, and manage their workloads effectively, while also gaining access to over 25 products at no cost. Leveraging Google's foundational data analytics and machine learning capabilities, this service is accessible to all types of enterprises and emphasizes security and comprehensive features. By harnessing big data, businesses can enhance their products and accelerate their decision-making processes. The platform supports a seamless transition from initial prototypes to fully operational products, even scaling to accommodate global demands without concerns about reliability, capacity, or performance issues. With virtual machines that boast a strong performance-to-cost ratio and a fully-managed application development environment, users can also take advantage of high-performance, scalable, and resilient storage and database solutions. Furthermore, Google's private fiber network provides cutting-edge software-defined networking options, along with fully managed data warehousing, data exploration tools, and support for Hadoop/Spark as well as messaging services, making it an all-encompassing solution for modern digital needs.
  • 2
    Leader badge
    Microsoft Azure Reviews & Ratings

    Microsoft Azure

    Microsoft

    Empower your ideas with agile, secure cloud solutions.
    Microsoft Azure is a dynamic cloud computing platform designed to streamline the development, testing, and management of applications with speed and security. By leveraging Azure, you can creatively turn your ideas into effective solutions, taking advantage of more than 100 services that support building, deploying, and managing applications across various environments such as the cloud, on-premises, or at the edge, all while using your preferred tools and frameworks. The ongoing innovations from Microsoft ensure that your current development requirements are met while also setting the stage for your future product goals. With a strong commitment to open-source values and support for all programming languages and frameworks, Azure grants you the flexibility to create and deploy in a manner that best fits your needs. Whether your infrastructure is on-premises, cloud-based, or edge-focused, Azure is equipped to evolve alongside your existing setup. It also provides specialized services for hybrid cloud frameworks, allowing for smooth integration and effective management. Security is a key pillar of Azure, underpinned by a skilled team and proactive compliance strategies that are trusted by a wide range of organizations, including enterprises, governments, and startups. With Azure, you gain a dependable cloud solution, supported by outstanding performance metrics that confirm its reliability. Furthermore, this platform not only addresses your immediate requirements but also prepares you for the future's dynamic challenges while fostering a culture of innovation and growth.
  • 3
    Leader badge
    Amazon Web Services (AWS) Reviews & Ratings

    Amazon Web Services (AWS)

    Amazon

    Empower your innovation with unparalleled cloud resources and services.
    For those seeking computing power, data storage, content distribution, or other functionalities, AWS offers the essential resources to develop sophisticated applications with improved adaptability, scalability, and reliability. As the largest and most prevalent cloud platform globally, Amazon Web Services (AWS) features over 175 comprehensive services distributed across numerous data centers worldwide. A wide array of users, from swiftly evolving startups to major enterprises and influential governmental organizations, utilize AWS to lower costs, boost efficiency, and speed up their innovative processes. With a more extensive selection of services and features than any other cloud provider—ranging from fundamental infrastructure like computing, storage, and databases to innovative technologies such as machine learning, artificial intelligence, data lakes, analytics, and the Internet of Things—AWS simplifies the transition of existing applications to the cloud. This vast range of offerings not only enables businesses to harness the full potential of cloud technologies but also fosters optimized workflows and heightened competitiveness in their industries. Ultimately, AWS empowers organizations to stay ahead in a rapidly evolving digital landscape.
  • 4
    Splunk Cloud Platform Reviews & Ratings

    Splunk Cloud Platform

    Splunk

    Transform your data into insights with effortless scalability.
    Splunk simplifies the transformation of data into actionable insights, offering a secure and reliable service that scales effortlessly. By relying on our Splunk experts to manage your IT backend, you can focus on maximizing the value of your data. The infrastructure provided and managed by Splunk ensures a smooth, cloud-based data analytics experience that can be set up within as little as 48 hours. Regular updates to the software mean you will always have access to the latest features and improvements. In just a few days, with minimal requirements, you can tap into the full potential of your data for actionable insights. Complying with FedRAMP security standards, Splunk Cloud enables U.S. federal agencies and their partners to make informed decisions and take action swiftly. The inclusion of mobile applications and natural language processing features further enhances productivity and provides contextual insights, expanding the reach of your solutions with ease. Whether you are overseeing infrastructure or ensuring compliance with data regulations, Splunk Cloud is built to scale efficiently, delivering powerful solutions tailored to your evolving needs. Ultimately, this agility and effectiveness can markedly improve your organization's operational performance and strategic decision-making capabilities. As a result, embracing Splunk can lead to a significant competitive advantage in today’s data-driven landscape.
  • 5
    Helm Reviews & Ratings

    Helm

    Helm

    Unleash creativity with a versatile, freedom-driven synthesizer experience!
    Helm is designed to work seamlessly with GNU/Linux, Mac OSX, and Windows operating systems. You have the option to use Helm either as a standalone synthesizer or as a plugin compatible with formats like LV2, VST, VST3, and AU, available in both 32-bit and 64-bit versions. This versatility means you can enjoy Helm without worrying about digital rights management (DRM), allowing you to explore, modify, and share the source code freely, whether in its original state or after making alterations. Serving as a software synthesizer, Helm provides users the capability to create electronic music directly from their computers. Its guiding principle of being "free as in freedom" signifies that you retain full control over the software, rather than being governed by it. Financially, Helm follows a "pay what you want" model, enabling you to use it at no cost if that is your preference. Additionally, any sounds generated by Helm are the property of the user, giving you copyright ownership over each sound you create. The interface features small power buttons in the top left corner that allow you to easily toggle various modules on or off. Among these essential modules, the SUB module acts as a key sound generator, handling a single oscillator that usually produces a note an octave lower than the one being played. This user-friendly design encourages experimentation, making it easy for you to develop and refine your own distinctive sounds. You can explore endless sonic possibilities and unleash your creativity with Helm's intuitive layout.
  • 6
    Nebius Reviews & Ratings

    Nebius

    Nebius

    Unleash AI potential with powerful, affordable training solutions.
    An advanced platform tailored for training purposes comes fitted with NVIDIA® H100 Tensor Core GPUs, providing attractive pricing options and customized assistance. This system is specifically engineered to manage large-scale machine learning tasks, enabling effective multihost training that leverages thousands of interconnected H100 GPUs through the cutting-edge InfiniBand network, reaching speeds as high as 3.2Tb/s per host. Users can enjoy substantial financial benefits, including a minimum of 50% savings on GPU compute costs in comparison to top public cloud alternatives*, alongside additional discounts for GPU reservations and bulk ordering. To ensure a seamless onboarding experience, we offer dedicated engineering support that guarantees efficient platform integration while optimizing your existing infrastructure and deploying Kubernetes. Our fully managed Kubernetes service simplifies the deployment, scaling, and oversight of machine learning frameworks, facilitating multi-node GPU training with remarkable ease. Furthermore, our Marketplace provides a selection of machine learning libraries, applications, frameworks, and tools designed to improve your model training process. New users are encouraged to take advantage of a free one-month trial, allowing them to navigate the platform's features without any commitment. This unique blend of high performance and expert support positions our platform as an exceptional choice for organizations aiming to advance their machine learning projects and achieve their goals. Ultimately, this offering not only enhances productivity but also fosters innovation and growth in the field of artificial intelligence.
  • 7
    NVIDIA NIM Reviews & Ratings

    NVIDIA NIM

    NVIDIA

    Empower your AI journey with seamless integration and innovation.
    Explore the latest innovations in AI models designed for optimization, connect AI agents to data utilizing NVIDIA NeMo, and implement solutions effortlessly through NVIDIA NIM microservices. These microservices are designed for ease of use, allowing the deployment of foundational models across multiple cloud platforms or within data centers, ensuring data protection while facilitating effective AI integration. Additionally, NVIDIA AI provides opportunities to access the Deep Learning Institute (DLI), where learners can enhance their technical skills, gain hands-on experience, and deepen their expertise in areas such as AI, data science, and accelerated computing. AI models generate outputs based on complex algorithms and machine learning methods; however, it is important to recognize that these outputs can occasionally be flawed, biased, harmful, or unsuitable. Interacting with this model means understanding and accepting the risks linked to potential negative consequences of its responses. It is advisable to avoid sharing any sensitive or personal information without explicit consent, and users should be aware that their activities may be monitored for security purposes. As the field of AI continues to evolve, it is crucial for users to remain informed and cautious regarding the ramifications of implementing such technologies, ensuring proactive engagement with the ethical implications of their usage. Staying updated about the ongoing developments in AI will help individuals make more informed decisions regarding their applications.
  • 8
    Oracle Cloud Infrastructure Reviews & Ratings

    Oracle Cloud Infrastructure

    Oracle

    Empower your digital transformation with cutting-edge cloud solutions.
    Oracle Cloud Infrastructure is designed to support both traditional workloads and cutting-edge cloud development tools tailored for contemporary requirements. Its architecture is equipped to detect and address modern security threats, thereby accelerating innovation. By combining cost-effectiveness with outstanding performance, it significantly lowers the total cost of ownership for users. As a Generation 2 enterprise cloud, Oracle Cloud showcases remarkable compute and networking features while providing a broad spectrum of infrastructure and platform cloud services. Specifically tailored to meet the needs of mission-critical applications, it allows businesses to maintain legacy workloads while advancing toward future goals. Importantly, the Generation 2 Cloud can run the Oracle Autonomous Database, which is celebrated as the first and only self-driving database in the industry. In addition, Oracle Cloud offers an extensive array of cloud computing solutions, including application development, business analytics, data management, integration, security, artificial intelligence, and blockchain technology, ensuring organizations are well-equipped to succeed in an increasingly digital environment. This all-encompassing strategy firmly establishes Oracle Cloud as a frontrunner in the rapidly changing cloud landscape. Consequently, organizations leveraging Oracle Cloud can confidently embrace transformation and drive their digital initiatives forward.
  • 9
    Yotta Reviews & Ratings

    Yotta

    Yotta

    Transforming asset management for a smarter, connected future.
    Alloy propels asset-rich environments into the future by offering cutting-edge connected asset management solutions. It distinguishes itself as the only platform that can seamlessly connect people, systems, and assets. Designed for scalability, the solution accommodates a variety of asset types while integrating smoothly with all existing systems via robust APIs. This approach encompasses every physical and organizational structure vital for your organization’s operations. As data volumes grow, Alloy facilitates effective management, enabling you to make informed and timely decisions. We ensure your team has access to essential information that helps them make critical choices when it counts the most. Furthermore, Alloy provides a flexible and mobile asset management platform that generates valuable operational insights related to your grounds management strategy, guaranteeing that your enterprise remains efficient and adaptable in a constantly changing environment. With Alloy, you can confidently navigate the complexities of asset management, propelling your organization toward future success while fostering a culture of innovation and responsiveness.
  • 10
    CoreWeave Reviews & Ratings

    CoreWeave

    CoreWeave

    Empowering AI innovation with scalable, high-performance GPU solutions.
    CoreWeave distinguishes itself as a cloud infrastructure provider dedicated to GPU-driven computing solutions tailored for artificial intelligence applications. Their platform provides scalable and high-performance GPU clusters that significantly improve both the training and inference phases of AI models, serving industries like machine learning, visual effects, and high-performance computing. Beyond its powerful GPU offerings, CoreWeave also features flexible storage, networking, and managed services that support AI-oriented businesses, highlighting reliability, cost-efficiency, and exceptional security protocols. This adaptable platform is embraced by AI research centers, labs, and commercial enterprises seeking to accelerate their progress in artificial intelligence technology. By delivering infrastructure that aligns with the unique requirements of AI workloads, CoreWeave is instrumental in fostering innovation across multiple sectors, ultimately helping to shape the future of AI applications. Moreover, their commitment to continuous improvement ensures that clients remain at the forefront of technological advancements.
  • 11
    NVIDIA DGX Cloud Reviews & Ratings

    NVIDIA DGX Cloud

    NVIDIA

    Empower innovation with seamless AI infrastructure in the cloud.
    The NVIDIA DGX Cloud offers a robust AI infrastructure as a service, streamlining the process of deploying extensive AI models and fostering rapid innovation. This platform presents a wide array of tools tailored for machine learning, deep learning, and high-performance computing, allowing enterprises to execute their AI tasks effectively in the cloud. Additionally, its effortless integration with leading cloud services provides the scalability, performance, and adaptability required to address intricate AI challenges, while also removing the burdens associated with on-site hardware management. This makes it an invaluable resource for organizations looking to harness the power of AI without the typical constraints of physical infrastructure.
  • 12
    NVIDIA AI Foundations Reviews & Ratings

    NVIDIA AI Foundations

    NVIDIA

    Empowering innovation and creativity through advanced AI solutions.
    Generative AI is revolutionizing a multitude of industries by creating extensive opportunities for knowledge workers and creative professionals to address critical challenges facing society today. NVIDIA plays a pivotal role in this evolution, offering a comprehensive suite of cloud services, pre-trained foundational models, and advanced frameworks, complemented by optimized inference engines and APIs, which facilitate the seamless integration of intelligence into business applications. The NVIDIA AI Foundations suite equips enterprises with cloud solutions that bolster generative AI capabilities, enabling customized applications across various sectors, including text analysis (NVIDIA NeMo™), digital visual creation (NVIDIA Picasso), and life sciences (NVIDIA BioNeMo™). By utilizing the strengths of NeMo, Picasso, and BioNeMo through NVIDIA DGX™ Cloud, organizations can unlock the full potential of generative AI technology. This innovative approach is not confined solely to creative tasks; it also supports the generation of marketing materials, the development of storytelling content, global language translation, and the synthesis of information from diverse sources like news articles and meeting records. As businesses leverage these cutting-edge tools, they can drive innovation, adapt to emerging trends, and maintain a competitive edge in a rapidly changing digital environment, ultimately reshaping how they operate and engage with their audiences.
  • 13
    NVIDIA Cloud Functions Reviews & Ratings

    NVIDIA Cloud Functions

    NVIDIA

    Effortlessly deploy AI workloads with secure, scalable solutions.
    NVIDIA Cloud Functions (NVCF) serves as a specialized serverless API designed for the deployment and oversight of AI operations on GPUs, guaranteeing essential aspects like security, scalability, and reliable performance. The platform supports multiple access avenues, such as HTTP polling, HTTP streaming, and gRPC protocols, facilitating interactions with various workloads. NVCF is particularly well-suited for short-lived, preemptable tasks like inferencing and fine-tuning of models. Users have the flexibility to select from two distinct function types: "Container" and "Helm Chart," allowing for tailored customization according to individual requirements. Given that workloads are temporary and can be interrupted, it is vital for users to consistently save their progress. Furthermore, models, containers, helm charts, and other critical assets are managed within the NGC Private Registry for efficient storage and retrieval. To help users get started with NVCF, a quickstart guide for functions is available, detailing a thorough workflow for setting up and deploying a container-based function using the fastapi_echo_sample container. This guide not only emphasizes the simplicity of the setup process but also motivates users to delve deeper into the capabilities of NVIDIA’s serverless framework, thereby maximizing their experience and utilization of the platform. As users become familiar with NVCF, they can unlock new opportunities for innovation in AI applications.
  • 14
    Llama Reviews & Ratings

    Llama

    Meta

    Empowering researchers with inclusive, efficient AI language models.
    Llama, a leading-edge foundational large language model developed by Meta AI, is designed to assist researchers in expanding the frontiers of artificial intelligence research. By offering streamlined yet powerful models like Llama, even those with limited resources can access advanced tools, thereby enhancing inclusivity in this fast-paced and ever-evolving field. The development of more compact foundational models, such as Llama, proves beneficial in the realm of large language models since they require considerably less computational power and resources, which allows for the exploration of novel approaches, validation of existing studies, and examination of potential new applications. These models harness vast amounts of unlabeled data, rendering them particularly effective for fine-tuning across diverse tasks. We are introducing Llama in various sizes, including 7B, 13B, 33B, and 65B parameters, each supported by a comprehensive model card that details our development methodology while maintaining our dedication to Responsible AI practices. By providing these resources, we seek to empower a wider array of researchers to actively participate in and drive forward the developments in the field of AI. Ultimately, our goal is to foster an environment where innovation thrives and collaboration flourishes.
  • Previous
  • You're on page 1
  • Next