List of Amundsen Integrations
This is a list of platforms and tools that integrate with Amundsen. This list is updated as of May 2026.
-
1
Google Cloud serves as an online platform where users can develop anything from basic websites to intricate business applications, catering to organizations of all sizes. New users are welcomed with a generous offer of $300 in credits, enabling them to experiment, deploy, and manage their workloads effectively, while also gaining access to over 25 products at no cost. Leveraging Google's foundational data analytics and machine learning capabilities, this service is accessible to all types of enterprises and emphasizes security and comprehensive features. By harnessing big data, businesses can enhance their products and accelerate their decision-making processes. The platform supports a seamless transition from initial prototypes to fully operational products, even scaling to accommodate global demands without concerns about reliability, capacity, or performance issues. With virtual machines that boast a strong performance-to-cost ratio and a fully-managed application development environment, users can also take advantage of high-performance, scalable, and resilient storage and database solutions. Furthermore, Google's private fiber network provides cutting-edge software-defined networking options, along with fully managed data warehousing, data exploration tools, and support for Hadoop/Spark as well as messaging services, making it an all-encompassing solution for modern digital needs.
-
2
Google Cloud BigQuery
Google
Unlock insights effortlessly with powerful, AI-driven analytics solutions.BigQuery serves as a serverless, multicloud data warehouse that simplifies the handling of diverse data types, allowing businesses to quickly extract significant insights. As an integral part of Google’s data cloud, it facilitates seamless data integration, cost-effective and secure scaling of analytics capabilities, and features built-in business intelligence for disseminating comprehensive data insights. With an easy-to-use SQL interface, it also supports the training and deployment of machine learning models, promoting data-driven decision-making throughout organizations. Its strong performance capabilities ensure that enterprises can manage escalating data volumes with ease, adapting to the demands of expanding businesses. Furthermore, Gemini within BigQuery introduces AI-driven tools that bolster collaboration and enhance productivity, offering features like code recommendations, visual data preparation, and smart suggestions designed to boost efficiency and reduce expenses. The platform provides a unified environment that includes SQL, a notebook, and a natural language-based canvas interface, making it accessible to data professionals across various skill sets. This integrated workspace not only streamlines the entire analytics process but also empowers teams to accelerate their workflows and improve overall effectiveness. Consequently, organizations can leverage these advanced tools to stay competitive in an ever-evolving data landscape. -
3
Snowflake
Snowflake
Unlock scalable data management for insightful, secure analytics.Snowflake is a leading AI Data Cloud platform designed to help organizations harness the full potential of their data by breaking down silos and streamlining data management with unmatched scale and simplicity. The platform’s interoperable storage capability offers near-infinite access to data across multiple clouds and regions, enabling seamless collaboration and analytics. Snowflake’s elastic compute engine ensures top-tier performance for diverse workloads, automatically scaling to meet demand and optimize costs. Cortex AI, Snowflake’s integrated AI service, provides enterprises secure access to industry-leading large language models and conversational AI capabilities to accelerate data-driven decision making. Snowflake’s comprehensive cloud services automate infrastructure management, helping businesses reduce operational complexity and improve reliability. Snowgrid extends data and app connectivity globally across regions and clouds with consistent security and governance. The Horizon Catalog is a powerful governance tool that ensures compliance, privacy, and controlled access to data assets. Snowflake Marketplace facilitates easy discovery and collaboration by connecting customers to vital data and applications within the AI Data Cloud ecosystem. Trusted by more than 11,000 customers globally, including leading brands across healthcare, finance, retail, and media, Snowflake drives innovation and competitive advantage. Their extensive developer resources, training, and community support empower organizations to build, deploy, and scale AI and data applications securely and efficiently. -
4
Amazon Web Services (AWS) is a global leader in cloud computing, providing the broadest and deepest set of cloud capabilities on the market. From compute and storage to advanced analytics, AI, and agentic automation, AWS enables organizations to build, scale, and transform their businesses. Enterprises rely on AWS for secure, compliant infrastructure while startups leverage it to launch quickly and innovate without heavy upfront costs. The platform’s extensive service catalog includes solutions for machine learning (Amazon SageMaker), serverless computing (AWS Lambda), global content delivery (Amazon CloudFront), and managed databases (Amazon DynamoDB). With the launch of Amazon Q Developer and AWS Transform, AWS is also pioneering the next wave of agentic AI and modernization technologies. Its infrastructure spans 120 availability zones in 38 regions, with expansion plans into Saudi Arabia, Chile, and Europe’s Sovereign Cloud, guaranteeing unmatched global reach. Customers benefit from real-time scalability, security trusted by the world’s largest enterprises, and automation that streamlines complex operations. AWS is also home to the largest global partner network, marketplace, and developer community, making adoption easier and more collaborative. Training, certifications, and digital courses further support workforce upskilling in cloud and AI. Backed by years of operational expertise and constant innovation, AWS continues to redefine how the world builds and runs technology in the cloud era.
-
5
MySQL
Oracle
Powerful, reliable database solution for modern web applications.MySQL is recognized as the leading open source database in the world. Its impressive history of reliability, performance, and ease of use has made it the go-to choice for many web applications, including major platforms like Facebook, Twitter, and YouTube, as well as the five most visited websites. Additionally, MySQL is a popular option for embedded database solutions, with many independent software vendors and original equipment manufacturers distributing it. The database's flexibility and powerful capabilities further enhance its popularity across diverse sectors, making it a critical tool for developers and businesses alike. Its continued evolution ensures that it remains relevant in an ever-changing technological landscape. -
6
SQL Server
Microsoft
Empowering businesses with intelligent data solutions and flexibility.Microsoft SQL Server 2019 merges cutting-edge intelligence with robust security features, presenting a wealth of additional tools at no extra expense while maintaining exceptional performance and flexibility tailored for on-premises needs. Users can effortlessly migrate to the cloud, fully leveraging its operational efficiency and nimbleness without modifying their existing codebase. By harnessing Azure, organizations can speed up the generation of insights and engage in predictive analytics more effectively. The development process remains versatile, empowering users to select their preferred technologies, including those from the open-source community, all backed by Microsoft's continuous innovations. This platform facilitates straightforward data integration within applications and provides an extensive range of cognitive services designed to nurture human-like intelligence, accommodating any data volume. AI is fundamentally woven into the data platform, enabling faster insight extraction from data stored both on-premises and in the cloud. Combining proprietary enterprise data with global datasets allows organizations to cultivate a culture steeped in intelligence. Moreover, the adaptable data platform ensures a uniform user experience across diverse environments, significantly reducing the time required to launch new innovations; this flexibility enables developers to create and deploy applications in multiple settings, ultimately boosting overall operational productivity and effectiveness. As a result, businesses can respond swiftly to market changes and evolving customer demands. -
7
Amazon Athena
Amazon
"Effortless data analysis with instant insights using SQL."Amazon Athena is an interactive query service that makes it easy to analyze data stored in Amazon S3 by utilizing standard SQL. Being a serverless offering, it removes the burden of infrastructure management, enabling users to pay only for the queries they run. Its intuitive interface allows you to directly point to your data in Amazon S3, define the schema, and start querying using standard SQL commands, with most results generated in just a few seconds. Athena bypasses the need for complex ETL processes, empowering anyone with SQL knowledge to quickly explore extensive datasets. Furthermore, it provides seamless integration with AWS Glue Data Catalog, which helps in creating a unified metadata repository across various services. This integration not only allows users to crawl data sources for schema identification and update the Catalog with new or modified table definitions, but also aids in managing schema versioning. Consequently, this functionality not only simplifies data management but also significantly boosts the efficiency of data analysis within the AWS ecosystem. Overall, Athena's capabilities make it an invaluable tool for data analysts looking for rapid insights without the overhead of traditional data preparation methods. -
8
Elasticsearch
Elastic
Transform your data into actionable insights with ease.Elastic is a prominent search technology firm that has created a suite known as the Elastic Stack, which includes Elasticsearch, Kibana, Beats, and Logstash. These software-as-a-service solutions enable users to leverage data for real-time analytics, security measures, search functionalities, and logging at scale. With a community of over 100,000 members spread across 45 nations, Elastic's products have been downloaded more than 400 million times since their launch. Currently, numerous organizations, including notable names like Cisco, eBay, Dell, Goldman Sachs, Groupon, HP, Microsoft, Netflix, Uber, Verizon, and Yelp, rely on Elastic Stack and Elastic Cloud to enhance their critical systems, driving significant revenue growth and reducing costs. Headquartered in both Amsterdam, The Netherlands, and Mountain View, California, Elastic employs a workforce of more than 1,000 individuals across more than 35 countries, contributing to its global impact in the tech industry. This extensive reach and adoption highlight Elastic's vital role in transforming how enterprises manage and utilize their data. -
9
Apache Cassandra
Apache Software Foundation
Unmatched scalability and reliability for your data management needs.Apache Cassandra serves as an exemplary database solution for scenarios demanding exceptional scalability and availability, all while ensuring peak performance. Its capacity for linear scalability, combined with robust fault-tolerance features, makes it a prime candidate for effective data management, whether implemented on traditional hardware or in cloud settings. Furthermore, Cassandra stands out for its capability to replicate data across multiple datacenters, which minimizes latency for users and provides an added layer of security against regional outages. This distinctive blend of functionalities not only enhances operational resilience but also fosters efficiency, making Cassandra an attractive choice for enterprises aiming to optimize their data handling processes. Such attributes underscore its significance in an increasingly data-driven world. -
10
Apache Hive
Apache Software Foundation
Streamline your data processing with powerful SQL-like queries.Apache Hive serves as a data warehousing framework that empowers users to access, manipulate, and oversee large datasets spread across distributed systems using a SQL-like language. It facilitates the structuring of pre-existing data stored in various formats. Users have the option to interact with Hive through a command line interface or a JDBC driver. As a project under the auspices of the Apache Software Foundation, Apache Hive is continually supported by a group of dedicated volunteers. Originally integrated into the Apache® Hadoop® ecosystem, it has matured into a fully-fledged top-level project with its own identity. We encourage individuals to delve deeper into the project and contribute their expertise. To perform SQL operations on distributed datasets, conventional SQL queries must be run through the MapReduce Java API. However, Hive streamlines this task by providing a SQL abstraction, allowing users to execute queries in the form of HiveQL, thus eliminating the need for low-level Java API implementations. This results in a much more user-friendly and efficient experience for those accustomed to SQL, leading to greater productivity when dealing with vast amounts of data. Moreover, the adaptability of Hive makes it a valuable tool for a diverse range of data processing tasks. -
11
Amazon Redshift
Amazon
Unlock powerful analytics with scalable, serverless cloud solutions.Amazon Redshift is a high-performance cloud data warehouse platform from AWS designed to power modern analytics, business intelligence, and agentic AI workloads across enterprise environments. The platform enables organizations to unify and analyze structured and unstructured data from Amazon Redshift warehouses, Amazon S3 data lakes, and third-party or federated data sources through an integrated lakehouse architecture within Amazon SageMaker. Redshift delivers strong scalability and industry-leading price-performance, helping businesses process large-scale analytics workloads while optimizing infrastructure costs and operational efficiency. AWS Graviton-powered Redshift RG instances significantly improve throughput and query performance while reducing per-vCPU costs and supporting native processing of open data formats such as Apache Iceberg and Apache Parquet. The platform also offers Redshift Serverless, which allows organizations to quickly run and scale analytics without provisioning, configuring, or managing infrastructure resources manually. Zero-ETL integrations simplify data movement by connecting streaming services, operational databases, and enterprise applications directly into analytics workflows for near real-time insights without the need for complex pipelines. Amazon Redshift integrates with Amazon SageMaker to support SQL analytics, machine learning workflows, and unified access to enterprise data across hybrid analytics environments. The solution also integrates with Amazon Bedrock, enabling organizations to use Redshift as a structured knowledge base that enhances the accuracy and contextual relevance of generative AI applications. Businesses can use Amazon Redshift for a variety of use cases including financial forecasting, demand planning, business intelligence optimization, machine learning acceleration, and data monetization strategies. -
12
Vertica
Rocket Software
Unlock powerful analytics and AI across diverse environments.Vertica is an enterprise analytics database platform that delivers high-performance data warehousing, large-scale analytics, and AI-powered data processing for organizations operating across hybrid cloud and mission-critical environments. Following its acquisition by Rocket Software, Vertica became a core component of Rocket’s modernization strategy focused on helping enterprises combine trusted infrastructure with advanced analytics and artificial intelligence capabilities. The platform is designed to process massive volumes of enterprise data while supporting complex analytical workloads, real-time reporting, and AI-driven decision-making across cloud, on-premises, private cloud, and hybrid deployments. Vertica enables organizations to modernize legacy systems and unlock deeper business insights by running advanced analytics and generative AI directly on trusted enterprise data sources without disrupting operational stability or existing workflows. The platform supports scalable query processing, enterprise data warehousing, and integrated analytics that help businesses accelerate innovation, optimize operational efficiency, and improve strategic decision-making. Vertica also strengthens Rocket Software’s enterprise data portfolio alongside Rocket DataEdge and Rocket ContentEdge solutions, creating an integrated modernization ecosystem for enterprise data governance, analytics, connectivity, and intelligence. Businesses can use Vertica to consolidate large-scale analytics workloads, modernize core systems, support AI adoption initiatives, and deploy enterprise analytics infrastructure across flexible environments that meet evolving operational and regulatory requirements. The platform is designed to support organizations that require high-speed analytics, scalable AI-ready infrastructure, and modern data architectures capable of handling mission-critical workloads. -
13
IBM Db2
IBM
Unlock data potential with AI-driven management solutions today!IBM Db2 represents a comprehensive array of data management solutions, with a strong emphasis on the Db2 relational database. These solutions incorporate AI-driven features aimed at facilitating the management of both structured and unstructured data within a variety of on-premises and multicloud environments. By making data more accessible, the Db2 suite enables companies to fully harness the benefits of AI technology. Most of the Db2 components are seamlessly integrated into the IBM Cloud Pak® for Data platform, offered either as supplementary features or as inherent data source services, which guarantees that nearly all data is available across hybrid or multicloud infrastructures to support AI-centric applications. Users can easily consolidate their transactional data repositories and quickly gain insights through intelligent, universal querying across multiple data sources. The multimodel capabilities contribute to cost reduction by eliminating the need for data replication and migration. Furthermore, Db2 provides remarkable flexibility, allowing for deployment across any cloud service provider, thus enhancing operational agility and responsiveness. This range of deployment options ensures that organizations can modify their data management approaches to align with their evolving requirements, ultimately fostering innovation and adaptability in their operations. This adaptability is crucial for maintaining a competitive edge in today’s rapidly changing business landscape. -
14
AWS Glue
Amazon
Transform data integration effortlessly with serverless simplicity and speed.AWS Glue is a fully managed, serverless solution tailored for data integration, facilitating the easy discovery, preparation, and merging of data for a variety of applications, including analytics, machine learning, and software development. The service incorporates all essential functionalities for effective data integration, allowing users to conduct data analysis and utilize insights in a matter of minutes, significantly reducing the timeline from months to mere moments. The data integration workflow comprises several stages, such as identifying and extracting data from multiple sources, followed by the processes of enhancing, cleaning, normalizing, and merging the data before it is systematically organized in databases, data warehouses, and data lakes. Various users, each with their specific tools, typically oversee these distinct responsibilities, ensuring a comprehensive approach to data management. By operating within a serverless framework, AWS Glue removes the burden of infrastructure management from its users, as it automatically provisions, configures, and scales the necessary resources for executing data integration tasks. This feature allows organizations to concentrate on gleaning insights from their data instead of grappling with operational challenges. In addition to streamlining data workflows, AWS Glue also fosters collaboration and productivity among teams, enabling businesses to respond swiftly to changing data needs. The overall efficiency gained through this service positions companies to thrive in today’s data-driven environment. -
15
Apache Druid
Druid
Unlock real-time analytics with unparalleled performance and resilience.Apache Druid stands out as a robust open-source distributed data storage system that harmonizes elements from data warehousing, timeseries databases, and search technologies to facilitate superior performance in real-time analytics across diverse applications. The system's ingenious design incorporates critical attributes from these three domains, which is prominently reflected in its ingestion processes, storage methodologies, query execution, and overall architectural framework. By isolating and compressing individual columns, Druid adeptly retrieves only the data necessary for specific queries, which significantly enhances the speed of scanning, sorting, and grouping tasks. Moreover, the implementation of inverted indexes for string data considerably boosts the efficiency of search and filter operations. With readily available connectors for platforms such as Apache Kafka, HDFS, and AWS S3, Druid integrates effortlessly into existing data management workflows. Its intelligent partitioning approach markedly improves the speed of time-based queries when juxtaposed with traditional databases, yielding exceptional performance outcomes. Users benefit from the flexibility to easily scale their systems by adding or removing servers, as Druid autonomously manages the process of data rebalancing. In addition, its fault-tolerant architecture guarantees that the system can proficiently handle server failures, thus preserving operational stability. This resilience and adaptability make Druid a highly appealing option for organizations in search of dependable and efficient analytics solutions, ultimately driving better decision-making and insights. -
16
OpenMetadata
OpenMetadata
"Centralize, discover, and govern your metadata effortlessly."OpenMetadata functions as a holistic, open platform designed to integrate metadata, enhancing data discovery, observability, and governance through a unified interface. Leveraging a Unified Metadata Graph and over 80 pre-built connectors, it consolidates metadata from diverse sources including databases, pipelines, BI tools, and ML systems, which provides teams with a rich context to efficiently search, filter, and visualize their organizational assets. The platform is constructed on an API- and schema-first framework that enables flexible customization of metadata entities and relationships, allowing businesses to define their metadata architecture with accuracy. With just four core system components, OpenMetadata is designed for easy installation and operation, delivering scalable performance that allows both technical and non-technical users to collaborate effectively on tasks related to discovery, lineage tracking, quality assurance, observability, collaboration, and governance without requiring complex infrastructure. This adaptability positions it as an essential asset for organizations striving to maximize the potential of their data resources. Moreover, its user-friendly design fosters a collaborative environment where diverse teams can engage with data insights more intuitively. -
17
Oracle Cloud Infrastructure
Oracle
Empower your digital transformation with cutting-edge cloud solutions.Oracle Cloud Infrastructure is designed to support both traditional workloads and cutting-edge cloud development tools tailored for contemporary requirements. Its architecture is equipped to detect and address modern security threats, thereby accelerating innovation. By combining cost-effectiveness with outstanding performance, it significantly lowers the total cost of ownership for users. As a Generation 2 enterprise cloud, Oracle Cloud showcases remarkable compute and networking features while providing a broad spectrum of infrastructure and platform cloud services. Specifically tailored to meet the needs of mission-critical applications, it allows businesses to maintain legacy workloads while advancing toward future goals. Importantly, the Generation 2 Cloud can run the Oracle Autonomous Database, which is celebrated as the first and only self-driving database in the industry. In addition, Oracle Cloud offers an extensive array of cloud computing solutions, including application development, business analytics, data management, integration, security, artificial intelligence, and blockchain technology, ensuring organizations are well-equipped to succeed in an increasingly digital environment. This all-encompassing strategy firmly establishes Oracle Cloud as a frontrunner in the rapidly changing cloud landscape. Consequently, organizations leveraging Oracle Cloud can confidently embrace transformation and drive their digital initiatives forward. -
18
PostgreSQL
PostgreSQL Global Development Group
Dependable, feature-rich database system for performance and security.PostgreSQL is a robust and well-established open-source object-relational database system that has been under continuous development for over thirty years, earning a strong reputation for its dependability, rich features, and exceptional performance. The official documentation provides thorough resources for both installation and usage, making it an essential reference for newcomers and seasoned users alike. Moreover, the vibrant open-source community supports numerous forums and platforms where enthusiasts can deepen their understanding of PostgreSQL, explore its capabilities, and discover job openings in the field. Participating in this community can greatly enrich your knowledge while strengthening your ties to the PostgreSQL network. Recently, the PostgreSQL Global Development Group revealed updates for all currently supported versions, including 15.1, 14.6, 13.9, 12.13, 11.18, and 10.23, which fix 25 bugs reported in recent months. It is important to note that this update represents the final release for PostgreSQL 10, which will no longer receive any security patches or bug fixes moving forward. Therefore, if you are still using PostgreSQL 10 in a production environment, it is strongly advised to organize an upgrade to a newer version to maintain support and security. Transitioning to a more recent version will not only help safeguard your data but also enable you to benefit from the latest features and enhancements introduced in newer updates. Furthermore, keeping your database system up-to-date can significantly improve overall performance and provide better compatibility with modern applications. -
19
Apache Spark
Apache Software Foundation
Transform your data processing with powerful, versatile analytics.Apache Spark™ is a powerful analytics platform crafted for large-scale data processing endeavors. It excels in both batch and streaming tasks by employing an advanced Directed Acyclic Graph (DAG) scheduler, a highly effective query optimizer, and a streamlined physical execution engine. With more than 80 high-level operators at its disposal, Spark greatly facilitates the creation of parallel applications. Users can engage with the framework through a variety of shells, including Scala, Python, R, and SQL. Spark also boasts a rich ecosystem of libraries—such as SQL and DataFrames, MLlib for machine learning, GraphX for graph analysis, and Spark Streaming for processing real-time data—which can be effortlessly woven together in a single application. This platform's versatility allows it to operate across different environments, including Hadoop, Apache Mesos, Kubernetes, standalone systems, or cloud platforms. Additionally, it can interface with numerous data sources, granting access to information stored in HDFS, Alluxio, Apache Cassandra, Apache HBase, Apache Hive, and many other systems, thereby offering the flexibility to accommodate a wide range of data processing requirements. Such a comprehensive array of functionalities makes Spark a vital resource for both data engineers and analysts, who rely on it for efficient data management and analysis. The combination of its capabilities ensures that users can tackle complex data challenges with greater ease and speed. -
20
Delta Lake
Delta Lake
Transform big data management with reliable ACID transactions today!Delta Lake acts as an open-source storage solution that integrates ACID transactions within Apache Spark™ and enhances operations in big data environments. In conventional data lakes, various pipelines function concurrently to read and write data, often requiring data engineers to invest considerable time and effort into preserving data integrity due to the lack of transactional support. With the implementation of ACID transactions, Delta Lake significantly improves data lakes, providing a high level of consistency thanks to its serializability feature, which represents the highest standard of isolation. For more detailed exploration, you can refer to Diving into Delta Lake: Unpacking the Transaction Log. In the big data landscape, even metadata can become quite large, and Delta Lake treats metadata with the same importance as the data itself, leveraging Spark's distributed processing capabilities for effective management. As a result, Delta Lake can handle enormous tables that scale to petabytes, containing billions of partitions and files with ease. Moreover, Delta Lake's provision for data snapshots empowers developers to access and restore previous versions of data, making audits, rollbacks, or experimental replication straightforward, while simultaneously ensuring data reliability and consistency throughout the system. This comprehensive approach not only streamlines data management but also enhances operational efficiency in data-intensive applications. -
21
Datafold
Datafold
Revolutionize data management for peak performance and efficiency.Prevent data outages by taking a proactive approach to identify and address data quality issues before they make it to production. You can achieve comprehensive test coverage of your data pipelines in just a single day, elevating your performance from zero to a hundred percent. With automated regression testing spanning billions of rows, you will gain insights into the effects of each code change. Simplify your change management processes, boost data literacy, ensure compliance, and reduce response times for incidents. By implementing automated anomaly detection, you can stay one step ahead of potential data challenges, ensuring you remain well-informed. Datafold’s adaptable machine learning model accommodates seasonal fluctuations and trends in your data, allowing for the establishment of dynamic thresholds tailored to your needs. Streamline your data analysis efforts significantly with the Data Catalog, designed to facilitate the easy discovery of relevant datasets and fields while offering straightforward exploration of distributions through a user-friendly interface. Take advantage of features such as interactive full-text search, comprehensive data profiling, and a centralized metadata repository, all crafted to optimize your data management experience. By utilizing these innovative tools, you can revolutionize your data processes, resulting in enhanced efficiency and improved business outcomes. Ultimately, embracing these advancements will position your organization to harness the full potential of your data assets.
- Previous
- You're on page 1
- Next