List of OpenMetadata Integrations
This is a list of platforms and tools that integrate with OpenMetadata. This list is updated as of June 2026.
1
Oracle Cloud Infrastructure
Oracle
Empower your digital transformation with cutting-edge cloud solutions.
Oracle Cloud Infrastructure supports both traditional workloads and modern cloud development tools. Its architecture is designed to detect and address modern security threats, and it combines cost-effectiveness with strong performance to lower total cost of ownership. Oracle describes it as a Generation 2 enterprise cloud, with high-performance compute and networking and a broad range of infrastructure and platform cloud services. It targets mission-critical applications, so businesses can keep legacy workloads running while building for the future. The Generation 2 Cloud also runs Oracle Autonomous Database, which Oracle markets as the industry's first and only self-driving database. Beyond infrastructure, Oracle Cloud offers services for application development, business analytics, data management, integration, security, artificial intelligence, and blockchain.
2
PostgreSQL
PostgreSQL Global Development Group
Dependable, feature-rich database system for performance and security.
PostgreSQL is a robust open-source object-relational database system that has been under continuous development for over thirty years and has a strong reputation for dependability, rich features, and performance. The official documentation covers installation and usage in depth and serves newcomers and seasoned users alike. The open-source community also maintains forums and other venues where users can learn more about PostgreSQL, explore its capabilities, and find job openings. In November 2022, the PostgreSQL Global Development Group released updates for all then-supported versions (15.1, 14.6, 13.9, 12.13, 11.18, and 10.23), fixing 25 bugs reported in the preceding months. That update was the final release for PostgreSQL 10, which no longer receives security patches or bug fixes. If you still run PostgreSQL 10 in production, plan an upgrade to a supported version to keep receiving fixes and to benefit from newer features, better performance, and compatibility with modern applications.
3
Amazon Kinesis
Amazon
Capture, analyze, and react to streaming data instantly.
Amazon Kinesis collects, processes, and analyzes video and data streams in real time, so users can derive insights and react to new information quickly. It is positioned as a cost-effective way to handle streaming data at any scale, with the flexibility to choose the tools that best suit an application's requirements. Kinesis can ingest real-time data such as video, audio, application logs, website clickstreams, and IoT telemetry for machine learning, analytics, and other applications. Data is processed and analyzed as it arrives rather than after all of it has been collected, so insights become available in seconds or minutes instead of hours or days.
4
AWS Storage Gateway
Amazon
Transform your storage strategy with seamless hybrid cloud integration.
AWS Storage Gateway is a hybrid cloud storage service that gives on-premises applications access to virtually unlimited cloud storage. Customers use it to simplify storage management and reduce costs in hybrid cloud scenarios such as moving tape backups to the cloud, reducing on-premises storage with cloud-backed file shares, and giving on-premises applications fast access to data in AWS, in support of migration, archiving, processing, and disaster recovery. The service offers three gateway types: Tape Gateway, File Gateway, and Volume Gateway. Each connects local applications to cloud storage and caches data locally for quick access. Applications connect through a virtual machine or a hardware gateway appliance using standard storage protocols such as NFS, SMB, and iSCSI.
5
Presto
Presto
Revolutionize dining with seamless, safe, contactless solutions today!
Presto offers a Contactless Dining Solution with no monthly fee. The company describes itself as the largest global provider of contactless dining technology, with over 100 million monthly active users and more than 300,000 systems shipped. The solution lets restaurants offer an end-to-end contactless dining experience: guests can view the full menu, order, and pay at the table without physical contact. Restaurants that sign up can go fully contactless within three days, pay no ongoing fees (standard payment processing charges still apply), and do not need to change their existing POS system. The solution is available worldwide, though supplies are limited due to high demand. Presto states that it leads the contactless dining sector in both the U.S. and Europe.
6
Delta Lake
Delta Lake
Transform big data management with reliable ACID transactions today!
Delta Lake is an open-source storage layer that brings ACID transactions to Apache Spark™ and big data workloads. In a conventional data lake, multiple pipelines read and write data concurrently, and data engineers spend considerable effort preserving data integrity because there is no transactional support. Delta Lake adds ACID transactions to the data lake and provides serializability, the strongest isolation level. For details, see "Diving into Delta Lake: Unpacking the Transaction Log". In big data, even the metadata can be large, so Delta Lake treats metadata like data and uses Spark's distributed processing to manage it. As a result, it can handle petabyte-scale tables with billions of partitions and files. Delta Lake also provides snapshots of data, so developers can access and revert to earlier versions for audits, rollbacks, or reproducing experiments.
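The transaction-log and snapshot idea described above can be illustrated with a toy, in-memory model. This is a sketch under stated assumptions, not Delta Lake's implementation or API: the `ToyVersionedTable` class, its methods, and the file names are hypothetical. Each commit records the files it adds and removes, and replaying the log up to a given version yields that version's snapshot.

```python
from dataclasses import dataclass, field


@dataclass
class ToyVersionedTable:
    """Toy append-only commit log (hypothetical; NOT Delta Lake's real format)."""

    # Each commit is a pair: (files added, files removed).
    commits: list = field(default_factory=list)

    def commit(self, add=(), remove=()):
        """Record one commit and return its version number."""
        self.commits.append((frozenset(add), frozenset(remove)))
        return len(self.commits) - 1

    def snapshot(self, version=None):
        """Replay the log up to `version` (inclusive) to get the live file set."""
        if version is None:
            version = len(self.commits) - 1
        live = set()
        for added, removed in self.commits[: version + 1]:
            live -= removed
            live |= added
        return live


table = ToyVersionedTable()
v0 = table.commit(add=["part-000.parquet"])
v1 = table.commit(add=["part-001.parquet"])
v2 = table.commit(add=["part-002.parquet"], remove=["part-000.parquet"])

latest = table.snapshot()        # state after the last commit
as_of_v1 = table.snapshot(v1)    # "time travel" to an earlier version
```

Because old commits are never rewritten, reading an earlier version only means replaying fewer log entries, which is what makes audits and rollbacks cheap in this model.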
7
MLflow
MLflow
Streamline your machine learning journey with effortless collaboration.
MLflow is an open-source platform for managing the machine learning lifecycle, including experimentation, reproducibility, deployment, and a central model registry. It consists of four components: tracking experiments to record and compare code, data, configuration, and results; packaging data science code so that it runs consistently across environments; deploying machine learning models in diverse serving environments; and a central repository for storing, annotating, discovering, and managing models. MLflow Tracking provides an API and a UI for logging parameters, code versions, metrics, and output files when running machine learning code, and for visualizing the results later. Experiments can be logged and queried through the Python, REST, R, and Java APIs. An MLflow Project is a format for organizing data science code in a reusable and reproducible way, based primarily on conventions. The Projects component also includes an API and command-line tools for running projects.
8
Glue
Glue
Experience immersive collaboration that feels like being together.
However far apart team members are, Glue's virtual spaces support interactions that resemble meeting in person. Each participant is represented by a 3D avatar that tracks their head and hand movements, which conveys the non-verbal cues that typical video conferencing lacks. Spatial audio uses 3D directional sound whose volume falls off with distance, so users can tell where others are and how close they are. Together, these features aim to make discussions in Glue feel as natural as they would in a physical room and to make remote teamwork more productive.
9
Amundsen
Amundsen
Transform data chaos into clarity for impactful insights.
Amundsen helps users find and trust data for analysis and modeling, and breaks down information silos to improve productivity. Users get immediate context about a dataset and can see how colleagues are using it. Data across the organization is searchable through a simple text search. A PageRank-inspired search algorithm recommends results based on names, descriptions, tags, and user activity on tables and dashboards. Automated and curated metadata builds trust in data: descriptions of tables and columns, frequent users, last-updated timestamps, statistics, and, where permitted, data previews. Links to the ETL jobs and code that generated a dataset make data management more efficient. Clear table and column descriptions reduce back-and-forth about which data to use and what a column means. Users can see which datasets their peers frequently use, own, or have bookmarked, and can learn the common queries for a table by looking at dashboards built on it.
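To make the "PageRank-inspired" ranking concrete, here is a minimal power-iteration PageRank over a small usage graph. It is only a sketch of the general technique, not Amundsen's actual ranking code: the `pagerank` function, the damping factor, and the table and dashboard names are all assumptions chosen for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict {node: [nodes it points to]}."""
    nodes = set(links) | {t for targets in links.values() for t in targets}
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Every node starts each round with the "teleport" share.
        new_rank = {node: (1.0 - damping) / n for node in nodes}
        for node in nodes:
            targets = links.get(node, [])
            if targets:
                share = damping * rank[node] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # Dangling node: spread its rank evenly over all nodes.
                share = damping * rank[node] / n
                for target in nodes:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical usage graph: an edge a -> b means "a reads from b".
usage = {
    "dash_sales": ["orders", "customers"],
    "dash_ops": ["orders"],
    "orders": ["customers"],
    "customers": [],
}
scores = pagerank(usage)
ranked = sorted(scores, key=scores.get, reverse=True)
```

In this toy graph the most referenced tables rise to the top of `ranked`, which is the intuition behind boosting heavily used datasets in search results.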
10
Apache Superset
Apache
Unlock powerful data insights with seamless exploration and visualization.
Superset is an intuitive platform for data exploration and visualization that serves users of all skill levels, with options ranging from simple line charts to complex geospatial visualizations. It connects to any SQL-compatible data source through SQLAlchemy, including modern cloud-native databases and engines that operate at petabyte scale. Superset is lightweight and scalable, and it uses existing data infrastructure without requiring an additional ingestion layer.
11
Apache NiFi
Apache Software Foundation
Effortlessly streamline data workflows with unparalleled flexibility and control.
Apache NiFi is an easy-to-use, powerful, and reliable system for processing and distributing data. It supports scalable directed graphs of data routing, transformation, and system mediation logic. A web-based user interface combines design, control, feedback, and monitoring. NiFi is highly configurable: flows can be tuned between loss tolerance and guaranteed delivery and between low latency and high throughput, with dynamic prioritization, back pressure, and the ability to modify flows at runtime. Data provenance tracks each piece of data from beginning to end. NiFi is designed for extension, so users can build their own processors and develop and test them quickly. Security features include SSL, SSH, HTTPS, and encrypted content, along with multi-tenant authorization and internal policy management. NiFi consists of several web applications, including a web UI, an API, and custom UIs, so mappings to the root path must be configured.
12
OpenSearch
OpenSearch
Empower your data journey with secure, customizable analytics.
OpenSearch is a community-driven, open-source search and analytics suite derived from the Apache 2.0 licensed Elasticsearch 7.10.2 and Kibana 7.10.2. It consists of a search engine daemon, OpenSearch, and a visualization and user interface, OpenSearch Dashboards. Users can ingest, secure, search, aggregate, visualize, and analyze data for use cases such as application search and log analytics. Because it is open source, users can modify, extend, monetize, and resell it as they see fit. The project aims to provide a secure, high-quality search and analytics suite with a roadmap of new features, and it relies on community contributions for ongoing development.
13
Apache Pinot
Apache Software Foundation
Optimize OLAP queries effortlessly with low-latency performance.
Pinot is designed to answer OLAP queries with low latency on static data. It supports pluggable indexing techniques such as sorted, bitmap, and inverted indexes. Joins are not currently supported, but this can be worked around by running queries through Trino or PrestoDB. A SQL-like syntax supports selection, aggregation, filtering, grouping, ordering, and distinct queries. Pinot has both offline and real-time tables; real-time tables fill the gaps where offline data is not yet available. Users can also customize anomaly detection and notification flows to identify the anomalies that matter.
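The inverted index mentioned above can be sketched in a few lines. This is a generic illustration of the data structure, not Pinot's implementation: the `build_inverted_index` helper, the `events` rows, and the column names are hypothetical. The index maps each distinct column value to the ids of the rows that contain it, so a filter can jump straight to matching rows instead of scanning every row.

```python
from collections import defaultdict


def build_inverted_index(rows, column):
    """Map each distinct value of `column` to the list of row ids holding it."""
    index = defaultdict(list)
    for row_id, row in enumerate(rows):
        index[row[column]].append(row_id)
    return dict(index)


# Hypothetical "events" table.
rows = [
    {"country": "US", "clicks": 3},
    {"country": "DE", "clicks": 5},
    {"country": "US", "clicks": 7},
    {"country": "FR", "clicks": 1},
]
country_index = build_inverted_index(rows, "country")

# Roughly: SELECT SUM(clicks) FROM events WHERE country = 'US'
matching_ids = country_index.get("US", [])
total_clicks = sum(rows[i]["clicks"] for i in matching_ids)
```

Here the lookup touches only the two matching rows; a production index would store row ids in compressed bitmaps rather than Python lists.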
14
SQLAlchemy
SQLAlchemy
"Empower your database interactions with unmatched adaptability and efficiency."SQLAlchemy is a powerful Python library that functions as both a toolkit for SQL and an object-relational mapper, giving developers the ability to leverage SQL's full potential with remarkable adaptability. As SQL databases grow in size and performance demands, they often shift away from being mere collections of objects; similarly, emphasizing abstraction can cause these object collections to lose their traditional structure of tables and rows. SQLAlchemy aims to reconcile these contrasting ideas effectively. It perceives the database not just as a compilation of tables, but as a relational algebra engine, allowing for the selection of rows from tables, joins, and a variety of select statements that can be combined into more sophisticated queries. The expression language of SQLAlchemy is founded on this principle, significantly enhancing its capabilities. Furthermore, SQLAlchemy is well-known for its optional object-relational mapper (ORM) feature, which applies the data mapper pattern and offers a solid framework for seamless database interactions. This combination of functionalities positions SQLAlchemy as an adaptable tool suited for both straightforward and complex database operations, ensuring that developers can efficiently manage their data needs. Ultimately, SQLAlchemy empowers users to interact with databases in a way that is both intuitive and effective. -
15
LDAP
LDAP
Unlock LDAP's potential with essential resources and insights.
This entry covers an information resource on the Lightweight Directory Access Protocol (LDAP), a flexible, standards-compliant protocol for communicating with directory servers. LDAP is commonly used for authentication and for managing information about users, groups, and applications, but an LDAP directory server is an adaptable data store that can serve many applications. The resource explains directory services and the details of the protocol, and helps users find a suitable directory server, client API, or LDAP tool for their environment. It links to standards documents and reference material for readers who want a deeper understanding of the protocol, and it publishes articles about directory services, including software release updates, new standards documents and specifications, and practical guides.
16
MariaDB
MariaDB
Empowering enterprise data management with versatility and scalability.
MariaDB Platform is an enterprise open-source database solution. It handles transactional, analytical, and hybrid workloads and supports both relational and JSON data. It scales from single databases to data warehouses and fully distributed SQL systems capable of millions of transactions per second and interactive analytics on large datasets. MariaDB can be deployed on standard hardware and on the major public clouds, and is also available as MariaDB SkySQL, a fully managed cloud database. More information is available at MariaDB.com.
17
Flink
Flink
Fresh groceries delivered in 10 minutes, sustainably and affordably!
Flink delivers fresh groceries, including organic produce, to customers' homes in 10 minutes at prices comparable to traditional supermarkets. The service operates in all major cities across Germany and has begun expanding into select regions of the Netherlands and France; delivery zones can be checked in the app, and more cities are planned. If fast packing results in a wrong item, customers can report it through the Support feature in the app. Delivery runs Monday through Saturday from 8 am to 11 pm and covers everything from fresh fruit and vegetables to snacks. Payment is handled in the app with a range of online payment methods. Delivery hubs are located in high-density urban areas, and couriers use electric bicycles.
18
Apache Airflow
The Apache Software Foundation
Effortlessly create, manage, and scale your workflows!
Airflow is a community-built, open-source platform for programmatically authoring, scheduling, and monitoring workflows. It has a flexible architecture and uses a message queue to orchestrate an arbitrary number of workers, so it can scale out as workloads grow. Pipelines are defined in Python, which allows workflows to be generated dynamically from code. Users can define their own operators and extend libraries to fit the level of abstraction their environment needs. Pipelines are lean and explicit, with parametrization built in through the Jinja templating engine. There is no need for complex command-line instructions or XML configuration: workflows use standard Python features, including date and time formats for scheduling and loops for generating tasks dynamically.