-
1
DataHub
DataHub
Revolutionize data management with real-time visibility and flexibility.
Metadata is the framework that holds contemporary data systems together, and how well it is managed largely determines whether your operations are clear or confusing. DataHub delivers enterprise-level metadata management that scales from thousands to millions of entities while remaining fast and easy to use. You can import metadata from over 100 different sources using push or pull methods, standardize it into a cohesive graph model, and access it through high-performance APIs. DataHub's metadata model is designed for extension: you can add custom attributes, entity types, and relationships without modifying the underlying code. Versioning and audit trails let you track how metadata evolves, showing changes in schemas, ownership, and policies over time. DataHub can also propagate metadata automatically across connected entities; for instance, when you tag a dataset, those tags transfer to associated dashboards.
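The tag propagation described above can be pictured as a walk over a graph of related entities. The following is a minimal sketch in plain Python, not DataHub's API: the function `propagate_tags`, the entity names, and the dictionary-based graph are all hypothetical and chosen only for illustration.

```python
from collections import deque


def propagate_tags(edges, tags, source):
    """Copy the tags on `source` to every entity reachable downstream of it.

    `edges` maps an entity to the entities that depend on it (hypothetical model).
    `tags` maps an entity to its set of tags. The input is not modified.
    """
    result = {entity: set(entity_tags) for entity, entity_tags in tags.items()}
    source_tags = set(result.get(source, set()))
    seen = {source}
    queue = deque([source])
    while queue:
        current = queue.popleft()
        for neighbor in edges.get(current, []):
            if neighbor not in seen:
                seen.add(neighbor)
                # Keep any tags the neighbor already has and add the source's tags.
                result.setdefault(neighbor, set()).update(source_tags)
                queue.append(neighbor)
    return result


# Illustrative data only: one dataset feeding two dashboards.
edges = {"dataset:orders": ["dashboard:sales", "dashboard:ops"]}
tags = {"dataset:orders": {"pii"}, "dashboard:sales": {"finance"}}
propagated = propagate_tags(edges, tags, "dataset:orders")
```

In this sketch both dashboards end up carrying the dataset's tag while keeping their own. A real system would presumably also need rules for removing propagated tags and for limiting which relationship types carry them.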
-
2
AnalyticsCreator
AnalyticsCreator
Deliver trusted, production-ready data products faster on Microsoft SQL Server, Synapse, and Fabric
Accelerate your data initiatives with AnalyticsCreator—a metadata-driven data warehouse automation solution purpose-built for the Microsoft data ecosystem. AnalyticsCreator simplifies the design, development, and deployment of modern data architectures, including dimensional models, data marts, data vaults, and blended modeling strategies that combine best practices from across methodologies.
Seamlessly integrate with key Microsoft technologies such as SQL Server, Azure Synapse Analytics, Microsoft Fabric (including OneLake and SQL Endpoint Lakehouse environments), and Power BI. AnalyticsCreator automates ELT pipeline generation, data modeling, historization, and semantic model creation—reducing tool sprawl and minimizing the need for manual SQL coding across your data engineering lifecycle.
Designed for CI/CD-driven data engineering workflows, AnalyticsCreator connects easily with Azure DevOps and GitHub for version control, automated builds, and environment-specific deployments. Whether working across development, test, and production environments, teams can ensure faster, error-free releases while maintaining full governance and audit trails.
Additional productivity features include automated documentation generation, end-to-end data lineage tracking, and adaptive schema evolution to handle change management with ease. AnalyticsCreator also offers integrated deployment governance, allowing teams to streamline promotion processes while reducing deployment risks.
By eliminating repetitive tasks and enabling agile delivery, AnalyticsCreator helps data engineers, architects, and BI teams focus on delivering business-ready insights faster. Empower your organization to accelerate time-to-value for data products and analytical models—while ensuring governance, scalability, and Microsoft platform alignment every step of the way.
-
3
Gearset
Gearset
Defining what great Salesforce DevOps looks like
Gearset offers an all-encompassing solution for overseeing and managing your Salesforce metadata, enabling you to handle everything from comparisons and version control to deployments and reversions. With its robust difference detection tools, change monitoring, and seamless CI/CD integration, Gearset simplifies the management of intricate organizational configurations while ensuring reliability.
Easily compare different organizations or branches to pinpoint specific changes, verify deployments prior to activation, and coordinate release schedules among teams and various environments.
Whether you're managing regular minor updates or orchestrating extensive releases, Gearset's suite of metadata management capabilities—including branching techniques, bundles, and options for rolling back—guarantees precision, consistency, and traceability across all environments.
Establish order and confidence in your Salesforce change management processes, making metadata governance an effortless component of your DevOps practices.
-
4
DvSum
DvSum
Transform data chaos into clarity with advanced AI solutions.
DvSum is an innovative platform driven by AI that simplifies the process for data and analytics teams to uncover, track, and manage their data effectively. By leveraging advanced AI algorithms, DvSum automatically organizes, identifies, and refines your data, presenting it as a comprehensive Data Catalog. With DvSum Data Intelligence at your disposal, your organization can accelerate its journey toward achieving digital transformation and enhanced analytics capabilities. This tool not only streamlines data governance but also empowers teams to make more informed decisions based on accurate insights.
-
5
K2View
K2View
Empower your enterprise with agile, innovative data solutions.
K2View is committed to empowering enterprises to fully utilize their data for enhanced agility and innovation.
Our Data Product Platform facilitates this by generating and overseeing a reliable dataset for each business entity as needed and in real-time. This dataset remains continuously aligned with its original sources, adjusts seamlessly to changes, and is readily available to all authorized users.
We support a variety of operational applications, such as customer 360, data masking, test data management, data migration, and the modernization of legacy applications, enabling businesses to achieve their goals in half the time and at a fraction of the cost compared to other solutions. Additionally, our approach ensures that organizations can swiftly adapt to evolving market demands while maintaining data integrity and security.
-
6
Alation
Alation
Empower decision-making with intelligent, intuitive data recommendations.
The Alation Agentic Data Intelligence Platform brings intelligence, automation, and trust to enterprise data and AI initiatives. Built to unify every aspect of data management, it combines cataloging, governance, search, discovery, lineage, and analytics within a single platform. Its AI-driven agents, including the Documentation Agent, Data Quality Agent, and Data Products Builder, act as intelligent assistants that automate repetitive tasks and scale best practices across organizations. Powered by the Active Metadata Graph and workflow automation, Alation ensures that data is continuously enriched, accurate, and ready for analytics and AI. It creates a marketplace of trusted data products, enabling teams to quickly access, share, and reuse reliable assets. With deep integration capabilities and 120+ pre-built connectors across leading cloud, analytics, and BI platforms, Alation fits seamlessly into modern data ecosystems. Its governance framework helps organizations build trusted AI by ensuring transparency, compliance, and ethical use of data. Businesses benefit from improved efficiency, reduced risk, and the ability to make strategic decisions with confidence. Used by 40% of the Fortune 100, Alation has become a critical enabler of strong data cultures and scalable AI adoption. By combining human expertise with AI-powered automation, it transforms data into a foundation for innovation and growth.
-
7
Clarifai
Clarifai
Empowering industries with advanced AI for transformative insights.
Clarifai stands out as a prominent AI platform adept at processing image, video, text, and audio data on a large scale. By integrating computer vision, natural language processing, and audio recognition, our platform serves as a robust foundation for developing superior, quicker, and more powerful AI applications. We empower both enterprises and public sector entities to convert their data into meaningful insights.
Our innovative technology spans various sectors, including Defense, Retail, Manufacturing, and Media and Entertainment, among others. We assist our clients in crafting cutting-edge AI solutions tailored for applications such as visual search, content moderation, aerial surveillance, visual inspection, and intelligent document analysis. Established in 2013 by Matt Zeiler, Ph.D., Clarifai has consistently been a frontrunner in the realm of computer vision AI, earning recognition by clinching the top five positions in image classification at the prestigious 2013 ImageNet Challenge. With its headquarters located in Delaware, Clarifai continues to drive advancements in AI, supporting a wide array of industries in their digital transformation journeys.
-
8
ER/Studio is an enterprise data modeling and architecture platform that helps organizations design, align, and govern data across complex, distributed environments. It translates business requirements into technical implementation through integrated conceptual, logical, and physical models, creating a consistent foundation for analytics, AI initiatives, modernization, compliance, and operational systems. ER/Studio supports modern data architectures, including data warehouses, lakehouses, data mesh frameworks, and data vault methodologies, ensuring models reflect how platforms are built today. By maintaining clear relationships between definitions and database structures, it establishes a trusted, enterprise-wide view of data.
Collaboration is enabled through a centralized, multi-user repository with version control, role-based access, and parallel development. Teams can work simultaneously while preserving model integrity and full change history. The web-based portal, Team Server, extends visibility beyond architects, allowing business and technical stakeholders to explore models, review metadata, and provide feedback through a browser interface. This shared environment improves transparency and alignment between design and execution.
Governance and standardization are embedded within the modeling process. Business glossaries and data dictionaries link directly to technical objects so approved definitions remain synchronized with implementations. Built-in impact analysis provides visibility into downstream dependencies before changes are deployed, reducing risk and strengthening coordination. Metadata can be synchronized with platforms such as Microsoft Purview and Collibra to enhance lineage visibility, documentation accuracy, and compliance oversight.
Available in Standard, Professional, and Enterprise editions, ER/Studio scales from individual practitioners to enterprise-wide architecture programs with advanced collaboration and governance needs.
-
9
Hackolade
Hackolade
Design, Govern, and Evolve Schemas Across Databases, APIs, and Pipelines
Hackolade Studio is a next-generation data modeling solution designed for today’s diverse and hybrid data environments. Initially created to fill the gap in visual modeling tools for NoSQL, Hackolade has expanded into a multi-model platform supporting a wide range of modern technologies.
It enables agile schema design and governance for both structured and semi-structured data, making it well-suited for teams working across relational databases, NoSQL stores, data warehouses, and streaming systems. Supported technologies include Azure SQL, Oracle, PostgreSQL, SQL Server, MongoDB, Cassandra, DynamoDB, Neo4j, BigQuery, Databricks, Redshift, Snowflake, and Kafka with Confluent Schema Registry, as well as OpenAPI and GraphQL for API modeling.
Hackolade also supports data exchanges stored on AWS S3, Azure Blob Storage, and ADLS Gen1 and Gen2, in formats such as JSON Schema, Avro, Parquet, Protobuf, and YAML. In addition, it integrates with metadata governance tools like Unity Catalog and Collibra. These integrations help organizations maintain compliance, manage lineage, and ensure high data quality across systems.
Key features include forward and reverse engineering, schema versioning, type mapping, and collaborative model design. Whether modeling new systems, documenting legacy databases, or managing API data contracts, Hackolade provides a centralized, visual interface that helps teams design and evolve schemas efficiently.
Enterprises in finance, healthcare, telecom, and retail use Hackolade to support initiatives in data governance, data mesh, API-first development, and cloud migration, making it a key tool in the modern data stack.
-
10
MANTA
Manta
Unlock clarity in data flow for better decision-making.
Manta functions as a comprehensive data lineage platform, acting as the central repository for all data movements within an organization. It is capable of generating lineage from various sources including report definitions, bespoke SQL scripts, and ETL processes. The analysis of lineage is based on real code, allowing for the visualization of both direct and indirect data flows on a graphical interface. Users can easily see the connections between files, report fields, database tables, and specific columns, which helps teams grasp data flows in a meaningful context. This clarity promotes better decision-making and enhances overall data governance within the enterprise.
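Lineage of this kind is often modeled as a graph whose edges point from a data element to the elements it is derived from. The sketch below is an illustration in plain Python, not Manta's implementation. It assumes "direct" means an immediate upstream source and "indirect" means a source reached through intermediate steps, which may differ from the product's own definitions. The function `upstream` and all element names are hypothetical.

```python
def upstream(lineage, target):
    """Return (direct, indirect) upstream sources of `target`.

    `lineage` maps an element to the elements it is derived from.
    """
    direct = set(lineage.get(target, []))
    indirect = set()
    visited = set(direct) | {target}  # guards against cycles back to the target
    stack = list(direct)
    while stack:
        node = stack.pop()
        for parent in lineage.get(node, []):
            if parent not in visited:
                visited.add(parent)
                indirect.add(parent)
                stack.append(parent)
    return direct, indirect


# Illustrative lineage: file -> staging columns -> mart column -> report field.
lineage = {
    "report.revenue": ["mart.sales.amount"],
    "mart.sales.amount": ["staging.orders.amount", "staging.orders.currency"],
    "staging.orders.amount": ["raw/orders.csv"],
}
direct, indirect = upstream(lineage, "report.revenue")
```

Here the report field has one direct source, the mart column, and three indirect ones. Deriving the edges themselves from SQL scripts, report definitions, and ETL code is the hard part and is outside the scope of this sketch.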
-
11
Datameer
Datameer
Unlock powerful insights and streamline your data analysis.
Datameer serves as the essential data solution for examining, preparing, visualizing, and organizing insights from Snowflake. It facilitates everything from analyzing unprocessed datasets to influencing strategic business choices, making it a comprehensive tool for all data-related needs.
-
12
data.world
data.world
Empowering teams to simplify data management for innovation.
data.world is a cloud-based platform meticulously crafted for modern data ecosystems, facilitating effortless management of updates, migrations, and ongoing maintenance. The straightforward setup process is enhanced by a growing array of pre-built integrations compatible with all leading cloud data warehouses. When quick results are paramount, teams should focus on tackling real business issues instead of wrestling with complicated data management tools. data.world streamlines the experience for all users, not just data specialists, equipping them to obtain clear, accurate, and timely responses to a wide range of business questions. Our platform boasts a cloud-native data catalog that links disparate and distributed data to familiar business concepts, creating an accessible, cohesive knowledge base for everyone. In addition to our enterprise offerings, data.world nurtures the largest collaborative open data community worldwide, where participants work together on various projects, including social bot detection and prestigious data journalism endeavors, fostering innovation and collective learning. This vibrant environment not only promotes knowledge sharing but also empowers users to harness data in inventive and meaningful ways, ultimately driving impactful solutions across different sectors.
-
13
Azure Data Catalog
Microsoft
Streamline data discovery, collaboration, and innovation effortlessly today!
In the current environment driven by data, the search for relevant information often takes more time than the analysis itself. Azure Data Catalog acts as a versatile metadata repository tailored for organizations, streamlining the identification of data resources. This fully-managed platform accommodates a diverse array of users—including analysts, data scientists, and developers—allowing them to register, improve, locate, understand, and leverage various data sources. You can utilize your preferred tools, as Data Catalog aids in the discovery and use of the data you need. Your data remains securely stored in your chosen locations, while Data Catalog assists in effortless access and management through a user-friendly interface. By encouraging broad usage and promoting ongoing value creation within the data ecosystem, Data Catalog enables users to exchange insights, advice, and best practices, fostering a collaborative environment where everyone can gain advantages. This solution not only democratizes the discovery of data assets but also ensures that all users can participate in the process meaningfully. Such a strategy enhances cooperation and contributes to a richer overall data landscape within organizations, ultimately driving innovation and informed decision-making.
-
14
erwin Data Intelligence (erwin DI) combines data cataloging with data literacy initiatives to boost awareness and accessibility of data resources, while offering guidance on their proper usage and ensuring compliance with data policies and best practices. It systematically collects, converts, and assembles metadata from a wide array of data sources, business applications, operational workflows, and data models into a unified catalog. This catalog is then made available in an understandable format through role-specific, contextual views, empowering stakeholders to make strategic decisions based on trustworthy insights. Additionally, erwin DI fosters enterprise data governance and supports digital transformation efforts, as well as any projects that rely on data for optimal outcomes. The platform facilitates the scheduling of regular metadata scans from various data sources, simplifying the tracking of data elements from their origin to their final destination, including during transit, and enabling smooth data integration across multiple platforms. Moreover, it equips data consumers to discover and analyze data relevant to their specific roles, thereby enhancing data engagement within the organization. Ultimately, erwin DI acts as a robust solution for maximizing the potential and value extracted from data assets while promoting a culture of data-driven decision-making across all levels of the enterprise. This comprehensive approach ensures that organizations can fully leverage their data capabilities for sustained growth and innovation.
-
15
Facilitate effective data utilization for AI and analytics in a business-centric way through intelligent cataloging, reinforced by proactive governance of metadata and policies. The IBM Watson® Knowledge Catalog emerges as an essential resource for uncovering data, models, and additional assets, significantly improving the self-service exploration experience. Functioning as a cloud-based storage solution for enterprise metadata, it enables the activation of information for applications in AI, machine learning (ML), and deep learning. Users can conveniently access, curate, categorize, and share data and knowledge assets, along with their interrelations, from any location. By effectively organizing, defining, and managing enterprise data, organizations can guarantee that they possess the necessary context to create value for diverse purposes, including meeting regulatory requirements and pursuing data monetization initiatives. Moreover, it maintains data integrity, monitors compliance and audit readiness, and builds client trust through diligent policy management and the adaptive masking of sensitive information. Featuring intuitive dashboards and workflows that can be easily shared with team members or integrated with analytical tools, businesses can efficiently consume and transform data to align with their operational needs. By harnessing these capabilities, organizations can significantly elevate their decision-making processes and foster innovation throughout their operations. Ultimately, this comprehensive approach not only streamlines data management but also empowers teams to respond agilely to market changes.
-
16
Dataedo
Dataedo
Unlock data insights effortlessly with streamlined metadata management solutions.
Effectively uncover, document, and manage your metadata with ease. Dataedo provides a variety of automated metadata scanners that connect with various database technologies, extracting data structures and metadata to fill your metadata repository. With just a few clicks, you can construct a detailed catalog of your data while outlining each element. Simplify table and column names using intuitive aliases, and deepen your comprehension of data assets by including user-defined descriptions and custom fields. Utilize sample data to enhance your understanding of the contents within your data assets, allowing you to better evaluate the information before use and ensure its quality. Uphold high data standards through effective data profiling methods. Promote widespread access to data knowledge throughout your organization. By enhancing data literacy and democratizing access, you empower every member of your organization to utilize data more efficiently with an easy-to-use on-premises data catalog solution. Ultimately, a well-organized data catalog plays a crucial role in nurturing data literacy, which will lead to more informed decision-making processes across all levels of your organization. This collective knowledge can drive innovation and foster a data-driven culture.
-
17
neptune.ai
neptune.ai
Streamline your machine learning projects with seamless collaboration.
Neptune.ai is a powerful platform designed for machine learning operations (MLOps) that streamlines the management of experiment tracking, organization, and sharing throughout the model development process. It provides an extensive environment for data scientists and machine learning engineers to log information, visualize results, and compare different model training sessions, datasets, hyperparameters, and performance metrics in real-time. By seamlessly integrating with popular machine learning libraries, Neptune.ai enables teams to efficiently manage both their research and production activities. Its diverse features foster collaboration, maintain version control, and ensure the reproducibility of experiments, which collectively enhance productivity and guarantee that machine learning projects are transparent and well-documented at every stage. Additionally, this platform empowers users with a systematic approach to navigating intricate machine learning workflows, thus enabling better decision-making and improved outcomes in their projects. Ultimately, Neptune.ai stands out as a critical tool for any team looking to optimize their machine learning efforts.
-
18
JPedal
IDR Solutions
Effortlessly master PDFs in Java with minimal code.
JPedal simplifies the process of handling PDF files in Java, allowing developers to accomplish common tasks with just a few lines of code. For over two decades, IDR Solutions has been dedicated to enhancing this software, ensuring it can effectively address challenging PDF issues. It fully supports the PDF 2.0 specification, incorporating features like Encryption and Blending, Forms and Annotations, as well as PostScript and OpenType fonts. The library is rich with sample code and APIs that fit into your applications, enabling feature additions with merely two or three lines of code. JPedal utilizes its proprietary font engine and custom image libraries to deliver superior image quality while maximizing performance for Java applications. The development of JPedal is ongoing, with nightly builds and monthly updates ensuring users have access to the latest improvements. Furthermore, the same team that develops the code is also available to provide support, ensuring a cohesive experience for users. This commitment to development and support makes JPedal a reliable choice for Java developers dealing with PDF functionalities.
-
19
Decube
Decube
Empowering organizations with comprehensive, trustworthy, and timely data.
Decube is an all-encompassing platform for data management tailored to assist organizations with their needs in data observability, data cataloging, and data governance. By delivering precise, trustworthy, and prompt data, our platform empowers organizations to make more informed decisions.
Our tools for data observability grant comprehensive visibility throughout the data lifecycle, simplifying the process for organizations to monitor the origin and movement of data across various systems and departments. Featuring real-time monitoring, organizations can swiftly identify data incidents, mitigating their potential disruption to business activities.
The data catalog segment of our platform serves as a unified repository for all data assets, streamlining the management and governance of data access and usage within organizations. Equipped with data classification tools, organizations can effectively recognize and handle sensitive information, thereby ensuring adherence to data privacy regulations and policies.
Moreover, the data governance aspect of our platform offers extensive access controls, allowing organizations to oversee data access and usage with precision. Our capabilities also enable organizations to produce detailed audit reports, monitor user activities, and substantiate compliance with regulatory standards, all while fostering a culture of accountability within the organization. Ultimately, Decube is designed to enhance data management processes and facilitate informed decision-making across the board.
-
20
Inferyx
Inferyx
Unlock seamless growth with innovative, integrated data solutions.
Break away from the constraints of isolated applications, excessive budgets, and antiquated skill sets by utilizing our cutting-edge data and analytics platform to boost growth. This advanced platform is specifically designed for efficient data management and comprehensive analytics, enabling smooth scaling across diverse technological landscapes. Its innovative architecture is built to understand the movement and transformation of data throughout its lifecycle, which lays the groundwork for developing resilient enterprise AI applications capable of enduring future obstacles. With a highly modular and versatile design, our platform supports a wide array of components, making integration a breeze. The multi-tenant architecture is intentionally crafted to enhance scalability. Moreover, sophisticated data visualization tools streamline the analysis of complex data structures, fostering the development of enterprise AI applications in a user-friendly, low-code predictive environment. Built on a distinctive hybrid multi-cloud framework that employs open-source community software, our platform is not only adaptable and secure but also cost-efficient, making it the perfect option for organizations striving for efficiency and innovation. Additionally, this platform empowers businesses to effectively leverage their data while simultaneously promoting teamwork across departments, nurturing a culture that prioritizes data-informed decision-making for long-term success.
-
21
Dataplex Universal Catalog is a pay-as-you-go governance solution designed to unify how organizations manage, discover, and govern their data and AI assets. It combines technical, operational, and business metadata in one catalog, enabling transparency and consistency across projects and regions. AI-driven features such as tailored data insights and semantic search help uncover hidden patterns and speed up decision-making. The platform integrates deeply with Vertex AI, allowing users to instantly locate datasets, AI models, and related artifacts while adhering to IAM permissions. With automated lineage, profiling, and quality checks, teams can ensure compliance and maintain trusted data pipelines. Dataplex Universal Catalog also empowers organizations to build decentralized data meshes by logically organizing data into business domains. Its premium tier unlocks advanced exploration, profiling, and quality assessment for complex governance scenarios. For analytics teams, BigQuery integration provides end-to-end governance directly within the warehouse environment. For open data architectures, BigLake integration ensures consistent governance across Iceberg-based lakehouses. Overall, Dataplex Universal Catalog enables enterprises to balance accessibility with control, democratizing data insights while safeguarding trust and compliance.
-
22
Collate
Collate
Empowering data teams with automated discovery and governance.
Collate is an AI-driven metadata platform designed to provide data teams with automated tools for tasks like discovery, observability, quality, and governance, utilizing efficient agent-based workflows. Built on OpenMetadata, it boasts a unified metadata graph and includes more than 90 seamless connectors that facilitate the collection of metadata from diverse sources, including databases, data warehouses, BI tools, and data pipelines. The platform ensures data integrity by offering in-depth column-level lineage and data profiling, along with no-code quality tests. AI agents are essential for optimizing functions such as data discovery, permission-based querying, alert notifications, and large-scale incident management workflows. In addition, the platform features real-time dashboards, interactive analyses, and a collaborative business glossary that is beneficial to both technical and non-technical users, enhancing the management of valuable data assets. Its automated governance and continuous monitoring uphold compliance with regulations like GDPR and CCPA, significantly cutting down the time required to address data issues while lowering the total cost of ownership. This holistic strategy not only boosts operational efficiency but also promotes a culture of data stewardship within the organization, encouraging all stakeholders to prioritize data quality and governance. Ultimately, Collate empowers teams to harness the full potential of their data assets effectively.
-
23
PoolParty
Semantic Web Company
Unlock smart solutions with advanced semantic data integration.
Integrate a state-of-the-art Semantic AI platform to develop smart applications and systems. Employ PoolParty to optimize the generation of metadata, which ensures that information is readily available for utilization, sharing, and analysis. By effectively linking unstructured and structured data, PoolParty connects various databases and disparate data sources seamlessly. Experience the benefits of sophisticated graph-based data and content analytics, driven by leading machine learning techniques. Make the most of your data with PoolParty, as it improves its quality, leading to more precise outcomes from AI applications and enhanced decision-making abilities. Understand why top global companies are embracing Knowledge Graphs and consider how your organization can benefit as well. Engage with experts, collaborators, and client demonstrations to fully realize the potential of semantic technologies and comprehensive perspectives. We have successfully guided over 180 enterprise clients in navigating the challenges of information management, promoting a more streamlined data environment. By adopting these cutting-edge solutions, you can maintain a competitive edge in an ever-evolving digital landscape while ensuring your organization is equipped for future challenges. Stay proactive and forward-thinking to thrive in this dynamic technological era.
-
24
Ataccama ONE
Ataccama
Transform your data management for unparalleled growth and security.
Ataccama offers a transformative approach to data management, significantly enhancing enterprise value. By integrating Data Governance, Data Quality, and Master Data Management into a single AI-driven framework, it operates seamlessly across both hybrid and cloud settings. This innovative solution empowers businesses and their data teams with unmatched speed, all while maintaining trust, security, and governance over their data assets. As a result, organizations can make informed decisions with confidence, ultimately driving better outcomes and fostering growth.
-
25
Atlan
Atlan
Transform your data experience with effortless discovery and governance.
Welcome to the modern data workspace, where discovering all your data assets, from tables to business intelligence reports, is made incredibly easy. Our sophisticated search technology, combined with an intuitive browsing interface, makes finding the correct asset straightforward. Atlan helps identify low-quality data by automatically creating data quality profiles, which let users quickly recognize existing issues. With capabilities such as automatic detection of variable types, analysis of frequency distributions, identification of missing values, and detection of outliers, Atlan addresses every facet of data quality management. This platform streamlines the complexities associated with effectively governing and managing your data ecosystem. Furthermore, Atlan’s smart bots scrutinize SQL query histories to create data lineage maps and pinpoint personally identifiable information (PII), facilitating the development of dynamic access policies and ensuring robust governance. In addition, those who lack a technical background can easily conduct queries across multiple data lakes, warehouses, and databases thanks to our user-friendly, Excel-like query builder. Seamless integrations with popular tools like Tableau and Jupyter further enhance collaboration around data, changing the way teams work together and share insights. This comprehensive strategy not only empowers users but also cultivates a more data-driven culture across organizations, encouraging informed decision-making at every level. Ultimately, Atlan revolutionizes the way organizations interact with their data, paving the way for greater innovation and efficiency.
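A data quality profile of the kind described above typically summarizes one column at a time. The sketch below is a generic illustration using only the Python standard library, not Atlan's implementation. The function `profile_column`, the sample values, and the choice of the 1.5 × IQR rule for outliers are assumptions made for this example.

```python
from collections import Counter
from statistics import quantiles


def profile_column(values):
    """Summarize a numeric column: missing count, value frequencies, outliers.

    Missing values are represented as None. Outliers are flagged with the
    common 1.5 * IQR rule, which is one convention among several.
    """
    present = [v for v in values if v is not None]
    missing = len(values) - len(present)
    frequencies = Counter(present)

    outliers = []
    if len(present) >= 2:  # quantiles() needs at least two data points
        q1, _, q3 = quantiles(present, n=4)
        iqr = q3 - q1
        low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
        outliers = [v for v in present if v < low or v > high]

    return {"missing": missing, "frequencies": frequencies, "outliers": outliers}


# Illustrative column with one missing value and one suspicious reading.
column = [10, 12, 11, 13, 12, None, 11, 95]
profile = profile_column(column)
```

For this sample the profile reports one missing value and flags 95 as an outlier. A production profiler would also infer column types and handle non-numeric data, which this sketch leaves out.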