Solutions Review’s listing of the best data catalog tools is an annual mashup of products that best represent current market conditions, according to the crowd. Our editors selected the best data catalog tools and software based on each solution’s Authority Score; a meta-analysis of real user sentiment through the web’s most trusted business software review sites and our own proprietary five-point inclusion criteria.
The editors at Solutions Review have developed this resource to assist buyers in search of the best data catalog tools to fit then needs of their organization. Choosing the right vendor and solution can be a complicated process — one that requires in-depth research and often comes down to more than just the solution and its technical capabilities. To make your search a little easier, we’ve profiled the best data catalog tools and software all in one place. We’ve also included platform and product line names and introductory software tutorials straight from the source so you can see each solution in action.
Note: Companies are listed in alphabetical order.
Tool: Aginity Pro
Related products: Aginity Team
Description: Aginity offers an active analytics catalog that lets users and organizations write, save and organize their analytic code. When saving code to a catalog, developers can put a title, description and other metadata around their code so it’s easy to understand the intent and context of what the code is trying to do. All of the analytic code can then be shared with others by providing either view or edit access. Every object saves in the catalog is an object that can be referenced in the code editor for execution with simple syntax.
Tool: Alation Data Catalog
Description: Alation is a complete repository for enterprise data, providing a single point of reference for business glossaries, data dictionaries, and Wiki articles. The product profiles data and monitors usage to ensure that users have accurate insight into data accuracy. Alation also provides insight into how users are creating and sharing information from raw data. Customers tout the product for its expansive partner ecosystem, and Alation has focused on increasing data literacy when metadata is distributed across business and IT.
Tool: Alex Data Marketplace
Related products: Alex Scanner Marketplace
Description: Alex Solutions is a technology agnostic unified enterprise data catalog. It features a business glossary that enables users to define and maintain key business terms and link them to physical data assets, processes, and outputs. Policy-driven data quality combines data lineage with data profiling and machine learning-based intelligent tagging. Alex also offers intelligent tagging that helps users add business context to physical data assets. Deployment and integration are simple, and the product’s user interface is friendly to business users.
Tool: Alteryx Connect
Related products: Alteryx Designer, Alteryx Server, Alteryx Promote
Description: Alteryx data cataloging is available through Alteryx Connect. The product centralizes business terms and definitions, metrics, and information assets for discoverability and collaboration. Connect lets users discover the types of information their data contains, where the information comes from, who is using it, and how it is used. The tool features powerful search to find and reuse information in analytic apps, workflows, macros, visualizations, dashboards, and data science models as well.
Description: Cambridge Semantics offers a data discovery and integration platform called Anzo that lets users find, connect and blend data. Anzo connects to both internal and external data sources including cloud or on-prem data lakes. The product also features data cataloging that utilizes graph models encoding a Semantic Layer that describes data in business context. Users can add Data Layers for data cleansing, transformation, semantic model alignment, relationship linking, and access control as well.
Tool: Collibra Catalog
Related products: Collibra Platform, Collibra Privacy & Risk
Description: Collibra’s Data Dictionary documents an organization’s technical metadata and how it is used. It describes the structure of a piece of data, its relationship to other data, and its origin, format, and use. The solution serves as a searchable repository for users who need to understand how and where data is stored and how it can be used. Users can also document roles and responsibilities and utilize workflows to define and map data. Collibra is unique because the product was built with business end-users in mind.
Tool: Cloudera Navigator
Related products: Cloudera Data Platform, Cloudera Data Catalog
Description: Cloudera Navigator is a data governance solution for Hadoop that provides data discovery, continuous optimization, audit, lineage, metadata management, and policy enforcement. The product lets users explore and tag data through a search-based interface. Navigator consolidates metadata and supports custom tags and comments as well, and it’s easy to track, classify, and locate data to comply with business governance and compliance. Cloudera Navigator is a part of Cloudera Enterprise.
Tool: Denodo Platform
Description: The Denodo Platform offers data virtualization for joining multistructured data sources from database management systems, documents, and a wide variety of other big data, cloud, and enterprise sources. Connectivity support includes relational databases, legacy data, flat files, CML, packed applications, and emerging data types including Hadoop. The tool features a dynamic data catalog for accessing data via a searchable, contextualized interface.
Tool: erwin Data Catalog
Related products: erwin Data Intelligence Suite, erwin Data Governance, erwin Data Literacy, erwin EDGE Portfolio
Description: erwin offers a unified software platform for combining data governance, enterprise architecture, business process, and data modeling. The product is delivered as a managed service that allows users to discover and harvest data, as well as structure and deploy data sources by connecting physical metadata to specific business terms and definitions. erwin imports metadata from data integration tools, as well as cloud-based platforms, and can evaluate complex lineages across systems and use cases.
Tool: Watson Knowledge Catalog
Related products: IBM InfoSphere Information Server, IBM InfoSphere Information Governance Catalog
Description: IBM Watson Catalog provides AI-assisted self-service discovery of data, machine learning models and more. The product lets users access, curate, categorize and share data, knowledge assets and their relationships, regardless of where the data resides. Key capabilities include real-time data virtualization support, automated metadata generation, dynamic data masking, and automated scanning and risk assessments of unstructured data via Watson Knowledge Catalog InstaScan.
Tool: Infogix Data360 Govern
Description: Infogix offers a suite of integrated data governance capabilities that include business glossaries, data cataloging, data lineage, and metadata management. The tool also provides customizable dashboards and zero-code workflows that adapt as each organizational data capability matures. Reference customers use Infogix for data governance and for risk, compliance and data value management. The product is also flexible and easy to use, and supports smaller data analysis jobs as well.
Tool: Informatica Enterprise Data Catalog
Related products: Informatica Intelligent Data Platform, Informatica Metadata Manager, Informatica Business Glossary, Informatica [email protected]
Description: Informatica Enterprise Data Catalog is a machine learning-based data catalog that lets you classify and organize data assets across any environment. The product also provides a metadata system of record for the enterprise. Enterprise Data Catalog automatically scans and catalogs data, indexing it for organization-wide discovery via a Google-like search engine. Key features include data provisioning, end-to-end data lineage, integrated data quality, data relationships and recommendations, and even a Tableau extension.
Tool: Oracle Cloud Infrastructure Data Catalog
Related products: Oracle Enterprise Metadata Management
Description: Oracle Cloud Infrastructure Data Catalog is a metadata management service that helps organizations find and govern data using an organized inventory of data assets. The product features a modern, intuitive user interface that includes a simple dashboard, search-and-browse capabilities, recommended actions, and shortcuts. Oracle Cloud Infrastructure Data Catalog is included with an Oracle Cloud Infrastructure subscription.
Tool: Qlik Catalog (Qlik Data Catalyst)
Related products: QlikView, Qlik Sense, Qlik Data Integration Platform
Description: Qlik Catalog builds a secure, enterprise catalog of all the data your organization has available for analytics, regardless of its physical location. The product features automated data preparation and metadata tools to streamline the transformation of raw data as well. The tool includes a self-service data marketplace that lets users “shop” for the data they need and export, share or automatically publish data sets to Qlik Sense and other analytic tools and applications.
Tool: SAP Data Intelligence
Related products: SAP Data Warehouse Cloud
Description: SAP Data Intelligence is an AI-powered data management solution that includes data orchestration, machine learning, and metadata management. The product lets users discover and connect multiple data types regardless of where they reside physically, as well as refine and reuse audio, image, and video streams and data from devices based on the IoT. Users can optimize governance and compliance with built-in metadata management rules, and orchestrate modular data pipelines across distributed architectures.
Tool: Tableau Catalog
Related products: Tableau Desktop, Tableau Server, Tableau Online, Tableau Prep, Tableau Data Management
Description: Tableau Catalog provides a complete picture of the data and how it is connected to the analytics in the Tableau environment. The product automatically ingests all of these assets into one central list so users can quickly see all the tables, files and databases in one place. Metadata and context is made available when data is connected so users can ensure they are using the correct data for analysis. Metadata and REST APIs bring the metadata to Tableau for analysis.
Tool: Talend Data Catalog
Related products: Talend Open Studio, Talend Data Fabric, Talend Data Management Platform, Talend Data Preparation, Talend Big Data Platform, Talend Data Services Platform, Talend Integration Cloud, Talend Stitch Data Loader
Description: Talend Data Catalog automatically crawls, profiles, organizes, links, and enriches metadata. Up to 80 percent of information associated with the data is documented automatically and kept up-to-date through smart relationships and machine learning. Data Catalog key features include faceted search, data sampling, semantic discovery. categorization, and auto-profiling. The tool also includes social curation and data relationship discovery and certification, as well as a suite of design and productivity tools.
Tool: Unifi Data Catalog
Related products: Unifi Data Platform
Description: Unifi was founded by data and enterprise infrastructure experts from Greenplum. Unifi’s data catalog provides user the ability to easily search and discover data regardless of where it lives and irrespective of its structure using natural language search. It also includes AI-powered data discovery out-of-box with auto-generated recommendations so users can view and explore datasets. Unifi also enables users to deconstruct TWBX files and see the fill lineage of a data source to see how datasets were transformed.
Tool: Zaloni Arena
Description: Zaloni Arena operationalizes data along the entire pipeline, from data source to consumer. The product automates repeatable data management tasks and processes and provides central management of all enterprise data sources whether on-prem, cloud, multi-cloud, or hybrid. Zaloni is compatible with all major Hadoop distributions, most data processing engines, and applicable deployment models.
Tim is Solutions Review’s Editorial Director and leads coverage on big data, business intelligence, and data analytics. A 2017 and 2018 Most Influential Business Journalist and 2021 “Who’s Who” in data management and data integration, Tim is a recognized influencer and thought leader in enterprise business software. Reach him via tking at solutionsreview dot com.