Skip to main content
 
Data Mesh

What is Data Mesh?

The traditional data infrastructure is built around a single, monolithic source of all enterprise data, be it a data warehouse, or more recently, a data lake.

Organizations are beginning to realize some of the problems in this design:

  1. The limitations of centralized teams: Centralized data teams cannot possibly understand the data needs of all of the different departments that they serve.
  2. The inability to serve different departments: One central platform cannot be flexible enough to accommodate the requirements of an organization’s different departments.
  3. Slow data provisioning. Centralized platforms are inherently rigid: as they are set up to perform standard operations across the entire organization. As a result, data provisioning is slow, and can never be real-time or on-demand.

Data mesh is a new, decentralized data architecture that attempts to solve the above problems by replacing the single, centralized data source with multiple data domains, each managed by different departments within the organization.

Data Mesh

Why is Data Mesh Important for Organizations?

Data mesh offers organizations the best of both worlds: flexibility and control. In a data mesh architecture, data domains are not silos, but authoritative centers of control fully provisioned to distribute data throughout the organization in a fully governed manner.

One key concept within the data mesh view of the world is data as a product, delivered by data domains to the data consumers within the organization at large. By productizing data, it becomes “packaged,” and made available in a seamless, self-service manner.

Data Mesh

Why is Data Virtualization the Cornerstone in Creating a Data Mesh?

In order for data mesh to work, as described above, it needs a data delivery system that can address its distributed nature. Traditional replication-based data integration approaches, such as extract, transform, and load (ETL) processes, are not capable of performing this function, as they are designed to move data from multiple data sources into a single repository.

Data virtualization, in contrast, is a perfect fit for data mesh. Unlike ETL processes, it provides real-time access to data without having to replicate it.

The architecture of data virtualization is extremely powerful in enabling data mesh:

  • The only data that data virtualization centralizes is the critical metadata for accessing the different data sources.
  • This architecture enables organizations to implement governance and security protocols across all of the different data domains from a single point of control.
  • This architecture also enables organizations to implement highly tailored semantic models above the individual data sources, that effectively serve as data domains without changing the underlying data.
  • These semantic models can be easily changed, developed, or re-designed, again without changing the underlying data.
  • Data virtualization enables full-featured data catalogs that not only list what data is available but can also provide ready, real-time access to it, in a self-service manner.
Data Mesh

Benefits of a Data Mesh

Data mesh architecture, supported by data virtualization, enables organizations to provide data that is:

Curated.

Domain-specific, yet consumable by the organization at large.

Real-time, on-demand, and available in a self-service manner.

Fully governed, safe, secure, and trusted.

Flexibly provisioned, based on the needs of different enterprise departments.

Denodo Free Trial

30-days free trial on the cloud for you to fully test Denodo Professional

START FREE TRIAL

Denodo Express

The free way to data virtualization

DOWNLOAD FOR FREE