Data Mesh Myths: Separating Fact from Fiction
Table of contents
Can data mesh unlock the potential of decentralized data management, or is it merely the latest buzzword to distract us from more pressing business initiatives?
Data mesh has gained traction as a transformative approach to data analytics and management. However, amidst the growing enthusiasm, several misconceptions and myths have taken root and now obscure the true potential of this innovative data architecture.
As we step into the new year, finding ways to optimize your approach to data is more important than ever. By working to dispel the myths surrounding the data mesh architecture, we can unravel its complexities and reveal the many advantages that await your organization.
From security concerns and implementation challenges to the role of data teams, let’s navigate these data mesh myths together to sort out the intricacies and achieve greater clarity on the topic.
Myth #1: Data Mesh is Too Expensive
What is the price tag on innovation?
One of the most prevalent assumptions about data mesh is that the architecture is simply too expensive to be worth implementing. Yet, like any worthwhile investment, the data mesh architecture can require significant upfront costs, yet save us costs in the long run.
These initial costs can build up quickly in the implementation stages, especially when maintaining an existing infrastructure while building the new one. As a result, these systems must be capable of operating simultaneously until the older system is decommissioned.
This can scare off hopeful investors. Along with the need for resources, the high cost of specialists and training for current teams can appear daunting in the face of simply keeping an older approach to data management.
However, over time, the investment offers significant cost benefits.
-
Heightened Engineer Productivity:
Data mesh enables data engineers to spend their time more productively and complete their immediate responsibilities rather than raking through data silos. With a data mesh architecture, engineers can easily locate the data they need from self-serve platforms that operate cross-functionally across different business domains.
-
Faster Product & Project Launches:
Data mesh facilitates vastly improved collaboration and communication between different departments, allowing new products and projects to be integrated and available for search at a much faster pace. This heightened project and product launch speed can have benefits all across an organization, from product development to marketing.
-
Improved Data Management:
In a data mesh architecture, data assets are managed autonomously by specific business domains but shared cross-functionally across all organizations. This independent yet collaborative ownership enables more trustworthy data with a clear understanding of what kind of data it is, what its value is, the quality of the data, and when and who generated it. In addition, the data will be stored in one place and not copied hundreds of times, saving on storage space and eliminating the challenge of data relevance. If errors are found in the data or modifications are needed, these tasks can easily be accomplished in one place without fear of desynchronization.
-
Enhanced Data Governance:
A data mesh architecture enables equal access to metadata about different data products. As a result, all data users are clear on what kind of dataset they are dealing with and what usefulness the data provides. If another team requires access to the data, they can easily request this access from a centralized directory where both the table level and specific rows and columns can be found.
-
Streamlined Data Catalog:
Data mesh places all metadata about data products into one central directory where it is easily discoverable. From this data directory (or data catalog), you can request access directly from the catalog owner, bypassing more time-consuming processes that involve gaining access from busy data engineers. In turn, discovering data is transformed into a more efficient process and engineering teams are alleviated of the burden of providing access to data to all other users.
-
Clear Data Lineage:
At each moment in time, data mesh enables you to see the basis used to generate a dataset and what it depends on. This information is very useful when modifications to the data must be made (such as adding or removing a column), as the system can automatically tell you which datasets depend on the data being modified and may be affected by the modifications. The improved data lineage offered by data mesh also allows you to better understand which datasets need to be regenerated (for example, if a bug was found in upstream dependencies).
-
Better Data Quality Standards:
As policies are built into the data mesh, the quality of data and their standardization increases. The process of experimentation and prototyping is accelerated. Data science and AI teams can directly discover data, request access to it, and conduct experiments, again without the participation of data engineers. All this speeds up and simplifies innovation, which can offer you competitive advantage over other companies.
Myth #2: Data Mesh Isn’t Safe
When a new data strategy enters the business space, the question of security inevitably arises.
However, while many may fear that the data mesh architecture cannot keep data secure, the opposite is often true. Data mesh is, on the contrary, a technology for data decentralization that enables each team to claim ownership over their data products and — in theory — store those dat products anywhere available within the data catalog.
Data mesh is typically stored using cloud technologies that are highly available and fault tolerant, minimizing potential threats to the data products.
As for privacy, business-critical data may be better stored on-premises rather than in the cloud. Since only the directory is centralized within the data mesh, the autonomous nature of different data products belonging to specific business domains makes data mesh even better suited to a hybrid approach that utilizes both the cloud and on-premise resources.
Myth #3: Adapting to Data Mesh is Too Complex
While data mesh may seem complex to newcomers, the architecture actually simplifies rather than complicates.
To understand this, let’s look at how businesses construct different processes. In this scenario, this construction will involve scrum for project management, Jiro or Trello for task tracking, Pagerduty for incident monitoring, as well as additional tools.
At first glance, this use of many different tools may seem to complicate the system, so we can expect a bit of initial resistance from employees.
However, this layered approach ultimately allows the constructed business process to reach a much higher level of efficiency. It may take some time to put all the necessary pieces in place and get everyone on board, but — eventually — the results prove worth the effort.
This is the basic social aspect of data mesh to consider. On the surface and from the perspective of a newcomer, data mesh can seem overly complex — but with proper time, effort, and team flexibility, data mesh becomes a vital component of a much more productive business.
Moreover, shifting the social culture of a business to embrace complexity is crucial, as the benefits of data mesh are best achieved through collaborative efforts.
Myth #4: Tailoring Data Mesh to Your Needs is Impossible
Data mesh can be completely customized to your needs. In fact, at present its customization flexibility is perhaps what can get confusing to newcomers – with the separation of data products into different domains.
While, yes, data mesh is based on the categorization of information, each organization can choose between the different tools and systems that make up their specific approach to data mesh. Even ready-made solutions like data catalogs can still be tailored to the business needs of each organization.
Ultimately, there is no direct product to download and install data mesh from — data mesh is achieved through the combined efforts of technical innovation and a cultural shift in the workplace.
Myth #5: Getting Started with Data Mesh Requires Too Much Effort
Data mesh is not initially a quick turnaround — the approach requires careful and strategic planning.
However, despite the time it takes to implement a fully realized data mesh architecture, you can get started with data mesh in smaller initial phases. As a result, you can divide the effort required into bite-size tasks.
For instance, establishing a partial data mesh or a data catalog prototype can be achieved in just a couple of months and set the tone for the complete transformation of a business.
Aside from the time commitment, the two other major factors that get misconstrued as requiring too much effort are talent and migrations.
From a talent perspective, you will eventually need considerable expertise and support to maintain a data mesh.
Yet, you can get started initially with just two main teams — data engineers and everyone else. The data engineers create the self-serve platforms for all other teams to use and determine the best practices for their organization. Meanwhile, the business owners, stakeholders, project leads, cloud teams, and security teams can work on sorting out the data mesh standards.
This is known as federated governance, one of the core principles that guides the data mesh concept. Take a look at what Zhamak Dehghani, the creator of data mesh, has to say about the federated governance principle:
“[Data mesh governance] embraces constant change to the data landscape. It delegates the responsibility of modeling and quality of the data to individual domains, and heavily automates the computational instructions that assure data is secure, compliant, of quality, and usable. Risk is managed early in the life cycle of data, and throughout, in an automated fashion. It embeds the computational policies in each and every domain and data product.”
As for the challenge of legacy systems, this once again loops back to approaching a data mesh implementation in phases. You can transform each legacy system one by one, as data mesh does not require immediate integration. However, to achieve maximum efficiency, all systems and products must eventually be connected within the data mesh.
Slowly but surely, you will begin to reap the data mesh rewards.
Driving Data Mesh Awareness & Adoption at Your Organization
Dispelling the myths surrounding data mesh is paramount for fostering a genuine understanding of its capabilities and fostering successful adoption.
As we debunked misconceptions about cost considerations, security risks, technical complexity, customization, implementation effort, it has become evident that data mesh — when properly understood and leveraged — transforms into a powerful tool for decentralized data management.
By dismantling these myths within your own organization, you can pave the way for your teams to harness the true potential of data mesh, enabling agile and scalable data ecosystems along the way. Embracing data mesh with confidence and unity amongst your teams can act as a catalyst for facilitating greater innovation and data-driven decisions in your organization.
The data landscape continues to evolve. As it does, data mesh can serve as a guiding light toward the transformative data capabilities this architecture has to offer.
Was this article useful for you?
Get in the know with our publications, including the latest expert blogs
End-to-End Digital Transformation
Reach out to our experts to discuss how we can elevate your business