Centralized vs. Federated data teams is the wrong debate.

Centralized vs. Federated data teams is the wrong debate.

I have a confession to make.

I started my data career managing a team that did its best to stay outside of enterprise data management prescriptions. 

We even called our database a “datamart”—not because it technically was one, but because it let us bypass the enterprise rules for data warehouses. 

Our excuse? 

We had to deliver for two trading desks + a quant team, under intense pressure, and the enterprise group just didn’t understand that kind of urgency.

Of course, karma being what it is, I spent the rest of my career firmly on the enterprise side—one of those people who “didn’t understand.” 

That experience gave me a front-row seat to one of the oldest debates in data: 

Should data and AI functions be centralized or federated?

Each model comes with pros, cons, and plenty of passionate opinions. 

I’ve learned the best answer often isn’t one or the other—it’s a smart blend of both. Here’s why.

Centralized Model

Pros

  • Strong governance and consistency—easier to enforce standards, policies, and privacy compliance

  • Efficient use of resources—one team, one set of tools, better economies of scale

  • Easier to build shared capabilities like MDM, metadata, observability, and analytics platforms

Cons

  • Bottlenecks when demand is high

  • Harder to get business buy-in—can feel like an external service

  • Perceived lack of business knowledge or alignment

Federated Model: 

 Pros 

  • Closer alignment to business priorities and nuances

  • Faster iterations—no waiting in a central queue

  • Clear ownership and accountability at the domain level

Cons

  • Hard to solve for cross-domain challenges, including AI governance

  • Inconsistent standards make reuse and cross leverage challenging

  • Duplication of tools and effort

My Take: The Hub and Spoke Model

When done right, hub and spoke blends the strengths of both worlds.

The Hub

  • Builds shared infrastructure (MDM, lakehouse, lineage tools, observability)

  • Defines and enforces enterprise standards

  • Leads cross-domain initiatives and AI platform strategy

  • Drives data literacy and education

The Spokes

  • Sit in business lines/functions, owning use cases tied to KPIs

  • Maintain local domain models and dashboards

  • Bridge domain experts with centralized capabilities

  • Feed lessons learned back into enterprise strategy

What matters most is clarity:

  • Who owns what?

  • Who’s accountable?

  • How will you collaborate without creating silos?

If you’ve been on either side of the fence like I have, you know there’s no perfect org chart—only structures that fit your culture and maturity level.

The right model isn’t about where the data sits—it’s about how people work together to turn it into value.

So, what model are you using today? Has it evolved? What’s working (or not) for you?

From Debate to Delivery. Clarity Wins.

I help data and AI leaders move beyond centralized vs. federated ideology and design operating models that actually deliver business value.

With 25+ years leading data, AI, and risk transformation at firms like Voya Financial, Deutsche Bank, Citigroup, and Freddie Mac, I know how to blend governance with agility—turning data strategy into measurable impact.

→ Planning an event or need a practical, straight-talking speaker on data operating models, monetization, AI, or risk? Book me here.

→ Want to see how a hub-and-spoke approach could unlock hidden value in your data? Schedule a discovery chat.

Great illustration. What about when you have multiple external parties involved? This hub and spokes/federated model is on trend in health care data analytics where organizational partnerships allow researchers to access sensitive/hard to find data locked locally while protecting patient privacy.

Like
Reply
Gerald T.

Global Sales Development Leadership | GTM and AI agent architect | GTM and AI Ops | Servant Leader

1mo

Julia Bardmesser - Excellent articulation of the trade-offs in data organization. The "Hub and Spoke" model is exactly the approach I've seen deliver the most sustainable value. The key is enabling the "Spokes" without creating chaos. I've found that balance is best achieved when the "Hub" provides a common platform that facilitates a federated governance model—setting enterprise ground rules while empowering domains to execute. This requires a shift to decentralized, domain-driven ownership. When domains like sales or finance truly own their data as a product, you see a significant uplift in quality and relevance. The central team's role rightly evolves to become an enabler of this federated model. This is exactly what we are trying to solve for here at Semarchy. Great post. It all comes back to the clarity you mentioned on ownership and collaboration.

Elena Alikhachkina, PhD

Chief Information & AI Officer | Scaling Enterprise AI with Strong Data & AI Governance | Data Products • Decision Intelligence • Generative AI (RAG/Agents) | Board & PE/VC Advisor | NACD | Healthcare

1mo

Great topic for Friday. I see hub-and-spoke as federated by default. The real question is not what model you pick—we’ve all learned the hard way you cannot centralize what is not natural to centralization. The question is how you deploy hub-and-spoke. I’ve built this model four times in large, globally distributed companies. When done well, it unlocks both governance and innovation. When done poorly, the spokes turn into data-copying machines, multiplying inconsistencies at the speed of light. The secret sauce isn’t in choosing central vs. federated—it’s in designing the flows, incentives, and accountability that make the hub and spokes operate as one system.

Build it and they will come! This concept is never true when centrally managing data.

Trevor Leahy (Lee Hee)🪬🧿

Human Consultant Oasis Blue, EX Test Consultant at Fujitsu, EX Business Consultant at B&Q, WH Smith and SwissAir

1mo

Thank you for framing the problem. This can be applied to leadership models, operations and organisations. The key imo is to let the knowledge and information drive the operation and from that the organisation. When the hierarchical command led organisation drives the operation then trouble will most probably be ahead. 🙏✅♥️

To view or add a comment, sign in

Explore content categories