**RUN** | Management & Operational Excellence
Lead photo: Thomas A. Fink, Shutterstock
Nearly four years ago, Mihir Shah had a vision for fundamentally changing Fidelity Investments’ data strategy. Then CTO, Shah pushed to become the Boston-based multinational financial services company’s first ever enterprise head of data architecture and engineering to enact that vision, what he calls a next-generation data pipeline.
At the center of that vision was one key tenet: “No matter what your role is, we want to make data available to you to make your job easier and to make your decision-making better,” he says.
Becoming a data-driven company has been a key goal of most organizations for years now, but many have failed to achieve that goal. For Shah, transforming the data operating model is the foundation of success in becoming data-driven.
Mihir Shah, enterprise head of data architecture and engineering, Fidelity Investments
“Everybody knows that data is valuable, and that integrated data is more valuable than siloed data,” he says. “We’ve been talking about this for years, but nobody has been willing to make those operating model changes to actually make that happen. The technology is there. It’s getting better. But I think the key secret sauce is the operating model.”
When Shah first joined Fidelity as a principle technical adviser in 1994, the company was relatively advanced in its use of data. For example, it had a marketing database with customer profiles and customer modeling. But the data wasn’t integrated across its businesses. Shah’s vision was to change the way Fidelity managed data and made it available to its 41,000-plus employees.
“The siloed approach has multiple side effects,” Shah says. “For somebody to leverage a data asset, you have to gather those data sets from everywhere for every new use case. The time to market and the friction that comes when you have siloed data is very high.”
A simple data set in a traditional data architecture might include an account with positions, balances and transactions, for example, Shah says. The marketing department wants that data to calculate total assets and customer behavior. The risk department wants that data for calculating fraudulent activities. The finance department needs that data to understand the profitability of the customer.
“We had a marketing data warehouse, we had a risk data warehouse, we had a finance data warehouse, all operating separately, all with feeds going from one to the other,” Shah says. “They all worked fine, but at the back end the cost of actually managing all that was tremendous. We could be spending that money on more value-added activities than just moving data around.”
Instead, Shah sought to use new technologies and the scalability of the cloud to create a single data warehouse that made data independent of its users.
Shah’s transformation vision hinged on three steps. First, Fidelity had to migrate its data into that single cloud data warehouse and create processes by which new data would feed into the data warehouse as it was created. Second, it needed to make the warehouse accessible to everyone in the company. Third, everything had to be secure within the boundaries of Fidelity’s ethical, privacy and contractual standards.
For organizations implementing similar data lake–like concepts, a major stumbling block can be stakeholders’ unwillingness to give up ownership of their data.
“We had to rethink how we actually create a business model where people don’t have to give up what they own, yet we actually are able to integrate everything,” says Shah, whose team handles the engineering of the platform’s cloud database, ingestion pipelines, monitoring and data catalogs.
Procedures also had to be put in place for getting newly created data into the data lake, a process that must be highly coordinated given Fidelity’s taxonomy, which includes 15 data categories at the top level and 3,000 categories at the lowest level. Shah’s team provides the sequence for seeding data into the platform and orchestrates the process, but the data owners execute the process.
“The data is best owned and managed by the people who currently manage and own it,” Shah says. “We said that each data owner needs to push data into the platform and seed the platform but continue to own it.”
Getting people to use the platform, build products and migrate their existing applications, dashboards, reporting and so on has been another key element of Shah’s strategy.
“People will gravitate toward the best source of data, the clean source of data, data that is managed, data where they can talk to someone if they need to,” Shah says. “Now, it’s almost a network effect. Everybody knows this is the best place to get data. They come on board, they find a couple of datasets missing, so they go to their business partners and say, ‘I need this data set.’ So, more data comes in, the usage goes up, and the more the usage goes up, the more there’s a demand for more data sets.”
Fidelity started implementing the new strategy in January 2020. By June 2020, it was in production. Getting engineers throughout Fidelity excited about the vision was key to selling the rest of the company on the transformation, Shah says.
“If you get the engineers on board and get them excited, they convince the managers and business partners,” he adds. “They do the selling up in their vertical business units and they are trusted within the business units. Even if other people don’t believe in me, they will believe in their own engineer.”
Today, there are nearly two petabytes of data in the platform. Meanwhile, Shah says his focus has shifted as the power of the strategy has become apparent.
“If we can do this inside of Fidelity, why not expand the data architecture to our ecosystem?” he says. “Now, when we think about data architecture, we are thinking not just about Fidelity business units; we’re thinking about the entire ecosystem and saying, ‘How do we make data an asset, not just for us but also for our institutional customers? How can we make our suppliers’ lives easier by not having them send us a hundred thousand FTP files every night?’ Your thinking about a data architecture should not be limited to your business units.
Thor Olavsrud is a senior writer at CIO.com.
The age of passive business intelligence is over. Today, you need the power to analyze real-time information and respond in the moment. Get the executive perspective on Active Intelligence—including how to achieve it in your organization.
In today’s world, success requires collaboration – not only with your existing partners but with suppliers, customers, and even competitors. Discover the top 10 emerging BI and data trends, and find out how to use them to your advantage.
Too often, data delivery is the barrier to informed action—but it doesn’t have to be. Download this executive brief to learn how conversational analytics can supply instant insights for real-time decisions.
Every data-driven executive understands the tension between speed and risk. But how, specifically, can you thread that needle? Learn which tools and strategies will help you master real-time data delivery without sacrificing quality or security.
In our real-time economy, uncertainty is everywhere. So how do you act quickly without acting impulsively? By adopting a data strategy that gives you—and everyone in your organization—in-the-moment awareness of your context and your business.