What Data Governance Means to Cloud Computing (And Vice Versa) – insideBIGDATA

The symbiosis between data governance and cloud computing is apparent to any organization with significant cloud investments. The cloud has a plethora of resources that enhance data governance, which is critical for regulatory compliance, risk mitigation, and long term profitability of data assets.

Simultaneously, however, data governance hallmarks of metadata management, data cataloging, and data stewardship are requisite for maximizing the clouds utility by illustrating where data are and employing them for singular use cases like customer 360s, real-time product or service recommendations, and advanced analytics.

Although its difficult to say which of these capabilities is more advantageous to the enterprise, they clearly complement each other.

For data governance, the clouds chief value proposition is that from a standpoint of connectivity, or data integration, or data quality, or cataloging, everythings included so you have an all-in-one solution thats possible because were in a microservices world where you can deploy efficiently and have things kind of seamlessly interoperable, Informatica MDM General Manager Manouj Tahiliani revealed.

The cloud is also a significant contributor to dispersing resources across locations which, if left unchecked, could easily result in silos and ungoverned use. According to Profisee CTO Eric Melcher, the cloud creates the much more distributed enterprise where youve got lots of applications: our HR applications, CRM applications, ERP applications, our customer experience platform. Weve got applications across the line. And by the way, the datas not in that server in the corner, its now all over the place. So, I need governance to understand where everything is.

Data governance is essential for keeping cloud deployments orderly, while the cloud enriches the very means of effecting governance by allowing organizations to position resources wherever theyre most advantageous for modern computingparticularly when supported by governance staples of metadata management, data cataloguing, data modeling, and data stewardship.

Metadata Intelligence

Metadata is utilitarian for governing data in cloud settings. Its particularly beneficial for assembling data across numerous sources to use in a single domain such as customer, product, supply chain information, and more. Tahiliani referred to the notion of metadata intelligence as foundational to implementing data quality and data integration. Coupling metadata intelligence with Master Data Management exploits the sundry of cloud sources by enabling organizations to govern how they get data from different sources, curate it, and share it across the enterprise, Tahiliani observed.

Metadata plays a couple different roles in this basic functionality. Its the key to connecting data to various downstream systems through MDMs logical model. Firms can also use the power of metadata to help them discover elements that need to be mastered, rapidly provide integration between the system using that metadata knowledge and share that data, as well as use metadata to use machine learning capabilities around matching and data quality rules, Tahiliani commented. These capabilities are instrumental for combining sources for customer insight, for example, in a well governed fashion.

Data Cataloging

Metadata also helps the data cataloging process Tahiliani described, which identifies what data are where in cloud settings. In fact, there are several cloud tools that reinforce this aspect of data governance by empowering organizations so the first thing they do is catalog what they have, Melcher noted. Now that weve cataloged it, lets start classifying and understanding what it is and figuring out whats sensitive and who knows about the data here and there. Once data are classified, its relatively straightforward to tag them and ascribe ownership to them, which provides the framework for suitably governing data across any variety of cloud sources.

Credible catalogs can point at sources to provide this functionality or allow sources to input their metadata for cataloging purposes. The latter is particularly effectual with MDM because it lets organizations push the logical definitions of Master Dataand register assets to what the end user would actually expect, Melcher mentioned. The benefit of this approach is data already adheres to MDM conventions of data quality, definitions, and other aspects of its logical model, which aids in creating catalogs and classifications that enhance enterprise use caseswhile ensuring those use cases are well governed.

Data Modeling

MDM is also useful for unifying the different data models that abound throughout cloud sources so organizations can combine them for analytics, for example. Data modeling is a pivotal aspect of data governance that can be particularly time consuming (delaying time to value) with traditional methods. MDM, however, lets organizations create models for products or any other domain partly by using data discovery mechanisms. As youre creating the models and what attributes you need to have, the discovery aspect allows you to understand across the source systems for those different entities whats the metadata thats being held, Tahiliani explained. That provides information about how often its used and updated to really determine if that should be an attribute within your master data model.

The most expressive data models unify terminology and include specific vocabularies to describe customers or supply chain needs, for instance. The metadata intelligence Tahiliani alluded to assists this facet of data modeling while allowing organizations to redress disparities in how entities are represented when you know which source systems the metadata youre pulling in is from, and what the vocabulary is across those source systems, Tahiliani added.

Data Stewardship

All of the above dimensions of data governancemetadata intelligence, data cataloging, and data modelingare invaluable to data stewards attempting to ensure governance standards are met across the enterprise in heterogeneous cloud, edge, and on-premise applications. The enhanced data modeling traits pertaining to definitions and attributes that are found in MDM (including information from data cataloging tools) are critical to data stewards validating data governance protocols.

For example, if a steward needs at-a-glance information about the data model for a customer entity, hell see this is a description of a customer associated with this glossary term called business partner or whatever the case may be, and Bob in Sales is an expert on what customer is, Melcher indicated. With this approach MDM participates in governance by pushingmetadata into a catalog, but then, as governance occurs, the output of governance is then available for end users to consume back in [MDM], Melcher concluded.

Master Data Governance

Data governance is indispensable for making good on the clouds premise of real-time access to a distributed data landscape with elastic scalability in a pay-per-use pricing model. Additionally, the cloud contains many toolssuch as cloud native MDM solutions, data cataloging instruments, and othersthat streamline governance capabilities to decrease datas risk while boosting its enterprise value over the long term.

MDM sits between each of these constructs as a viable means of balancing the usefulness of one with the other. You shouldnt do governance in MDM, Melcher cautioned. You should do governance and MDM. You should be governing across more than just your Master Data Management platform.

About the Author

Jelani Harper is an editorial consultant servicing the information technology market. He specializes in data-driven applications focused on semantic technologies, data governance and analytics.

Sign up for the free insideBIGDATAnewsletter.

Join us on Twitter:@InsideBigData1 https://twitter.com/InsideBigData1

Read more here:
What Data Governance Means to Cloud Computing (And Vice Versa) - insideBIGDATA

Related Posts

Comments are closed.