Collaborative data management is a necessity in the self-service world and knowledge sharing is the first step in creating collaborative culture. Crowdsourcing of tribal knowledge is an important part of curation practice. Everyone who works with data has the opportunity to curate by sharing their knowledge and experiences. Data catalogs are rapidly becoming the new “gold standard” for metadata management, making metadata accessible and informative for non-technical data consumers.Ī typical organization has many people doing data curation work with varying degrees of responsibility and corresponding time commitment. ![]() Data curation is a metadata management activity and data catalogs are essential data curation technology. Making datasets easy to find, understand, and access is the purpose of data curation-a purpose that demands well-described datasets. But organizing and managing are the essence of data curation. That is what we do when we store data in data warehouses or data lakes. Collecting datasets is only the beginning. The distinction between “collections of data” and “collections of datasets” is subtle but significant.ĭata curation, then, is the work of organizing and managing a collection of datasets to meet the needs and interests of a specific groups of people. Note that the focus here is datasets – files, tables, etc. If curated describes collections of things that are selected and managed to meet the needs of a specific group, then curated data is a collection of datasets that is selected and managed to meet the needs and interests of a specific group of people. Organizing and managing are the critical elements of curation-making things easy to find, understand, and access. has described Apple’s App Store as “curated computing.”Ĭuration is the work of organizing and managing a collection of things to meet the needs and interests of a specific group of people. ![]() More recently we’ve started to use the term to describe managed collections of many kinds such as curated content at a website, curated music and videos available through streaming services, and curated apps through download services. The traditional use of the word is associated with collections of artifacts in a museum and works of art in a gallery. ![]() Let’s set data aside for a moment and consider the meaning and the activities of curating. Curating data involves much more than storing data in a shared database. When speaking and consulting, I often hear people refer to data in their data lakes and data warehouses as curated data, believing that it is curated because it is stored as shareable data. Data curation is important in today’s world of data sharing and self-service analytics, but I think it is a frequently misused term. Data curation is a term that has recently become a common part of data management vocabulary.
0 Comments
Leave a Reply. |