Discussion about this post

Perry Chen

Sorry I couldn't comment earlier. I had very similar feelings about the definition of data products as you discussed, and I have started to look into the relationships between data products and other datasets in the data space. I've recently been working on a study of complexity analysis and design considerations for data products and Data-as-a-Service (DaaS). The initial study shows we may need to do more work before we can effectively handle the complexity of the current, fast-changing data operation and management environments in the data space. That complexity cannot easily be reduced; it can only be controlled and managed if we find the right enablers and methods.

We know not every dataset or data entity in the data space needs to be, or should be, a data product. But each data product must have its associated dataset(s), which result from design and process effort to move, transform, package and present it (or them) using various tools, platforms and activities. This new and different form of data has special value or use cases, so we need not only to name it differently but also to treat it differently in catalogs, metadata management, applications and management. We also need to distinguish between internal and external data products.

Looking forward to reading your follow-up articles on data product taxonomy and approaches, if they have not been shared here yet.

Darren Lynch

In recent years I have been looking for a clear definition of a data product. To date I have found a number of definitions, and while they made sense, they always seemed somewhat open-ended. As data professionals we seem to like order and governance, but openness at the same time. It feels a little like a get-out-of-jail-free card. I look forward to seeing where you take this.

