I was discussing Hadoop architecture with a team and the meeting ended in agreeing to disagree on the architecture! There seems to be a confusion among new generation data experts about Data Warehouse, Data Marts & Hadoop Data Lakes.
Data Warehouses were designed way back in 1980s and the idea was to design a data reflection of the business to be used for analytics. I do not think the concept of DWH changes with advent of Big Data and yet we keep hearing of Hadoop will get rid of DWH. there could be cases where Hadoop Data Lake would serve the business purpose but to say that it is a replacement of Data Marts & Data Warehouse is incorrect.The integration in Data Warehouse is not just to arrange and store data for business but it also takes care of 'cleansing data to solve various data quality & validity issues' that affect business.
Data Warehouses were designed way back in 1980s and the idea was to design a data reflection of the business to be used for analytics. I do not think the concept of DWH changes with advent of Big Data and yet we keep hearing of Hadoop will get rid of DWH. there could be cases where Hadoop Data Lake would serve the business purpose but to say that it is a replacement of Data Marts & Data Warehouse is incorrect.The integration in Data Warehouse is not just to arrange and store data for business but it also takes care of 'cleansing data to solve various data quality & validity issues' that affect business.
No comments:
Post a Comment