Tag: Data Lake
-
A Low Cost Data Repository for Data Science Project
In the past month, I have made a number of research and testing on different data science technology including on-premise and cloud solutions. They are: Google BigQuery, Cloud Storage, Airflow Qilk Sense, Data Streaming (CDC), Data Warehouse Automation, Data Lake Creation Debezium CDC EnterpriseDB, GaussDB (by Huawei) In the coming future, I will share some…
-
10 Myths of Data Science Exploded Totally
There are multiple myths floating around that add a false aura around data science roles and the related industry. The myths are built by people highlighting or even magnifying the benefits of data science with a number of use cases. I am always using my own term “verbal experts” to classify such people in the…
-
Data Lake VS Data Warehouse
Vendors are always saying that Data Warehouses are legacy data management platforms which should be replaced by data lakes (or even worse situation as Hadoop to replace data warehouse). In this article, I would like to share my opinion on data warehouse and data lake by the team experiences. Unfortunately, vendors position data lakes as…


