Tag: Data Lake

  • A Low Cost Data Repository for Data Science Project

    Over the past month, I have researched and tested a number of data science technologies, including on-premise and cloud solutions. They are: Google BigQuery, Cloud Storage, Airflow, Qlik Sense, Data Streaming (CDC), Data Warehouse Automation, Data Lake Creation, Debezium CDC, EnterpriseDB, and GaussDB (by Huawei). In the near future, I will share some…

    Continue reading

  • 10 Myths of Data Science Exploded Totally

    Multiple myths are floating around that lend a false aura to data science roles and the related industry. They are built by people who highlight, or even magnify, the benefits of data science through a number of use cases. I always use my own term, “verbal experts”, to classify such people in the…

    Continue reading

  • Data Lake VS Data Warehouse

    Vendors are always saying that data warehouses are legacy data management platforms that should be replaced by data lakes (or, even worse, that Hadoop should replace the data warehouse). In this article, I would like to share my opinion on data warehouses and data lakes, based on my team's experience. Unfortunately, vendors position data lakes as…

    Continue reading
