In recent years there has been an insatiable hunger for data lakes because of their ability to store data regardless of whether it is structured, semi-structured, or unstructured. This capability is especially important because the rate of increase in the volume of unstructured and semi-structured data far outweighs that of structured data. You will find that most data lakes are built on one of three cloud object stores: Amazon Simple Storage Service (Amazon S3), Azure Data Lake Storage Gen2 (ADLS Gen2), or Google Cloud Storage (GCS). It is also becoming increasingly more common for data lakes to span multiple-cloud providers, and as a result, more than one storage service.
Why Cloud Object Stores are Important
Cloud object stores are typically low cost, highly scalable, extremely secure, compliant with several international standards, and provide virtually unlimited storage capacity. Implementing an on-premises data lake with all these attributes would be impractical for all but...
Continue reading this post on the Open Data Blend Blog.