Choosing a Data Lake Format: What to Actually Look For
ModelingData Lakeposted by ODSC Community July 24, 2023
Recently we’ve seen lots of posts about a variety of different file formats for data lakes. There’s Delta Lake, Hudi, Iceberg, and QBeast, to name a few. It can be tough to keep track of all these data lake formats — let alone figure out why... Read more
8 Data Lake Vendors to Make Your Data Life Easier in 2023
Modelingdata engineeringData LakeMLOpsposted by ODSC Team May 31, 2023
Data has to be stored somewhere. Data warehouses are repositories for your cleaned, processed data, but what about all that unstructured data your organization is starting to notice? Where does it go? To make your data management processes easier, here’s a primer on data lakes, and... Read more
Powerful, Open Source, and Completely Free? HPCC Systems is the Real Deal for Data Lakes
ConferencesModelingData LakeHPCC SystemsWest 2020posted by ODSC Community November 6, 2020
We invite you to learn more about the powerful, open-source HPCC Systems. Our comprehensive, dedicated data lake platform makes combining different types of data easier and faster than competing platforms — even data stored in massive, mixed schema data lakes — and it scales very quickly... Read more
Gain Insight Into the COVID-19 Pandemic with the HPCC Systems Data Lake Platform
ModelingBig DataCOVID 19COVID-19 Data SourceData LakeHPCC SystemsPandemic MetricsSARS-Cov-2posted by ODSC Community September 24, 2020
This open-source project features a community contribution cluster which can be made available to ODSC followers. The severe acute respiratory syndrome coronavirus 2 (SARS-Cov-2) has taken around 930,000 lives and infected more than 29 million people worldwide so far. The COVID-19 pandemic is still ongoing, and... Read more