What is Big Data?
Big Data refers to particularly large volumes of data that are collected from a wide variety of sources and then stored, processed and analyzed. Such data is characterized by its size, complexity, rapid rate of change and weak structure. Big Data plays a particularly important role in Business Intelligence (BI). To analyze collected data, BI relies on systematic approaches and, in particular, on new software solutions that evaluate large volumes of data and speed up data processing.
Data Warehouse vs. Data Lake – Platforms for storing Big Data
When it comes to storing Big Data, the terms Data Warehouse and Data Lake have become established. Each is suitable in different ways, depending on the requirements.
A Data Warehouse captures structured data, mostly in tabular form, that has already been processed for a specific purpose. The data model is therefore predefined, which makes the warehouse less flexible when new data sources are added, but at the same time the data in a Data Warehouse is very robust and easy to maintain. One approach to modeling Data Warehouses is the Data Vault.
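The effect of a predefined data model can be illustrated with a minimal Python sketch. It is not a real warehouse: the SaleRow type, its three columns and the load_into_warehouse function are hypothetical names invented for this example, and a plain list stands in for a table. The point is only that every row is checked against the fixed model before it is stored, so rows that do not fit are rejected at load time.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class SaleRow:
    # Hypothetical predefined data model: every stored row has exactly these columns.
    order_id: int
    sold_on: date
    amount_eur: float


def load_into_warehouse(raw_rows, table):
    """Validate each raw row against SaleRow before appending it to the table.

    Returns the rows that were rejected, together with the reason.
    """
    rejected = []
    for raw in raw_rows:
        try:
            row = SaleRow(
                order_id=int(raw["order_id"]),
                sold_on=date.fromisoformat(raw["sold_on"]),
                amount_eur=float(raw["amount_eur"]),
            )
        except (KeyError, ValueError, TypeError) as exc:
            # The row does not fit the model, so it never reaches the table.
            rejected.append((raw, repr(exc)))
            continue
        table.append(row)
    return rejected


table = []  # stand-in for a warehouse table
rejected = load_into_warehouse(
    [
        {"order_id": "1", "sold_on": "2024-01-15", "amount_eur": "19.90"},
        {"order_id": "2", "sold_on": "not a date", "amount_eur": "5.00"},
        {"order_id": "3", "amount_eur": "7.50"},  # the sold_on column is missing
    ],
    table,
)
```

Only the first row is stored in this sketch. The other two are rejected when they are loaded, which reflects both sides of the trade-off described above: the stored data is consistent, but a source with a different shape cannot be loaded until the model is changed.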
A Data Lake, on the other hand, captures both structured and unstructured data whose purpose has not yet been defined. Because the data model is not defined until the data is read, problems in the data may only surface at read time, which makes this a potential source of error. In return, Data Lakes are very flexible and scalable.
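As a rough illustration of a data model that is only applied when the data is read, the following Python sketch stores raw records unchanged and interprets them at read time. The records, the field names amount_eur and amount, and the read_amounts function are assumptions made up for this example and do not describe a specific Data Lake product.

```python
import json

# Stand-in for a Data Lake: raw records are stored as-is, without validation.
lake = [
    '{"order_id": 1, "amount_eur": 19.9}',
    '{"order_id": "2", "amount": "5.00", "note": "legacy export"}',
    "sensor reading 42 at 12:00",  # unstructured text is accepted as well
]


def read_amounts(raw_records):
    """Apply a data model at read time: extract an amount from each record.

    Returns the amounts that could be read and the records that could not
    be interpreted under this particular model.
    """
    amounts, unreadable = [], []
    for raw in raw_records:
        try:
            record = json.loads(raw)
            # The reader decides which fields count as the amount.
            value = record.get("amount_eur", record.get("amount"))
            amounts.append(float(value))
        except (ValueError, TypeError, AttributeError):
            # Errors only become visible now, at read time.
            unreadable.append(raw)
    return amounts, unreadable


amounts, unreadable = read_amounts(lake)
```

All three records were stored without complaint, which is where the flexibility comes from. Only when read_amounts is applied does it become clear that the third record does not fit the model the reader had in mind.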