Data Warehouse
What is? A computer system designed to store and analyze large amounts of data for an organization.
What does it do?
- Gathers data from different areas
- Integrates and stores the data
- Make it available for analysis
Why is it valuable?
- Supports business intelligence activity
- Enable effective analysis and decision making
- Foster data driven innovation
Data Base
What is? Organized collection of structured information, or data, typically stored electronically in a computer system.
What does it do? Stores data in a structured way: rows and columns. Used for day-to-day transactional processing
Why is it valuable? It facilitates the access, and management to data since it is an organized collection of information. Used by application developers, end-users, or operational staff.
Data marts
What is? A relational database for analysis.
What does it do? Data is focused on one subject area
Why is it valuable? Since it focuses on a specific area, it is easier to query by business analysts, data scientists, or executives for reporting and insights.
Data Lake
What is? A centralized repository that stores vasts amounts of raw data in its native format.
What does it do? Store any kind of data.
Why is it valuable? It allows flexibility and scalability. Since it stores diverse data types, it makes it ideal for advanced analytics, machine learning, and AI applications.
| Feature | Data Base | Data Warehouse | Data Mart | Data Lake |
| Data Structure | Structured | Structured | Structured | Structured and Unstructured |
| Complexity to change | Complex | Complex | Complex | Less complex |
| Purpose of data | Known | Known | Known | May not be known |
| Coverage of departments | Covers many | Covers many | Covers only one | Covers many |
| Data Sources | Few Sources | Many source systems | Few sources | Many source systems |
| Typical size | <100 GB | >100 GB | <100 GB | >100 GB |