Data Warehouse, Data Lake, Data Marts, Data Base

  1. Data Warehouse
  2. Data Base
  3. Data marts
  4. Data Lake

Data Warehouse

What is? A computer system designed to store and analyze large amounts of data for an organization.

What does it do?

  1. Gathers data from different areas
  2. Integrates and stores the data
  3. Make it available for analysis

Why is it valuable?

  1. Supports business intelligence activity
  2. Enable effective analysis and decision making
  3. Foster data driven innovation

Data Base

What is? Organized collection of structured information, or data, typically stored electronically in a computer system.

What does it do? Stores data in a structured way: rows and columns. Used for day-to-day transactional processing

Why is it valuable? It facilitates the access, and management to data since it is an organized collection of information. Used by application developersend-users, or operational staff.

Data marts

What is? A relational database for analysis.

What does it do? Data is focused on one subject area

Why is it valuable? Since it focuses on a specific area, it is easier to query by business analystsdata scientists, or executives for reporting and insights.

Data Lake

What is? A centralized repository that stores vasts amounts of raw data in its native format.

What does it do? Store any kind of data.

Why is it valuable? It allows flexibility and scalability. Since it stores diverse data types, it makes it ideal for advanced analytics, machine learning, and AI applications.


FeatureData BaseData WarehouseData MartData Lake
Data StructureStructuredStructuredStructuredStructured and Unstructured
Complexity to changeComplexComplexComplexLess complex
Purpose of dataKnownKnownKnownMay not be known
Coverage of departmentsCovers manyCovers manyCovers only oneCovers many
Data SourcesFew SourcesMany source systemsFew sourcesMany source systems
Typical size<100 GB>100 GB<100 GB>100 GB