This project is part of a data engineering course by FIA Business School designed to provide hands-on experience with the modern data stack. Students will learn how to ingest, transform, model, and visualize data using a variety of tools and platforms.
- Ingest data from CSV files into a PostgreSQL database.
- Transfer data from PostgreSQL to Snowflake using Airbyte.
- Model data in Snowflake using a three-layer architecture (staging, dimension/fact, marts).
- Create visualizations using Metabase, Plotly, or Streamlit.
This project uses the AdventureWorks dataset, which describes a fictional company and covers customer, sales, and product data.
The dataset consists of the following files:
- AdventureWorks Calendar Lookup.csv
- AdventureWorks Customer Lookup.csv
- AdventureWorks Product Categories Lookup.csv
- AdventureWorks Product Lookup.csv
- AdventureWorks Product Subcategories Lookup.csv
- AdventureWorks Returns Data.csv
- AdventureWorks Sales Data 2020.csv
- AdventureWorks Territory Lookup.csv
- PostgreSQL
- Snowflake
- Airbyte
- Metabase/Plotly/Streamlit
Detailed steps to install and configure the necessary software and tools will be provided in separate documents or during classroom sessions.
SQL scripts for creating tables and ingesting data into PostgreSQL are provided. Students are expected to create similar schemas in Snowflake as part of the data modeling exercise.
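As a rough illustration of the ingestion step, the sketch below derives a table name from each dataset file and builds a PostgreSQL `COPY` statement for it. It is not the course's provided script: the `raw` schema name, the snake_case naming convention, and the helper names `table_name_for` and `copy_statement` are all assumptions made for this example.

```python
import re
from pathlib import Path

# File names as listed in this README.
DATASET_FILES = [
    "AdventureWorks Calendar Lookup.csv",
    "AdventureWorks Customer Lookup.csv",
    "AdventureWorks Product Categories Lookup.csv",
    "AdventureWorks Product Lookup.csv",
    "AdventureWorks Product Subcategories Lookup.csv",
    "AdventureWorks Returns Data.csv",
    "AdventureWorks Sales Data 2020.csv",
    "AdventureWorks Territory Lookup.csv",
]

PREFIX = "AdventureWorks "


def table_name_for(csv_name: str) -> str:
    """Turn a dataset file name into a snake_case table name.

    The naming convention is hypothetical; adapt it to the provided SQL scripts.
    """
    stem = Path(csv_name).stem  # drop the ".csv" extension
    if stem.startswith(PREFIX):
        stem = stem[len(PREFIX):]  # drop the common dataset prefix
    # Collapse anything that is not a lowercase letter or digit into "_".
    return re.sub(r"[^0-9a-z]+", "_", stem.lower()).strip("_")


def copy_statement(csv_name: str, schema: str = "raw") -> str:
    """Build a COPY statement that reads a CSV with a header row from STDIN.

    The default schema name "raw" is an assumption for this sketch.
    """
    return (
        f"COPY {schema}.{table_name_for(csv_name)} "
        "FROM STDIN WITH (FORMAT csv, HEADER true)"
    )


if __name__ == "__main__":
    for name in DATASET_FILES:
        print(copy_statement(name))
```

A statement built this way could be passed, together with an open file handle, to a PostgreSQL client such as psycopg2's `cursor.copy_expert`. The target tables must already exist, which is what the provided table-creation scripts are for.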
We will use Airbyte to transfer data from PostgreSQL to Snowflake, then transform it in Snowflake following the three-layer architecture (staging, dimension/fact, marts).
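To make the three layers concrete, here is a minimal sketch that builds staging, dimension/fact, and mart views over two tiny raw tables. It uses an in-memory SQLite database as a stand-in for Snowflake so it can run anywhere. All table names, column names, and sample rows are hypothetical and do not reflect the actual AdventureWorks columns or the expected solution.

```python
import sqlite3

LAYERS_SQL = """
-- Raw tables as a sync tool might land them (hypothetical columns, all text).
CREATE TABLE raw_sales (order_date TEXT, product_key TEXT, order_quantity TEXT);
CREATE TABLE raw_product (product_key TEXT, product_name TEXT, product_price TEXT);

-- Made-up sample rows, only so the views below return something.
INSERT INTO raw_sales VALUES
    ('2020-01-01', '1', '2'),
    ('2020-01-02', '1', '1'),
    ('2020-01-02', '2', '3');
INSERT INTO raw_product VALUES
    ('1', 'Helmet', '10.0'),
    ('2', 'Gloves', '5.0');

-- Layer 1, staging: one view per raw table, with types cast and columns renamed.
CREATE VIEW stg_sales AS
SELECT order_date,
       CAST(product_key AS INTEGER) AS product_key,
       CAST(order_quantity AS INTEGER) AS quantity
FROM raw_sales;

CREATE VIEW stg_product AS
SELECT CAST(product_key AS INTEGER) AS product_key,
       product_name,
       CAST(product_price AS REAL) AS unit_price
FROM raw_product;

-- Layer 2, dimension/fact: descriptive attributes and measurable events.
CREATE VIEW dim_product AS
SELECT product_key, product_name, unit_price
FROM stg_product;

CREATE VIEW fct_sales AS
SELECT s.order_date,
       s.product_key,
       s.quantity,
       s.quantity * p.unit_price AS revenue
FROM stg_sales AS s
JOIN dim_product AS p ON p.product_key = s.product_key;

-- Layer 3, mart: an aggregate shaped for one reporting question.
CREATE VIEW mart_revenue_by_product AS
SELECT p.product_name, SUM(f.revenue) AS total_revenue
FROM fct_sales AS f
JOIN dim_product AS p ON p.product_key = f.product_key
GROUP BY p.product_name;
"""


def build_layers() -> sqlite3.Connection:
    """Create the raw tables and the three layers of views in memory."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(LAYERS_SQL)
    return conn


def revenue_by_product(conn: sqlite3.Connection) -> list:
    """Read the mart, sorted by product name for a stable order."""
    query = (
        "SELECT product_name, total_revenue "
        "FROM mart_revenue_by_product ORDER BY product_name"
    )
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    for row in revenue_by_product(build_layers()):
        print(row)
```

Each layer reads only from the layer below it, so a change to a raw column name is absorbed in staging and does not reach the marts. In Snowflake the same structure would typically live in separate schemas, and the casts would use Snowflake's own data types.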
Students will use Metabase, Plotly, or Streamlit to create dashboards or visual reports that provide insights into the data. Specific requirements for the visualizations will be provided in the project guidelines.
Students are encouraged to contribute to the project by suggesting improvements or identifying bugs. Contributions should be submitted as pull requests to the repository.
For more information or questions about the project, please contact Felipe Yoshimoto at https://www.linkedin.com/in/felipe-yoshimoto-252a04204/.