There are several tutorials. We organize them into different sections that one can practice in sync with lectures.
Some have been discontinued but still they can be useful for learning.
Check the subdirectories to see some small examples we use in our courses.
- Data quality
- Queue-based data ingestion
- Data Ingestion and Apache Nifi
- Example of cloud data pipeline
- Data ingestion with MongoDb
- Hadoop systems
- MapReduce Programming
- Basic Dask
- Basic distributed queries with Trino
- Workflows with Apache Airflow
- Spark programming
- Spark streaming
- Stream processing with Flink
- An example of a mini edge-cloud big data platform
You might see many code examples about IoT, Edge and Clouds from our IoTCloudSamples that you can reuse for big data studies.