Assignment 7 – Parallel Computing for Financial Data

Overview

This project benchmarks pandas vs. polars ingestion, computes rolling analytics, explores threading vs. multiprocessing, and aggregates hierarchical portfolio metrics for large financial time-series data. The pipeline produces comparative performance summaries and charts while ensuring correctness through automated tests.

Project Layout

File	Description
`data_loader.py`	Loads CSV market data with profiling for pandas/polars.
`metrics.py`	Rolling moving average, volatility, and Sharpe ratio utilities.
`parallel.py`	Threaded and process-based execution wrappers with psutil profiling.
`portfolio.py`	Sequential and multiprocessing portfolio aggregation logic.
`reporting.py`	Assembles benchmark summary tables and matplotlib visualisations.
`main.py`	Command-line entry point orchestrating the full workflow.
`portfolio_structure-1.json`	Sample nested portfolio hierarchy.
`reports/`	Generated charts after running `main.py`.
`tests/`	Pytest suite covering rolling metrics, parallelism, and portfolio aggregation.
`performance_report.md`	Narrative summary of measured results.

Prerequisites

Python 3.9+
Required packages: pandas, numpy, psutil, matplotlib, pytest
Optional (enables polars comparisons): polars

Environment Setup

python -m venv .venv
.venv\Scripts\activate        # Windows
pip install -r requirements.txt  # (see below for suggested list)

If you do not maintain a requirements file yet, install manually:

pip install pandas numpy psutil matplotlib pytest
# Optional:
pip install polars

Running the Pipeline

python main.py --data market_data-1.csv --portfolio portfolio_structure-1.json --window 20 --report-dir reports

Outputs a JSON summary to stdout.
Saves comparison charts inside reports/.
Requires polars only if you want to benchmark the polars backend; otherwise it will be skipped gracefully.

Running Tests

pytest

The suite validates rolling metric values, parity between threading/multiprocessing and the sequential baseline, and portfolio aggregation invariants.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Assignment 7 – Parallel Computing for Financial Data

Overview

Project Layout

Prerequisites

Environment Setup

Running the Pipeline

Running Tests

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
__pycache__		__pycache__
reports		reports
tests		tests
README.md		README.md
data_loader.py		data_loader.py
instructions.txt		instructions.txt
main.py		main.py
market_data-1.csv		market_data-1.csv
metrics.py		metrics.py
parallel.py		parallel.py
performance_report.md		performance_report.md
portfolio.py		portfolio.py
portfolio_structure-1.json		portfolio_structure-1.json
reporting.py		reporting.py

finm-python-for-finance/assignment-7

Folders and files

Latest commit

History

Repository files navigation

Assignment 7 – Parallel Computing for Financial Data

Overview

Project Layout

Prerequisites

Environment Setup

Running the Pipeline

Running Tests

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages