Implement transformations and generate formatted csv file #28

Open
saipraneeth0318 wants to merge 2 commits into HedgeApple:master from saipraneeth0318:master


@saipraneeth0318

I have implemented the transformations and generated the formatted.csv file as per my understanding. Please review the code changes and let me know if anything is missing.

@saipraneeth0318

ETL Homework Solution

This repository contains an ETL (Extract, Transform, Load) solution designed to read data from a homework.csv file, transform it, and then save the results to a formatted.csv file.

Description

This ETL solution aims to streamline the process of reading, transforming, and saving CSV data. Through this framework, users can seamlessly process homework.csv and obtain a formatted output in formatted.csv.

Project Structure

  • etl_conf.json: A configuration file that holds settings and parameters for the ETL process.
  • etl_solution.py: The main Python script that orchestrates the ETL process. It reads data from homework.csv, performs necessary transformations, and outputs the results to formatted.csv.
  • requirements.txt: Contains the necessary Python library dependencies to run the ETL script.
  • homework.csv: Source CSV file containing the raw data to be processed.
  • formatted.csv (After execution): The resultant file after processing homework.csv through the ETL solution.
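
The read, transform, write flow described for etl_solution.py could be sketched roughly as below. This is an illustration only, not the code in this pull request: the function names (`extract`, `transform`, `load`, `run`) are hypothetical, and the whitespace-trimming step stands in for the real transformations, which are not listed in this description.

```python
import csv
from pathlib import Path


def extract(path):
    """Read every row of a CSV file into a list of dicts keyed by header."""
    with open(path, newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))


def transform(rows):
    """Placeholder transformation (an assumption): trim whitespace from
    headers and values. Missing values become empty strings, and surplus
    fields beyond the header (stored under the key None) are dropped."""
    return [
        {key.strip(): (value or "").strip()
         for key, value in row.items() if key is not None}
        for row in rows
    ]


def load(rows, path):
    """Write rows to a CSV file, using the first row's keys as the header."""
    if not rows:
        # Nothing to write: create an empty output file.
        Path(path).write_text("", encoding="utf-8")
        return
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=list(rows[0].keys()))
        writer.writeheader()
        writer.writerows(rows)


def run(source="homework.csv", target="formatted.csv"):
    """Run the three stages in order."""
    load(transform(extract(source)), target)
```

Calling `run()` from the repository root would read homework.csv and write formatted.csv next to it. Keeping the three stages as separate functions makes it possible to test the transformation step without touching the file system.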

Setup and Execution

1. Clone the Repository

git clone https://github.com/saipraneeth0318/etl_homework.git
cd etl_homework

2. Install Dependencies

Before running the script, install the required Python libraries:

```bash
pip install -r requirements.txt
```

3. Run the ETL Script

Execute the `etl_solution.py` script:

```bash
python etl_solution.py
```

After execution, check the repository for the `formatted.csv` file, containing the processed data.

Configuration

For any necessary adjustments to the ETL settings or parameters, modify the `etl_conf.json` file accordingly.
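
As a rough illustration of how a script might consume such a file, the sketch below loads a JSON configuration and falls back to defaults when a key or the file itself is missing. The schema of `etl_conf.json` is not shown in this description, so the keys `input_path` and `output_path` and the helper `load_config` are assumptions made for the example, not the project's actual settings.

```python
import json

# Assumed defaults; the real keys in etl_conf.json are not shown here.
DEFAULTS = {"input_path": "homework.csv", "output_path": "formatted.csv"}


def load_config(path="etl_conf.json"):
    """Return DEFAULTS overridden by whatever keys the JSON file defines."""
    try:
        with open(path, encoding="utf-8") as f:
            overrides = json.load(f)
    except FileNotFoundError:
        # No config file: run with the defaults only.
        overrides = {}
    return {**DEFAULTS, **overrides}
```

With this approach, editing a single value in the JSON file changes the script's behaviour without touching the Python code, and unknown keys are passed through untouched.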
