This project showcases a manual ETL pipeline for AWS CloudTrail logs using Python and SQLite. The pipeline parses raw JSON logs, transforms them into relational tables, and runs SQL queries against SQLite to detect anomalies. The goal is to identify unusual activity within a cloud environment efficiently.
cloud_security_pipeline/
|
|- data/
| |_ cloudtrail_logs.json # Sample CloudTrail log file
|
|- db/
| |_ cloudtrail.db # SQLite database
|
|- etl/
| |- parse_cloudtrail.py # Parses JSON -> CSV
| |- load_to_sql.py # Loads CSV -> SQLite
| |_ detections.py # Runs SQL detection queries
|
|_ README.md
- Parses raw CloudTrail JSON logs into a clean CSV format.
- Loads structured data into a SQLite database.
- Detects anomalies using SQL queries (a sketch of one such query follows this list):
- API call rate spikes
- Activity from unusual regions
- Sensitive or high-risk API calls
- Sudden spikes in event metrics
- Scripts separated into parsing, loading, and detection stages.
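The actual queries live in etl/detections.py; below is a minimal sketch of what one detection (API call rate spikes) could look like. The table name events, the eventTime column, and the threshold of 100 calls per hour are assumptions for illustration and may not match the real schema.

```python
# Hypothetical sketch of one detection query; assumes the load step created an
# "events" table with an ISO 8601 eventTime column (actual schema may differ).
import sqlite3

SPIKE_QUERY = """
SELECT substr(eventTime, 1, 13) AS hour_bucket,  -- e.g. '2023-05-01T12'
       COUNT(*)                 AS call_count
FROM events
GROUP BY hour_bucket
HAVING COUNT(*) > 100            -- example threshold for a call-rate spike
ORDER BY call_count DESC;
"""

conn = sqlite3.connect("db/cloudtrail.db")
for hour_bucket, call_count in conn.execute(SPIKE_QUERY):
    print(f"{hour_bucket}: {call_count} API calls")
conn.close()
```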
- Required installation:
  - pip install pandas
- Place CloudTrail logs in JSON format in the data/ folder.
- Parse logs into CSV: py etl/parse_cloudtrail.py (sketches of the parse and load steps follow this list)
- Load CSV into SQLite database: py etl/load_to_sql.py
- Run detection queries: py etl/detections.py
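A minimal sketch of the parse step, assuming the sample file uses CloudTrail's standard top-level "Records" array; the output CSV name (data/cloudtrail_events.csv) is illustrative and may not match the actual script.

```python
# Sketch of the parse step (the real etl/parse_cloudtrail.py may differ).
# CloudTrail log files wrap events in a top-level "Records" array.
import json
import pandas as pd

with open("data/cloudtrail_logs.json") as f:
    records = json.load(f).get("Records", [])

# Flatten nested event fields (userIdentity, requestParameters, ...) into columns.
df = pd.json_normalize(records)
df.to_csv("data/cloudtrail_events.csv", index=False)
print(f"Wrote {len(df)} events to data/cloudtrail_events.csv")
```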
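The load step could then push that CSV into SQLite with pandas. The table name events is an assumption chosen to match the detection sketch above.

```python
# Sketch of the load step (the real etl/load_to_sql.py may differ).
import sqlite3
import pandas as pd

df = pd.read_csv("data/cloudtrail_events.csv")  # output of the parse sketch

conn = sqlite3.connect("db/cloudtrail.db")
# Replace any existing rows so repeated loads stay idempotent.
df.to_sql("events", conn, if_exists="replace", index=False)
conn.close()
```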
- Python
- pandas
- SQLite
- SQL
- AWS CloudTrail
- Automate log ingestion
- Allow for file types other than JSON
- Integrate with AWS S3 buckets for continuous updates
- Build dashboard for visualization of anomalies