docs: Add Data Source Inventory and Ingestion Architecture Analysis (#98) #102

23shivay · 2025-05-08T14:36:15Z

Summary

This PR fulfills Task #98: Create Inventory of Data Sources and Define Standard Data Contracts, a sub-task under the broader initiative to design a scalable and decoupled ingestion framework for real-time and batch data.

Key Outcomes

Created a complete inventory of current data sources
Classified each as real-time or batch
Documented data types, formats, and ingestion frequency
Outlined architecture fit and ingestion approach per source
Included initial design of real-time and batch ingestion pipelines
Identified sources requiring transformation (e.g., XML → JSON)

Artifacts Included

source-summary-table.png: Source-wise frequency, format, and structure
architecture-fit-table.png: Architecture design fit for each source
architecture-diagram.jpg: Conceptual ingestion architecture for real-time and batch sources

Images:

Notes

Data formats and transformation points are defined with future schema standardization (e.g., JSON Schema, Avro) in mind.
The current scope focuses on documentation and analysis; implementation will follow in subsequent tasks.

cc @mvadodariya

23shivay added 3 commits May 8, 2025 01:08

docs: add updated architecture diagram with explanation

7f31e2e

Add data source overview docs and images

cfbdc82

docs: add ingestion sources summary, architecture fit table, and diagram

79a652f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

docs: Add Data Source Inventory and Ingestion Architecture Analysis (#98) #102

docs: Add Data Source Inventory and Ingestion Architecture Analysis (#98) #102

Uh oh!

23shivay commented May 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

docs: Add Data Source Inventory and Ingestion Architecture Analysis (#98) #102

Are you sure you want to change the base?

docs: Add Data Source Inventory and Ingestion Architecture Analysis (#98) #102

Uh oh!

Conversation

23shivay commented May 8, 2025

Summary

Key Outcomes

Artifacts Included

Images:

Notes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant