- 1. Executive Summary
- 2. Core Features
- 3. System Architecture
- 4. Technology Stack & Rationale
- 5. Project Structure
- 6. Local Development Setup
- 7. API Endpoints
- 8. Development Guidelines
- 9. Contributing
- 10. License
- 11. Contact
## 1. Executive Summary

LearnLoom is a full-stack EdTech web application engineered to provide educators with actionable, data-driven insights into student learning patterns. By processing and visualizing academic datasets, the platform transforms raw data into a strategic tool for improving educational outcomes.
The system's architecture is built on a decoupled frontend and backend, communicating via a RESTful API. The backend, powered by Python and Flask, is responsible for data ingestion, cleaning, and serving aggregated analytics. The frontend, a modern React/Vite single-page application, provides a responsive and intuitive interface for data visualization and interaction. A key feature is the integration with Google's Gemini API to deliver qualitative, AI-generated summaries of quantitative data, bridging the gap between numbers and narrative.
## 2. Core Features

- **Centralized Analytics Dashboard:** A high-performance, responsive UI presenting a holistic view of student data through interactive charts and KPI cards.
- **Key Performance Indicators (KPIs):** At-a-glance metrics including overall course completion rates, average student scores, and total enrollment figures.
- **Trend Analysis:** Time-series visualizations of student performance and engagement, enabling educators to identify learning patterns and potential intervention points.
- **AI-Generated Narrative Insights:** Leverages the Google Gemini API to synthesize complex data into concise, human-readable summaries and actionable recommendations.
- **Automated Data Pipeline:** A backend process for fetching, cleaning, and preparing datasets from external sources like Kaggle, ensuring data integrity.
- **RESTful API Architecture:** A clearly defined API contract ensures a stable and scalable interface between the client and server.
## 3. System Architecture

The application employs a classic client-server model with a decoupled architecture, promoting separation of concerns and independent scalability.
### 3.1. Frontend (Client)

The frontend is a Single-Page Application (SPA) built with React. It is responsible for all presentation logic. It does not contain any business logic; instead, it queries the backend API for data and renders it. State management is handled within React components, and API interactions are centralized for maintainability.
### 3.2. Backend (Server)

The backend is a stateless RESTful API built with Flask. Its primary responsibilities are (a minimal endpoint sketch follows this list):
- **Exposing Data:** It provides structured JSON endpoints for the frontend to consume.
- **Business Logic:** It performs all data calculations, aggregations, and analysis.
- **Data Persistence:** It manages the lifecycle of the data, from raw file to cleaned, in-memory DataFrame.
- **Third-Party Integration:** It securely communicates with the Google Gemini API, abstracting this complexity away from the client.
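To make the division of labor concrete, below is a minimal sketch of a stateless Flask endpoint served from an in-memory DataFrame. It is illustrative only: the `score` column name and the route body are assumptions, not the project's actual code.

```python
# Minimal sketch: a stateless Flask JSON endpoint backed by an in-memory
# DataFrame. Illustrative only; the `score` column name is an assumption.
import pandas as pd
from flask import Flask, jsonify

app = Flask(__name__)

# Load the cleaned dataset once at startup; requests are then served from memory.
df = pd.read_csv("data/cleaned/cleaned_students.csv")

@app.route("/api/average-score")
def average_score():
    # All aggregation happens server-side; the client only renders the result.
    return jsonify({"average_score": round(float(df["score"].mean()), 2)})

if __name__ == "__main__":
    app.run(port=5000)
```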
### 3.3. Data Pipeline

The application relies on a simple, script-driven ETL (Extract, Transform, Load) process:
- **Extract:** The `refresh-data` endpoint triggers a Python script that uses the Kaggle API to download a raw CSV dataset into the `backend/data/raw/` directory.
- **Transform:** The raw data is then processed by a cleaning service (`services/data_cleaning.py`). This step standardizes column names, handles missing values, removes duplicates, and ensures data types are correct. The cleaned data is saved as a new CSV in `backend/data/cleaned/` (see the sketch after this list).
- **Load:** When the Flask server starts, it loads the cleaned CSV into a `pandas` DataFrame, which is then held in memory to serve API requests quickly. This in-memory approach is suitable for datasets of this size and provides low-latency query responses.
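As a rough illustration of the Transform step, the cleaning service might look something like the following; the column names (`student_id`, `score`) are hypothetical, not taken from the real dataset.

```python
# Condensed sketch of the Transform step (cf. services/data_cleaning.py).
# Column names such as `student_id` and `score` are hypothetical.
import pandas as pd

def clean_dataset(raw_path: str, cleaned_path: str) -> pd.DataFrame:
    df = pd.read_csv(raw_path)
    # Standardize column names: trimmed, lowercased, underscores for spaces.
    df.columns = df.columns.str.strip().str.lower().str.replace(" ", "_")
    # Drop exact duplicates and rows missing the key identifier.
    df = df.drop_duplicates().dropna(subset=["student_id"])
    # Coerce numeric types; unparseable values become NaN instead of raising.
    df["score"] = pd.to_numeric(df["score"], errors="coerce")
    df.to_csv(cleaned_path, index=False)
    return df
```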
## 4. Technology Stack & Rationale

The technology choices prioritize development speed, scalability, and maintainability.
### 4.1. Frontend

- **Framework:** React (v18.2.0+)
  - Why? Its component-based architecture is ideal for building a modular and maintainable UI. The vast ecosystem and community support accelerate development.
- **Build Tool:** Vite (v4.4.5+)
  - Why? Vite offers a significantly faster development experience than traditional bundlers, with near-instant Hot Module Replacement (HMR) and optimized build outputs.
- **Language:** TypeScript
  - Why? It provides static typing, which reduces runtime errors, improves code quality, and makes the codebase easier to refactor and scale.
- **Styling:** Tailwind CSS (v3)
  - Why? A utility-first CSS framework that enables rapid UI development without leaving the HTML, promoting consistency and reducing CSS file size.
- **Charting:** Recharts
  - Why? A composable, declarative charting library for React that simplifies the creation of complex, interactive visualizations.
### 4.2. Backend

- **Framework:** Flask (v2.x)
  - Why? As a lightweight and unopinionated micro-framework, Flask is perfect for building RESTful APIs. It provides the essentials without imposing a rigid structure, allowing for flexibility.
- **Language:** Python (v3.9+)
  - Why? The de facto language for data science and machine learning. Its powerful data manipulation libraries (`pandas`) are central to the backend's functionality.
- **Core Libraries:**
  - `pandas`: The cornerstone of the data processing engine, used for efficient data cleaning, transformation, and analysis.
  - `Flask-Cors`: Middleware to handle Cross-Origin Resource Sharing, essential for allowing the frontend (served on a different port) to communicate with the backend (see the sketch after this list).
  - `kaggle`: The official Python client for interacting with the Kaggle API to automate dataset downloads.
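As a quick illustration, wiring up `Flask-Cors` is typically a one-liner; the origin below assumes Vite's default dev-server port and may differ from the project's actual configuration.

```python
# Sketch: allow the Vite dev server to call the Flask API across origins.
# The origin is an assumption (Vite's default port), not verified project config.
from flask import Flask
from flask_cors import CORS

app = Flask(__name__)
CORS(app, origins=["http://localhost:5173"])
```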
## 5. Project Structure

The monorepo is organized with a clear boundary between the frontend and backend, minimizing coupling.
```text
.
├── backend/              # Python Flask application
│   ├── api/              # Flask Blueprints defining API endpoints
│   ├── config/           # Configuration settings
│   ├── data/             # Data storage (raw and cleaned CSVs)
│   │   ├── cleaned/      # Cleaned datasets
│   │   └── raw/          # Raw downloaded datasets
│   ├── database/         # Database connection and query logic (placeholder)
│   ├── ml/               # Machine Learning models and preprocessing (placeholder)
│   ├── services/         # Core business logic and data processing services
│   ├── utils/            # Utility functions (e.g., Kaggle download)
│   ├── app.py            # Main Flask application entry point
│   └── requirements.txt  # Python dependency list
│
├── frontend/             # React Vite application
│   ├── components/       # Reusable React components (e.g., charts, cards)
│   ├── public/           # Static assets served as-is (e.g., icons, images)
│   ├── src/              # Frontend source code
│   │   ├── api/          # Centralized API client for backend communication
│   │   └── ...           # Other frontend modules (e.g., pages, utilities)
│   ├── App.tsx           # Main React application component
│   ├── index.html        # Main HTML entry point
│   ├── package.json      # Frontend dependencies and scripts
│   ├── tsconfig.json     # TypeScript configuration
│   ├── vite.config.ts    # Vite build configuration
│   └── .env              # Environment variables (e.g., API keys)
│
└── README.md             # Project documentation (this file)
```
## 6. Local Development Setup

Follow these instructions to set up and run the project on your local machine.
### 6.1. Prerequisites

Ensure you have the following installed:
- **Git:** Download & Install Git
- **Python 3.9+:** Download & Install Python
- **Node.js LTS (v18+ or v20+):** Download & Install Node.js (includes `npm`)
- **Visual Studio Code (Recommended IDE):** Download & Install VS Code
### 6.2. Repository Setup

1. **Clone the repository:**

   ```bash
   git clone <repository-url>
   cd LearnLoom   # or whatever your project root directory is named
   ```
2. **Create data directories for the backend:**

   ```bash
   mkdir -p backend/data/raw backend/data/cleaned
   ```

   *(Note: `mkdir -p` works on macOS/Linux. On Windows, run `mkdir backend\data\raw` and `mkdir backend\data\cleaned` separately.)*
### 6.3. Backend Setup

1. **Navigate to the backend directory:**

   ```bash
   cd backend
   ```
2. **Create and activate a Python virtual environment:**
   * Windows (PowerShell):

     ```powershell
     python -m venv venv
     .\venv\Scripts\activate
     ```

   * macOS/Linux (Bash/Zsh):

     ```bash
     python3 -m venv venv
     source venv/bin/activate
     ```

   *(Your terminal prompt should now show `(venv)`, indicating the environment is active.)*
3. **Install Python dependencies:**

   ```bash
   pip install -r requirements.txt
   ```
4. **Kaggle API Credentials:** To enable data downloads from Kaggle, set up your API token:
   * Go to your Kaggle account page and click "Create New API Token" to download `kaggle.json`.
   * Place the `kaggle.json` file in the appropriate directory:
     * Windows: `C:\Users\YOUR_USERNAME\.kaggle\`
     * macOS/Linux: `~/.kaggle/` (restrict permissions with `chmod 600 ~/.kaggle/kaggle.json`)

   A sketch of how such a download can be automated follows this list.
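For reference, here is a minimal sketch of the kind of helper the `utils/` module might contain for the Extract step, using the official `kaggle` client. The dataset slug and destination path are placeholders, not the project's actual values.

```python
# Illustrative sketch of a Kaggle download helper (cf. utils/).
# The dataset slug and destination directory are placeholders.
from kaggle.api.kaggle_api_extended import KaggleApi

def download_dataset(dataset_slug: str = "owner/some-dataset",
                     dest_dir: str = "data/raw") -> None:
    api = KaggleApi()
    api.authenticate()  # reads credentials from ~/.kaggle/kaggle.json
    # Download the dataset archive and unzip it into the raw-data directory.
    api.dataset_download_files(dataset_slug, path=dest_dir, unzip=True)

if __name__ == "__main__":
    download_dataset()
```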
### 6.4. Frontend Setup

1. **Navigate to the frontend directory:**

   ```bash
   cd ../frontend   # from the backend directory, or `cd frontend` from the project root
   ```
2. **Set up environment variables:**
   * Create a new file named `.env` in the `frontend` directory.
   * Add your Google Gemini API key to it. You can obtain a key from Google AI Studio.
   * The content of your `.env` file should be:

     ```
     VITE_GEMINI_API_KEY="YOUR_API_KEY_HERE"
     ```

   *(Vite exposes only variables prefixed with `VITE_` to client code, via `import.meta.env`.)*
3. **Install Node.js dependencies:**
```bash
npm install
```
### 6.5. Data Initialization
Before running the application, you must initialize the data by triggering the backend's data refresh process.
1. **Start the Backend Server:**
* Ensure you are in the `backend` directory with your virtual environment activated.
* Run: `python app.py`
2. **Trigger Data Refresh:**
* Open a **new terminal** (do not close the backend server terminal).
* Send a `POST` request to the refresh endpoint. This will download the Kaggle dataset, clean it, and save it to `backend/data/cleaned/cleaned_students.csv`.
```bash
curl -X POST http://127.0.0.1:5000/api/refresh-data
```
*(This command may take a moment to complete.)*
3. **Restart Backend (Important):**
* Go back to your backend server terminal.
* Press `Ctrl+C` to stop the server.
* Restart it: `python app.py`
*(This ensures the backend loads the newly created `cleaned_students.csv` file.)*
### 6.6. Running the Application
With the backend set up and the data initialized:
1. **Start the Backend Server:** (if not already running from section 6.5)
* In the `backend` directory (with venv active): `python app.py`
2. **Start the Frontend Development Server:**
* In the `frontend` directory: `npm run dev`
3. **Access the Application:**
* Open your web browser and navigate to `http://localhost:5173` (or the port indicated by `npm run dev`).
## 7. API Endpoints
The backend exposes a comprehensive set of RESTful API endpoints. For detailed request/response schemas, refer to `docs/api_contract.md`.
| Method | Endpoint | Description |
| :----- | :------------------------------------- | :----------------------------------------------------------------------- |
| `GET` | `/api/health` | Checks if the backend server is operational. |
| `GET` | `/api/dashboard-data` | Retrieves all consolidated data required for the main dashboard view. |
| `GET` | `/api/average-score` | Returns the overall average student score. |
| `GET` | `/api/completion-rate` | Returns the percentage of students who completed their courses. |
| `GET` | `/api/dropout-rate` | Returns the percentage of students who dropped out. |
| `GET` | `/api/total-students` | Returns the total number of students in the dataset. |
| `GET` | `/api/active-students` | Returns the number of currently active students. |
| `GET` | `/api/score-trend` | Provides data for visualizing score trends over time. |
| `POST` | `/api/ai-summary` | Generates an AI-powered summary of learning insights. |
| `POST` | `/api/predict` | Predicts student completion likelihood based on input data. |
| `POST` | `/api/refresh-data` | Triggers the data download, cleaning, and loading process. |
| `GET` | `/api/student/{student_id}/profile` | Retrieves a detailed profile for a specific student. |
| `GET` | `/api/system-status` | Provides the current status of backend system components. |
| `GET` | `/api/course-analytics` | Returns analytics for all courses. |
| `GET` | `/api/top-courses` | Identifies and returns top-performing courses. |
| `GET` | `/api/hardest-courses` | Identifies and returns courses with the lowest average scores. |
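For a quick smoke test against a locally running backend, the endpoints can be exercised from Python. The response shapes and the `/api/predict` fields shown here are illustrative assumptions; `docs/api_contract.md` remains the authoritative reference.

```python
# Smoke-test sketch for a locally running backend (requires `pip install requests`).
# Response shapes and /api/predict fields are assumptions, not the documented contract.
import requests

BASE = "http://127.0.0.1:5000"

print(requests.get(f"{BASE}/api/health").json())
print(requests.get(f"{BASE}/api/average-score").json())

# POST endpoints take a JSON body; the field names here are hypothetical.
resp = requests.post(f"{BASE}/api/predict",
                     json={"attendance_rate": 0.9, "average_score": 81})
print(resp.status_code, resp.json())
```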
## 8. Development Guidelines
- **Code Style:** Adhere to existing code styles (e.g., ESLint for JS/TS, Black/Flake8 for Python).
- **Testing:** Implement unit and integration tests for new features.
- **Documentation:** Keep API contracts and inline comments up-to-date.
- **Environment Variables:** Manage sensitive information using `.env` files.
## 9. Contributing
Contributions are welcome! Please refer to `CONTRIBUTING.md` (if available) for guidelines on how to submit pull requests, report bugs, and suggest features.
## 10. License
This project is licensed under the MIT License. See the `LICENSE` file for details.
## 11. Contact
For questions or support, please open an issue on the GitHub repository or contact [Your Name/Team Email].