Log Classification with Hybrid Classification Framework

An advanced log classification system that combines three complementary approaches to handle varying levels of complexity in log patterns. This project provides a flexible and effective solution for processing predictable, complex, and poorly-labeled data patterns with real-time capabilities and analytics insights.

Features

Core Classification Methods

1. Regular Expression (Regex): Handles simple, predictable patterns using predefined rules
2. Sentence Transformer + Logistic Regression: Handles complex patterns that have sufficient labeled training data, using embeddings
3. LLM (Large Language Model): Handles complex patterns when labeled training data is insufficient, using the Groq API
4. Real-time Log Streaming: WebSocket support for classifying logs live as they arrive
5. Analytics Dashboard: Statistics, trends, and insights about classified logs
6. Confidence Scores: Classification confidence levels for better decision-making
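
The way the three methods can hand off to one another is easiest to see in code. The following is a minimal sketch, not the project's actual implementation: the regex rules, the labels, the `Result` type, the `threshold` value, and the two callables passed to `classify` are all hypothetical placeholders. It assumes regex rules are tried first, the embedding classifier second, and the LLM only when the embedding classifier's confidence is low.

```python
import re
from dataclasses import dataclass
from typing import Callable, Optional, Tuple

# Hypothetical rules for illustration; the project's real rules may differ.
REGEX_RULES = [
    (re.compile(r"User \w+ logged (in|out)", re.IGNORECASE), "User Action"),
    (re.compile(r"Backup (started|completed)", re.IGNORECASE), "System Notification"),
]


@dataclass
class Result:
    label: str
    confidence: Optional[float]  # None when the method reports no score
    method: str


def classify_with_regex(message: str) -> Optional[str]:
    """Return the label of the first matching rule, or None if nothing matches."""
    for pattern, label in REGEX_RULES:
        if pattern.search(message):
            return label
    return None


def classify(
    message: str,
    embedding_classifier: Callable[[str], Tuple[str, float]],
    llm_classifier: Callable[[str], str],
    threshold: float = 0.5,  # assumed cut-off, not taken from the project
) -> Result:
    """Try the cheapest method first and fall back to more expensive ones."""
    label = classify_with_regex(message)
    if label is not None:
        # Assumption: a rule match is treated as fully confident.
        return Result(label, 1.0, "regex")

    label, confidence = embedding_classifier(message)
    if confidence >= threshold:
        return Result(label, confidence, "embedding")

    # Low confidence: defer to the LLM, which returns a label without a score.
    return Result(llm_classifier(message), None, "llm")
```

In a real setup, `embedding_classifier` would wrap the sentence-transformer and logistic regression model, and `llm_classifier` would wrap a Groq API call; here they are left as plain callables so the control flow can be tried with stubs.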

Setup Instructions

Prerequisites

Python 3.8 or higher
pip package manager

Installation

Clone the repository:

git clone https://github.com/labdhiongithub7/log_classification.git
cd log_classification

Create a virtual environment (recommended):

python -m venv venv

# On Windows
venv\Scripts\activate

# On Linux/Mac
source venv/bin/activate

Install dependencies:
```
pip install -r requirements.txt
```
Set up environment variables: Create a .env file in the root directory:
```
GROQ_API_KEY=your_groq_api_key_here
```
Get your API key from the Groq Console.
Download models (if needed): The BERT model is downloaded automatically on first use. Make sure the models/ directory contains log_classifier.joblib.
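
Before starting the server, it can help to confirm that the two prerequisites above are in place. The sketch below is a hypothetical preflight helper, not part of the repository: `read_env_file` and `preflight` are illustrative names, and the small `.env` parser is a standard-library stand-in for whatever loader the project actually uses.

```python
import os
from pathlib import Path


def read_env_file(path) -> dict:
    """Parse simple KEY=VALUE lines from a .env file (comments and blanks skipped)."""
    values = {}
    env_path = Path(path)
    if not env_path.is_file():
        return values
    for raw in env_path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def preflight(root=".") -> list:
    """Return a list of human-readable problems; an empty list means ready."""
    root = Path(root)
    problems = []

    # The key may come from the process environment or from the .env file.
    env = read_env_file(root / ".env")
    key = os.environ.get("GROQ_API_KEY") or env.get("GROQ_API_KEY")
    if not key or key == "your_groq_api_key_here":
        problems.append("GROQ_API_KEY is missing or still set to the placeholder")

    # The trained classifier is expected at models/log_classifier.joblib.
    if not (root / "models" / "log_classifier.joblib").is_file():
        problems.append("models/log_classifier.joblib not found")

    return problems
```

Running `preflight()` from the repository root and printing the returned list gives a quick indication of what still needs to be set up.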

Running the Server

Start the FastAPI server:

uvicorn server:app --reload

The server will be available at:

Main endpoint: http://127.0.0.1:8000
Interactive API docs: http://127.0.0.1:8000/docs
Alternative docs: http://127.0.0.1:8000/redoc
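
To confirm from a script that the server started, one option is to request the interactive docs page listed above. This is a minimal sketch using only the standard library; `server_is_up` is a hypothetical helper name, and it assumes the default host and port shown in the URLs above.

```python
import urllib.request


def server_is_up(base_url: str = "http://127.0.0.1:8000", timeout: float = 2.0) -> bool:
    """Return True if the /docs page answers with HTTP 200, False otherwise."""
    try:
        with urllib.request.urlopen(f"{base_url}/docs", timeout=timeout) as response:
            return response.status == 200
    except OSError:
        # Covers connection refused, timeouts, and HTTP error responses
        # (urllib.error.URLError and HTTPError are subclasses of OSError).
        return False
```

Calling `server_is_up()` after running the uvicorn command should return `True` once startup has finished; it returns `False` if nothing is listening on that address.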
