🕵️ Inspectra – Semantic Plagiarism & Paraphrase Detector

Inspectra is a web-based tool for detecting plagiarism, paraphrased content, and semantic similarity between documents. It combines Sentence-BERT embeddings, BLEU and ROUGE overlap metrics, and keyword-based web scraping to identify both direct and paraphrased overlaps, going beyond traditional copy-paste detection.

🚀 Features

🔍 Semantic Similarity using Sentence-BERT (paraphrase-MiniLM-L6-v2)
🧠 BLEU & ROUGE metrics for sentence-level evaluation
📊 Plagiarism Percentage Estimation based on similarity matrices
🌐 Web Scraping Detection – checks if the content exists online using:
- KeyBERT for keyword extraction
- Google + Wikipedia search scraping
- Cosine similarity with scraped web content
✨ Highlighting of matching/paraphrased sentences across documents
📎 Supports .txt and .pdf files
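
The feature list does not spell out how the plagiarism percentage is derived from the similarity matrices. The following is a minimal sketch of one plausible approach, not Inspectra's actual code: a sentence counts as flagged when its best match in the reference document reaches a threshold. The function names `plagiarism_percentage` and `sbert_similarity_matrix` and the 0.75 threshold are assumptions made for illustration. Only the model name `paraphrase-MiniLM-L6-v2` comes from the list above.

```python
from typing import List, Sequence


def plagiarism_percentage(
    sim_matrix: Sequence[Sequence[float]], threshold: float = 0.75
) -> float:
    """Percentage of checked sentences whose best match meets the threshold.

    sim_matrix[i][j] is the similarity between sentence i of the checked
    document and sentence j of the reference document. The threshold value
    is an assumption, not a setting taken from Inspectra.
    """
    if not sim_matrix:
        return 0.0
    flagged = sum(1 for row in sim_matrix if row and max(row) >= threshold)
    return 100.0 * flagged / len(sim_matrix)


def sbert_similarity_matrix(doc_a: List[str], doc_b: List[str]) -> List[List[float]]:
    """Cosine-similarity matrix built from Sentence-BERT embeddings.

    The import is lazy so that the scoring function above can be used
    without the sentence-transformers package installed.
    """
    from sentence_transformers import SentenceTransformer, util

    model = SentenceTransformer("paraphrase-MiniLM-L6-v2")
    emb_a = model.encode(doc_a, convert_to_tensor=True)
    emb_b = model.encode(doc_b, convert_to_tensor=True)
    return util.cos_sim(emb_a, emb_b).tolist()


# Hand-written matrix for illustration: 3 checked sentences vs. 2 reference sentences.
example = [
    [0.91, 0.10],  # close match to reference sentence 0
    [0.30, 0.78],  # close match to reference sentence 1
    [0.20, 0.15],  # no close match
]
score = plagiarism_percentage(example, threshold=0.75)
print(f"{score:.1f}% of sentences flagged")
```

With real documents, the matrix would come from something like `sbert_similarity_matrix(sentences_a, sentences_b)` after sentence splitting. A lower threshold catches looser paraphrases at the cost of more false positives.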

🧱 Tech Stack

Layer	Tool/Library
Backend	Python, Sentence Transformers, NLTK, ROUGE
Web Scraping	BeautifulSoup, Requests, KeyBERT
Similarity Engine	Cosine Similarity, ROUGE-L
Frontend	Streamlit
Preprocessing	NumPy, Regex, TF-IDF
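
To make the ROUGE-L entry in the table concrete, here is a small dependency-free sketch of how a ROUGE-L style score can be computed from the longest common subsequence (LCS) of two token sequences. It is an illustration only: the helper names `lcs_length` and `rouge_l_f1`, the whitespace tokenization, and the balanced F1 weighting are all assumptions, and Inspectra itself may rely on a ROUGE library with different settings.

```python
from typing import List


def lcs_length(a: List[str], b: List[str]) -> int:
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for tok_a in a:
        curr = [0]
        for j, tok_b in enumerate(b, start=1):
            if tok_a == tok_b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(candidate: str, reference: str) -> float:
    """ROUGE-L style F1 score using lowercase whitespace tokens."""
    cand = candidate.lower().split()
    ref = reference.lower().split()
    if not cand or not ref:
        return 0.0
    lcs = lcs_length(cand, ref)
    if lcs == 0:
        return 0.0
    precision = lcs / len(cand)  # share of candidate tokens in the LCS
    recall = lcs / len(ref)      # share of reference tokens in the LCS
    return 2 * precision * recall / (precision + recall)


rouge = rouge_l_f1("the cat sat on the mat", "the cat lay on the mat")
print(f"ROUGE-L F1: {rouge:.3f}")
```

Because the LCS preserves word order but allows gaps, this score stays high when a sentence is lightly edited, which is why it pairs well with embedding-based cosine similarity for catching paraphrases.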

