Skip to content

Project repository for DA 204o Data Science in Practice (Aug semester 2024) @ IISc BLR

Notifications You must be signed in to change notification settings

Shambo-Samanta/DA-204o

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Phishing Analysis

Project repository for DA 204o Data Science in Practice (Aug semester 2024) @ IISc BLR

Project Purpose

Enable a phishing URL detection system.

Dataset

Source: PhiUSIIL Phishing URL (Website)

Summary: PhiUSIIL Phishing URL Dataset is a substantial dataset comprising 134,850 legitimate and 100,945 phishing URLs. Most of the URLs we analyzed, while constructing the dataset, are the latest URLs. Features are extracted from the source code of the webpage and URL. Features such as CharContinuationRate, URLTitleMatchScore, URLCharProb, and TLDLegitimateProb are derived from existing features.

Additional Info:

  1. Column "FILENAME" can be ignored.
  2. Label 1 corresponds to a legitimate URL, label 0 to a phishing URL

Contributors

  1. Deepansh Sood
  2. Shambo Samanta
  3. Sudipta Ghosh
  4. Sourajit Bhar

About

Project repository for DA 204o Data Science in Practice (Aug semester 2024) @ IISc BLR

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%