Skip to content

creativeskyai/seed-sweep

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SeedSweep

Python License Version Tests Platform

Scan filesystems for cryptocurrency wallets, keys, seed phrases, and addresses.

Overview

SeedSweep is a security auditing tool that searches filesystem drives for cryptocurrency-related files and sensitive cryptographic material. It examines file names and contents to detect BIP39 seed phrases, Bitcoin addresses, Ethereum addresses, and files containing crypto-related keywords.

Features

  • Scans .txt, .docx, .pdf files for crypto-related content
  • OCR text extraction from image files (.png, .jpg, .jpeg)
  • BIP39 seed phrase detection (12, 15, 18, 21, and 24 word phrases)
  • Bitcoin address detection (P2PKH, P2SH, Bech32 formats)
  • Ethereum address detection
  • Filename keyword matching against 40+ crypto-related terms
  • Configurable directory traversal depth
  • System directory exclusion (Windows and macOS)
  • Concurrent file processing with progress bar
  • Excel spreadsheet and plain text report generation

Requirements

  • Python 3.9 or higher
  • Tesseract OCR (optional, for image text extraction)

Installation

From source

git clone <repository-url>
cd seedsweep
pip install -e .

Install development dependencies

pip install -e ".[dev]"

Install Tesseract OCR

Tesseract is required for OCR scanning of image files. If you don't need image scanning, use the --no-ocr flag to skip it.

Usage

Command-line interface

# Scan all drives
seedsweep --drives ALL

# Scan specific drives (Windows)
seedsweep --drives C D

# Scan root filesystem (Linux/macOS)
seedsweep --drives /

# Custom output directory and depth
seedsweep --drives ALL --output-dir ./results --max-depth 10

# Disable OCR processing
seedsweep --drives ALL --no-ocr

# Verbose logging
seedsweep --drives ALL -v

Interactive mode

Run without the --drives flag for interactive drive selection:

seedsweep

You will be prompted to select which drives to scan.

Options

Flag Description Default
--drives DRIVE [DRIVE ...] Drives or mount points to scan, or ALL Interactive prompt
--output-dir PATH Output directory for result files ~/Desktop/SeedSweep_Results/
--max-depth N Maximum directory traversal depth 5
--no-ocr Skip OCR on image files OCR enabled
-v, --verbose Enable debug-level logging Off
--version Show version and exit

Output

Results are written to the output directory (default ~/Desktop/SeedSweep_Results/):

File Description
seedsweep_results.txt Plain text summary of all findings
seedsweep_results.xlsx Excel spreadsheet with filterable columns
seedsweep.log Full scan log with timestamps

The Excel spreadsheet includes the following columns: Drive, Top Folder, Bottom Folder, Keyword Match, Seed Phrases/Addresses Found, File Extension, File Name, File Path.

Configuration

The following constants in seedsweep/scanner.py can be edited to customize scan behavior:

  • ALLOWED_EXTENSIONS - File extensions to include in scanning
  • SCAN_KEYWORDS - Keywords that trigger a match when found in filenames or content
  • EXCLUDED_DIRS - Absolute paths to skip during traversal
  • EXCLUDED_DIR_NAMES - Directory names to skip (exact match, case-insensitive)
  • EXCLUDED_DIR_PATTERNS - Regex patterns for directory names to skip

Running Tests

pip install -e ".[dev]"
pytest

License

This project is licensed under the MIT License. See the LICENSE file for details.

About

Scan filesystems for cryptocurrency wallets, keys, seed phrases, and addresses.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages