Scan filesystems for cryptocurrency wallets, keys, seed phrases, and addresses.
SeedSweep is a security auditing tool that searches filesystem drives for cryptocurrency-related files and sensitive cryptographic material. It examines file names and contents to detect BIP39 seed phrases, Bitcoin addresses, Ethereum addresses, and files containing crypto-related keywords.
- Scans
.txt,.docx,.pdffiles for crypto-related content - OCR text extraction from image files (
.png,.jpg,.jpeg) - BIP39 seed phrase detection (12, 15, 18, 21, and 24 word phrases)
- Bitcoin address detection (P2PKH, P2SH, Bech32 formats)
- Ethereum address detection
- Filename keyword matching against 40+ crypto-related terms
- Configurable directory traversal depth
- System directory exclusion (Windows and macOS)
- Concurrent file processing with progress bar
- Excel spreadsheet and plain text report generation
- Python 3.9 or higher
- Tesseract OCR (optional, for image text extraction)
git clone <repository-url>
cd seedsweep
pip install -e .pip install -e ".[dev]"Tesseract is required for OCR scanning of image files. If you don't need
image scanning, use the --no-ocr flag to skip it.
- Windows: Download from https://github.com/tesseract-ocr/tesseract/releases and add the install directory to your system PATH.
- macOS:
brew install tesseract - Linux (Debian/Ubuntu):
sudo apt install tesseract-ocr
# Scan all drives
seedsweep --drives ALL
# Scan specific drives (Windows)
seedsweep --drives C D
# Scan root filesystem (Linux/macOS)
seedsweep --drives /
# Custom output directory and depth
seedsweep --drives ALL --output-dir ./results --max-depth 10
# Disable OCR processing
seedsweep --drives ALL --no-ocr
# Verbose logging
seedsweep --drives ALL -vRun without the --drives flag for interactive drive selection:
seedsweepYou will be prompted to select which drives to scan.
| Flag | Description | Default |
|---|---|---|
--drives DRIVE [DRIVE ...] |
Drives or mount points to scan, or ALL |
Interactive prompt |
--output-dir PATH |
Output directory for result files | ~/Desktop/SeedSweep_Results/ |
--max-depth N |
Maximum directory traversal depth | 5 |
--no-ocr |
Skip OCR on image files | OCR enabled |
-v, --verbose |
Enable debug-level logging | Off |
--version |
Show version and exit |
Results are written to the output directory (default ~/Desktop/SeedSweep_Results/):
| File | Description |
|---|---|
seedsweep_results.txt |
Plain text summary of all findings |
seedsweep_results.xlsx |
Excel spreadsheet with filterable columns |
seedsweep.log |
Full scan log with timestamps |
The Excel spreadsheet includes the following columns: Drive, Top Folder, Bottom Folder, Keyword Match, Seed Phrases/Addresses Found, File Extension, File Name, File Path.
The following constants in seedsweep/scanner.py can be edited to
customize scan behavior:
ALLOWED_EXTENSIONS- File extensions to include in scanningSCAN_KEYWORDS- Keywords that trigger a match when found in filenames or contentEXCLUDED_DIRS- Absolute paths to skip during traversalEXCLUDED_DIR_NAMES- Directory names to skip (exact match, case-insensitive)EXCLUDED_DIR_PATTERNS- Regex patterns for directory names to skip
pip install -e ".[dev]"
pytestThis project is licensed under the MIT License. See the LICENSE file for details.