SmartTaintRL

Efficient Detection and Localization of Bad Randomness Vulnerabilities in Ethereum Smart Contracts via Reinforcement Learning

Overview

SmartTaintRL is a framework that combines taint analysis with deep reinforcement learning to detect and localize Bad Randomness vulnerabilities in Ethereum smart contracts. Unlike traditional approaches that rely on pattern matching or exhaustive path exploration, our method uses a Deep Q-Network (DQN) agent to intelligently prioritize high-risk execution paths.

Key Features

High Accuracy: Achieves F1-score of 0.955 on balanced datasets and 0.950 on imbalanced ones
Efficient Path Pruning: Reduces search space by 45% while maintaining 96% recall
Precise Localization: 92.9% function-level accuracy in identifying vulnerable code
Robust to Class Imbalance: Less than 1% F1 degradation under real-world conditions (95:5 ratio)

Architecture

┌─────────────────────────────────────────────────────────────────┐
│                     SmartTaintRL Pipeline                       │
├─────────────────────────────────────────────────────────────────┤
│                                                                 │
│  ┌──────────────┐    ┌──────────────┐    ┌──────────────┐      │
│  │   Phase 1    │    │   Phase 2    │    │   Phase 3    │      │
│  │              │    │              │    │              │      │
│  │ Taint        │───▶│ RL-based    │───▶│ Vulnerability│      │
│  │ Analysis     │    │ Path        │    │ Localization │      │
│  │              │    │ Prioritization   │              │      │
│  └──────────────┘    └──────────────┘    └──────────────┘      │
│                                                                 │
│  • Extract paths     • DQN agent        • Function-level       │
│  • 100-dim features  • Pool management  • Node-level           │
│  • Source-sink pairs • Hierarchical     • Gradient-based       │
│                        rewards            attribution          │
└─────────────────────────────────────────────────────────────────┘

Bad Randomness Vulnerability

Bad Randomness is ranked as the 4th most critical smart contract vulnerability by OWASP. Blockchain's deterministic nature prevents true random number generation, and developers often misuse predictable values like:

block.timestamp
block.number
blockhash
block.difficulty

These vulnerabilities have caused significant financial losses, including the SmartBillions hack (400 ETH) and Fomo3D exploit ($3M+).

Installation

Step 1: Clone the Repository

git clone https://github.com/HadisRe/SmartTaintRL.git
cd SmartTaintRL

Step 2: Create Virtual Environment

We recommend using a virtual environment to avoid dependency conflicts.

On Linux/Mac:

python -m venv venv
source venv/bin/activate

On Windows:

python -m venv venv
venv\Scripts\activate

Step 3: Install Dependencies

pip install -r requirements.txt

This will install:

PyTorch (deep learning framework)
NumPy (numerical computing)
Gymnasium (RL environment)
Matplotlib and Seaborn (visualization)

Step 4: Verify Installation

python -c "import torch; import gymnasium; print('Installation successful!')"

Project Structure

SmartTaintRL/
├── src/
│   ├── rl_agent/
│   │   ├── dqn_agent.py           # Deep Q-Network and training loop
│   │   ├── env1_data_loader.py    # Contract data loader
│   │   ├── env2_pool_manager.py   # Pool management for path selection
│   │   ├── env3_state_builder.py  # 100-dim state vector construction
│   │   └── env4_environment.py    # Main RL environment (Gymnasium)
│   │
│   ├── preprocessing/
│   │   ├── embed1_profile.py      # Contract profile generation
│   │   ├── embed2_path_db.py      # Path database construction  
│   │   └── embed3_modifiers.py    # Modifier extraction and integration
│   │
│   └── localization/
│       ├── localizer.py           # Function and node-level localization
│       └── test_localization.py   # Evaluation on ground truth contracts
│
├── data/
│   └── ground_truth/
│       └── final_dataset_14_contracts.pkl  # Ground truth for localization
│
├── requirements.txt
├── LICENSE
└── README.md

# Pre-processed datasets available on Google Drive (see Data section)

Data

Option 1: Use Pre-processed Data (Recommended)

The pre-processed datasets are available on Google Drive due to file size limitations:

Download Dataset from Google Drive

The dataset contains:

contract_profiles_Balanced/: Contract profiles for balanced dataset (200 vulnerable, 200 safe)
contract_profiles_Imbalanced/: Contract profiles for imbalanced dataset (223 vulnerable, 4,083 safe)
path_databases_updated_Balanced/: Path-level features for balanced dataset
path_databases_updated_Imbalanced/: Path-level features for imbalanced dataset

Each path database contains 100-dimensional feature vectors extracted from taint analysis results.

Option 2: Generate Data from Scratch

If you want to process new contracts or regenerate the features, you need to:

Step 1: Run TaintSentinel on your Solidity contracts to generate:

AST files (*_ast.json)
Semantic graphs (*_semantic_graph.json)
Taint analysis results (*_taint_analysis_filtered.json)

Step 2: Run the preprocessing scripts in order:

# Generate contract profiles from TaintSentinel outputs
python src/preprocessing/embed1_profile.py

# Build path databases with 100-dimensional features
python src/preprocessing/embed2_path_db.py

# Extract and integrate modifier information
python src/preprocessing/embed3_modifiers.py

These scripts read the TaintSentinel outputs and create the feature files needed for the RL agent.

Methodology

Phase 1: Taint Analysis (Preprocessing)

This phase uses TaintSentinel for:

Transforming Solidity code into semantic graphs (CFG + DFG)
Identifying taint sources (weak entropy) and sinks (sensitive operations)
Extracting all possible taint paths

The preprocessing scripts (embed*.py) then process these outputs to create 100-dimensional feature vectors for each path, including:

Structural features (path length, node diversity, branch complexity)
Security features (require density, modifier protection, mitigation score)
Semantic features (keccak operations, arithmetic intensity)
Source-sink interaction features

Phase 2: RL-based Path Prioritization

The DQN agent learns to make ANALYZE or SKIP decisions for each path:

State: 20 × 100 matrix (20 paths in pool, 100 features each)
Action: ANALYZE or SKIP for selected path
Reward: Hierarchical reward based on path importance and risk level

Key components:

Dynamic pool management with priority-based refilling
Pattern registry for tracking discovered vulnerability signatures
Hierarchical reward engineering with safety constraints

Phase 3: Vulnerability Localization

After detection, the system localizes vulnerabilities using:

Gradient-based attribution from Q-network
Graph propagation with centrality analysis
Function-level and node-level ranking

The localization module (src/localization/) implements this phase, providing ranked lists of suspicious functions and nodes within each contract.

Comparison with Existing Tools

Method	F1 (Balanced)	F1 (Imbalanced)	Time
Slither	0.450	-	22min
Mythril	0.372	-	>24h
TaintSentinel	0.892	0.611	8.3h
RNVulDet	0.662	0.360	5.5h
SmartTaintRL	0.955	0.950	3h

Dataset

We used a filtered subset of contracts for RL training:

Balanced Dataset: 400 contracts

200 vulnerable contracts
200 safe contracts

Imbalanced Dataset: 4,306 contracts

223 vulnerable contracts
4,083 safe contracts

The imbalanced dataset reflects real-world conditions where vulnerable contracts are significantly fewer than safe ones (approximately 1:18 ratio).

Usage

Training the DQN Agent

To train the model from scratch, run the DQN agent with the path databases:

python src/rl_agent/dqn_agent.py

For optimal results, train the model for 2500 to 3500 episodes. Our experiments show that models trained within this range achieve the best balance between detection accuracy and generalization.

Running Localization

The localization module requires a trained model. After training, you can run vulnerability localization:

python src/localization/test_localization.py

This will evaluate function-level and node-level localization on the 14 ground truth contracts. The localizer uses gradient-based attribution from the trained Q-network to identify vulnerable functions and nodes.

Expected results (as reported in the paper):

Strict Accuracy: 64.3%
Relaxed Accuracy (with caller detection): 92.9%
Node-level P@5: 0.65, R@5: 0.77, F1@5: 0.70

Related Work

This project builds upon:

TaintSentinel - Taint analysis framework used for preprocessing

License

This project is licensed under the MIT License - see the LICENSE file for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SmartTaintRL

Overview

Key Features

Architecture

Bad Randomness Vulnerability

Installation

Step 1: Clone the Repository

Step 2: Create Virtual Environment

Step 3: Install Dependencies

Step 4: Verify Installation

Project Structure

Data

Option 1: Use Pre-processed Data (Recommended)

Option 2: Generate Data from Scratch

Methodology

Phase 1: Taint Analysis (Preprocessing)

Phase 2: RL-based Path Prioritization

Phase 3: Vulnerability Localization

Comparison with Existing Tools

Dataset

Usage

Training the DQN Agent

Running Localization

Related Work

License

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

License

HadisRe/SmartTaintRL

Folders and files

Latest commit

History

Repository files navigation

SmartTaintRL

Overview

Key Features

Architecture

Bad Randomness Vulnerability

Installation

Step 1: Clone the Repository

Step 2: Create Virtual Environment

Step 3: Install Dependencies

Step 4: Verify Installation

Project Structure

Data

Option 1: Use Pre-processed Data (Recommended)

Option 2: Generate Data from Scratch

Methodology

Phase 1: Taint Analysis (Preprocessing)

Phase 2: RL-based Path Prioritization

Phase 3: Vulnerability Localization

Comparison with Existing Tools

Dataset

Usage

Training the DQN Agent

Running Localization

Related Work

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages