GitHub - Junghwan-Oh/autoresearch-trading: Nunchi's autoresearch trading

Autonomous Trading Strategy Research

Karpathy-style autoresearch for Hyperliquid perpetual futures — 103 experiments, zero human intervention

Agent CLI • Docs • Research • Discord • X

An AI agent autonomously modifies a single file (strategy.py), backtests each change against historical Hyperliquid perp data, and keeps only improvements. Adapts Karpathy's autoresearch pattern for trading strategy discovery. Starting from a simple momentum baseline (Sharpe 2.7), the system discovered a 6-signal ensemble strategy achieving Sharpe 21.4 with 0.3% max drawdown — a 7.9x improvement over 103 fully autonomous experiments.

Quick Start

Prerequisites

Python 3.10+
uv — fast Python package manager

# Install uv if you don't have it
curl -LsSf https://astral.sh/uv/install.sh | sh

Setup

git clone https://github.com/Nunchi-trade/auto-researchtrading.git
cd auto-researchtrading
uv run prepare.py                # Download data (~1 min, cached to ~/.cache/autotrader/data/)

No API keys required. Data is fetched from public CryptoCompare and Hyperliquid APIs.

Run a Backtest

uv run backtest.py               # Run current strategy against validation data

score:              20.634000
sharpe:             20.634000
total_return_pct:   130.000000
max_drawdown_pct:   0.300000
num_trades:         7605

Run All Benchmarks

uv run run_benchmarks.py         # Compare 5 reference strategies

Running Your Own Experiments

Rules

Rule	Detail
Only edit `strategy.py`	This is the single mutable file
Do not modify	`prepare.py`, `backtest.py`, or anything in `benchmarks/`
No new dependencies	Only `numpy`, `pandas`, `scipy`, `requests`, `pyarrow`, and stdlib
Time budget	120 seconds per backtest

Manual Experiment Loop

git checkout -b autotrader/myexp          # 1. Create experiment branch

# 2. Edit strategy.py with your idea (parameters, signals, entry/exit logic)

uv run backtest.py                        # 3. Run the backtest

# 4. If score improved → keep
git add strategy.py && git commit -m "exp1: description of change"

# 5. If score got worse → revert
git reset --hard HEAD~1

Repeat. Each commit is one atomic experiment. The git history becomes your experiment log.

Autonomous Loop (with Claude Code)

The intended workflow uses Claude Code with the /autoresearch skill to run experiments without human intervention:

claude                           # Start Claude Code from repo root
/autoresearch                    # Launch the autonomous loop

The agent will:

Read the current strategy and scores
Propose and implement a modification to strategy.py
Run uv run backtest.py and parse the score
Keep the change if score improved, revert if not
Repeat indefinitely until interrupted

See program.md for detailed instructions on guiding the autonomous loop.

Strategy Interface

Your strategy must implement a Strategy class with a single on_bar() method — no shared state, no hidden coupling.

class Strategy:
    def __init__(self):
        # Initialize any tracking state
        pass

    def on_bar(self, bar_data: dict, portfolio: PortfolioState) -> list[Signal]:
        """
        Called once per hourly bar across all symbols.

        Args:
            bar_data: dict of symbol → BarData
                - BarData.close, .open, .high, .low, .volume, .funding_rate
                - BarData.history: DataFrame of last 500 bars
            portfolio: PortfolioState
                - portfolio.cash: available cash
                - portfolio.positions: dict of symbol → signed USD notional

        Returns:
            List of Signal(symbol, target_position, order_type="market")
            target_position is signed USD notional (+long, -short, 0=close)
        """
        return []

Data Available

Field	Description
`bar_data[symbol].history`	DataFrame of last 500 hourly bars
Columns	`timestamp`, `open`, `high`, `low`, `close`, `volume`, `funding_rate`
Symbols	BTC, ETH, SOL
Validation period	2024-07-01 to 2025-03-31
Initial capital	$100,000
Fees	2 bps maker, 5 bps taker, 1 bps slippage

Scoring Formula

score = sharpe × √(min(trades/50, 1.0)) − drawdown_penalty − turnover_penalty

Component	Formula
Sharpe	`mean(daily_returns) / std(daily_returns) × √365`
Drawdown penalty	`max(0, max_drawdown_pct − 15) × 0.05`
Turnover penalty	`max(0, annual_turnover/capital − 500) × 0.001`
Hard cutoffs (→ −999)	Fewer than 10 trades, drawdown > 50%, lost > 50% of capital

Benchmarks

5 reference strategies to beat. The baseline to clear is 2.724.

Rank	Strategy	Score	Sharpe	Return	Max DD	Trades
1	`simple_momentum`	2.724	2.724	+42.6%	7.6%	9081
2	`funding_arb`	-0.191	-0.191	-1.3%	9.4%	1403
3	`regime_mm`	-0.322	-0.322	-3.1%	11.2%	12854
4	`mean_reversion`	-3.964	-3.380	-26.2%	26.7%	3185
5	`momentum_breakout`	-999	—	—	—	0

Results

Score Progression (103 Autonomous Experiments)

Experiment	Score	Sharpe	Max DD	Trades	Key Change
Baseline	2.724	2.724	7.6%	9081	Simple momentum starting point
exp15	8.393	8.823	3.1%	2562	5-signal ensemble, 4/5 votes, cooldown
exp28	9.382	9.944	3.0%	2545	ATR 5.5 trailing stop
exp37	10.305	11.125	2.3%	3212	BB width compression (6th signal)
exp42	11.302	11.886	1.4%	3024	Remove funding boost
exp46	13.480	14.015	1.4%	3157	Remove strength scaling
exp56	14.592	14.666	0.7%	4205	Cooldown 3
exp66	15.718	15.849	0.7%	4467	Simplified momentum
exp72	19.697	20.099	0.7%	6283	RSI period 8
exp86	19.859	20.498	0.6%	7534	Cooldown 2
exp102	20.634	20.634	0.3%	7605	RSI 50/50, BB 85, position 0.08

Final score: 20.634 — 7.6x improvement over baseline, fully autonomous.

Key Discoveries

Rank	Discovery	Impact	Insight
1	RSI period 8	+5.0 Sharpe	Standard 14-period RSI is too slow for hourly crypto
2	Remove strength scaling	+1.7 Sharpe	Uniform sizing beats momentum-weighted sizing
3	Simplified momentum	+0.8 Sharpe	Just `ret > threshold`, no multi-timeframe confirmation needed
4	BB width compression	+0.9 Sharpe	Bollinger Band width percentile as 6th ensemble signal
5	ATR 5.5 trailing stop	+1.0 Sharpe	Hold winners much longer than conventional 3.5x ATR
6	The Great Simplification	+2.0 Sharpe	Removing pyramiding, funding boost, BTC filter, correlation filter
7	Position size 0.08	+0.6 Sharpe	Smaller positions eliminate turnover penalty

Biggest Lesson: Simplicity Wins

The strongest gains came from removing complexity, not adding it. Every "smart" feature — BTC lead-lag filter, correlation-based weight adjustment, momentum strength scaling, pyramiding, funding carry — was tested, then permanently removed when it hurt performance. The final strategy is remarkably simple.

See STRATEGIES.md for the complete evolution log with mathematical details for all 103 experiments.

Best Strategy Architecture

6-signal ensemble with 4/6 majority vote:

Signal	Bull Condition	Bear Condition
Momentum	12h return > dynamic threshold	12h return < -dynamic threshold
Very-short momentum	6h return > threshold × 0.7	6h return < -threshold × 0.7
EMA crossover	EMA(7) > EMA(26)	EMA(7) < EMA(26)
RSI(8)	RSI > 50	RSI < 50
MACD(14,23,9)	MACD histogram > 0	MACD histogram < 0
BB compression	BB width < 85th percentile	BB width < 85th percentile

Exit conditions (priority order):

ATR trailing stop — 5.5x ATR from peak/trough
RSI mean-reversion — Exit longs at RSI > 69, exit shorts at RSI < 31
Signal flip — Reverse position when opposing ensemble fires

Key parameters:

Parameter	Value	Purpose
`BASE_POSITION_PCT`	0.08	Per-symbol position size as fraction of equity
`COOLDOWN_BARS`	2	Minimum bars between exit and re-entry
`RSI_PERIOD`	8	Fast RSI tuned for hourly crypto
`ATR_STOP_MULT`	5.5	Wide trailing stop to let winners run
`MIN_VOTES`	4	Majority vote threshold (4 of 6 signals)

Project Structure

├── strategy.py          # The only file you edit — your strategy lives here
├── backtest.py          # Entry point — runs one backtest (fixed, do not modify)
├── prepare.py           # Data download + backtest engine (fixed, do not modify)
├── run_benchmarks.py    # Run all 5 benchmark strategies
├── benchmarks/          # 5 reference strategies for comparison
│   ├── simple_momentum.py
│   ├── funding_arb.py
│   ├── regime_mm.py
│   ├── mean_reversion.py
│   └── momentum_breakout.py
├── program.md           # Detailed instructions for the autonomous loop
├── STRATEGIES.md        # Complete evolution log of all 103 experiments
├── charts/              # Visualization PNGs of experiment progression
├── pyproject.toml       # Dependencies (numpy, pandas, scipy, requests, pyarrow)
└── uv.lock              # Locked dependencies for reproducibility

Branches

Branch	Description
`main`	Base scaffold and data pipeline
`autotrader/mar10c`	Best autotrader strategy (score 20.634)
`autoresearch/mar10-opus`	LLM training optimization experiments

Attribution

Built on Karpathy's autoresearch pattern. Data from CryptoCompare and Hyperliquid.

Links

Agent CLI — github.com/Nunchi-trade/agent-cli
Docs — docs.nunchi.trade
Research — research.nunchi.trade
Discord — discord.gg/nunchi
X — @nunchi

_{Built by Nunchi • MIT License}

Name		Name	Last commit message	Last commit date
Latest commit History 87 Commits
assets		assets
benchmarks		benchmarks
charts		charts
.gitignore		.gitignore
POST.md		POST.md
README.md		README.md
STRATEGIES.md		STRATEGIES.md
TWITTER_THREAD.md		TWITTER_THREAD.md
autoresearch-results.tsv		autoresearch-results.tsv
backtest.py		backtest.py
equity_curve.csv		equity_curve.csv
equity_curve_baseline.csv		equity_curve_baseline.csv
equity_curve_exp102.csv		equity_curve_exp102.csv
equity_curve_exp15.csv		equity_curve_exp15.csv
equity_curve_exp46.csv		equity_curve_exp46.csv
equity_curve_exp72.csv		equity_curve_exp72.csv
export_equity.py		export_equity.py
export_milestones.py		export_milestones.py
generate_charts.py		generate_charts.py
prepare.py		prepare.py
program.md		program.md
pyproject.toml		pyproject.toml
results.tsv		results.tsv
run_benchmarks.py		run_benchmarks.py
strategy.py		strategy.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Autonomous Trading Strategy Research

Quick Start

Prerequisites

Setup

Run a Backtest

Run All Benchmarks

Running Your Own Experiments

Rules

Manual Experiment Loop

Autonomous Loop (with Claude Code)

Strategy Interface

Data Available

Scoring Formula

Benchmarks

Results

Score Progression (103 Autonomous Experiments)

Key Discoveries

Biggest Lesson: Simplicity Wins

Best Strategy Architecture

Project Structure

Branches

Attribution

Links

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Autonomous Trading Strategy Research

Quick Start

Prerequisites

Setup

Run a Backtest

Run All Benchmarks

Running Your Own Experiments

Rules

Manual Experiment Loop

Autonomous Loop (with Claude Code)

Strategy Interface

Data Available

Scoring Formula

Benchmarks

Results

Score Progression (103 Autonomous Experiments)

Key Discoveries

Biggest Lesson: Simplicity Wins

Best Strategy Architecture

Project Structure

Branches

Attribution

Links

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages