BTC RL Trader

Production-grade reinforcement learning trend-following trading platform for BTC/USD.

Overview

This system implements a sophisticated Double DQN agent with dueling architecture and prioritized experience replay to learn optimal trading strategies from historical BTC/USD data. The platform is designed for continuous learning and deployment with comprehensive risk management.

Key Features

Double DQN with Dueling Architecture: Reduces overestimation bias and improves value estimation
Prioritized Experience Replay: Focuses learning on important transitions
Walk-Forward Training: Robust out-of-sample validation
Risk-Aware Rewards: Optimizes for Sharpe ratio and drawdown control
Production API: FastAPI service with Prometheus metrics
Real-Time Data: OANDA v20 API integration with retry logic
Comprehensive Backtesting: Full performance analytics and visualization

Architecture

graph TB
    A[OANDA API] -->|Market Data| B[Data Pipeline]
    B --> C[Feature Engineering]
    C --> D[RL Environment]
    D --> E[DQN Agent]
    E -->|Actions| D
    D -->|Rewards/States| E
    E --> F[Model Checkpoints]
    F --> G[FastAPI Service]
    G --> H[Trading Decisions]
    G --> I[Prometheus Metrics]
    I --> J[Grafana Dashboard]

Performance Targets

Metric	Target	Hard Fail
Sharpe Ratio	≥ 2.0	< 1.5
Max Drawdown	≤ 5%	> 10%
Hit Rate	≥ 55%	< 45%
Avg Trade P/L	> 0	≤ 0

Quick Start

Prerequisites

Python 3.11+
CUDA 12.3+ (for GPU support)
Docker & Docker Compose
OANDA API credentials

Installation

# Clone repository
git clone https://github.com/yourusername/btc-rl-trader.git
cd btc-rl-trader

# Install dependencies
poetry install

# Set environment variables
export OANDA_ACCOUNT_ID="your_account_id"
export OANDA_API_KEY="your_api_key"

Training

# Run walk-forward training
python training_pipeline.py

# Monitor training
tensorboard --logdir=logs

Deployment

# Build and run with Docker
docker-compose up -d

# Check health
curl http://localhost:8000/health

# Get prediction
curl -X POST http://localhost:8000/predict \
  -H "Content-Type: application/json" \
  -d '{"use_latest": true}'

Project Structure

btc-rl-trader/
├── data/               # Data fetching and cleaning
├── features/           # Technical indicators and features
├── env/                # Gym trading environment
├── agents/             # DQN implementation
├── backtest/           # Backtesting engine
├── serve/              # FastAPI deployment
├── tests/              # Unit tests
├── notebooks/          # Jupyter demos
└── config/             # Configuration files

Configuration

Edit config/config.yaml to adjust:

Risk parameters (stop loss, position sizing)
Model architecture (layers, learning rate)
Training settings (episodes, batch size)
API endpoints and credentials

API Endpoints

GET /health - Service health check
POST /predict - Get trading prediction
POST /trade - Execute trade (dry run by default)
GET /metrics - Performance metrics
GET /metrics/prometheus - Prometheus format

Testing

# Run all tests
poetry run pytest

# With coverage
poetry run pytest --cov=. --cov-report=html

# Type checking
poetry run mypy .

# Linting
poetry run flake8 .
poetry run black --check .

Monitoring

Access monitoring dashboards:

Grafana: http://localhost:3000 (admin/admin)
Prometheus: http://localhost:9091

Risk Management

The system implements multiple risk controls:

Position Sizing: Kelly criterion with volatility adjustment
Stop Loss: Automatic 2% stop loss per position
Drawdown Limits: Episode termination at 20% drawdown
Leverage Limits: Maximum 95% capital utilization
Funding Costs: Realistic funding rate simulation

Performance Analysis

Generate comprehensive performance reports:

from backtest import PerformanceVisualizer

visualizer = PerformanceVisualizer()
visualizer.create_performance_report(
    portfolio_values,
    trades,
    positions,
    metrics,
    save_path="reports/performance.pdf"
)

Contributing

Fork the repository
Create feature branch (git checkout -b feature/amazing-feature)
Commit changes (git commit -m 'Add amazing feature')
Push to branch (git push origin feature/amazing-feature)
Open Pull Request

License

This project is licensed under the MIT License - see LICENSE file for details.

Disclaimer

IMPORTANT: This software is for educational and research purposes only. Do not trade real money without extensive testing and validation. Past performance does not guarantee future results. Trading cryptocurrencies involves substantial risk of loss.

Acknowledgments

OpenAI Gym for the RL framework
PyTorch team for the deep learning library
OANDA for market data API
All contributors and researchers in algorithmic trading

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.claude		.claude
.github/workflows		.github/workflows
agents		agents
analysis		analysis
analysis_results		analysis_results
backtest		backtest
backtest_results		backtest_results
backups/20250629_164642		backups/20250629_164642
checkpoints_phase3		checkpoints_phase3
config		config
data		data
ensemble_models		ensemble_models
features		features
knowledge_base		knowledge_base
lstm-trading-system		lstm-trading-system
monitoring		monitoring
notebooks		notebooks
results		results
results_minimal		results_minimal
serve		serve
tests		tests
training		training
v3_recommendations		v3_recommendations
venv_btc_trader		venv_btc_trader
.env.example		.env.example
.gitignore		.gitignore
3plus_sharpe_checklist.md		3plus_sharpe_checklist.md
ANALYSIS_SUMMARY.md		ANALYSIS_SUMMARY.md
CLAUDE.md		CLAUDE.md
DUAL_STRATEGY_STATUS.md		DUAL_STRATEGY_STATUS.md
Dockerfile		Dockerfile
ENHANCEMENT_SUMMARY.md		ENHANCEMENT_SUMMARY.md
E[Indicator		E[Indicator
F[Signal		F[Signal
GBP_USD_TRADING.md		GBP_USD_TRADING.md
G[Execution		G[Execution
IMPORTANT_NOTE.md		IMPORTANT_NOTE.md
IMPROVEMENTS.md		IMPROVEMENTS.md
INTRADAY_TRADING_SETUP.md		INTRADAY_TRADING_SETUP.md
L[Backtest		L[Backtest
Makefile		Makefile
OANDA_SETUP.md		OANDA_SETUP.md
PAPER_TRADING_READY.md		PAPER_TRADING_READY.md
README.md		README.md
RESEARCH_BASED_IMPROVEMENTS.md		RESEARCH_BASED_IMPROVEMENTS.md
SHARPE_MILESTONE_MONITOR.md		SHARPE_MILESTONE_MONITOR.md
SMA_TRADING_README.md		SMA_TRADING_README.md
V3_PROJECT_REPORT.md		V3_PROJECT_REPORT.md
V3_TRAINING_STATUS.md		V3_TRAINING_STATUS.md
V3_VS_V4_COMPARISON.md		V3_VS_V4_COMPARISON.md
V4_ANALYSIS.md		V4_ANALYSIS.md
__init__.py		__init__.py
aat_drl_system.py		aat_drl_system.py
analyze_best_strategy.py		analyze_best_strategy.py
analyze_sharpe_profit.py		analyze_sharpe_profit.py
analyze_strategy_simple.py		analyze_strategy_simple.py
analyze_v2_model.py		analyze_v2_model.py
analyze_v3_strategy.py		analyze_v3_strategy.py
backtest_phase3_realistic.py		backtest_phase3_realistic.py
basic_long_short.py		basic_long_short.py
basic_long_short_state.json		basic_long_short_state.json
check_v3_status.py		check_v3_status.py
compare_models.py		compare_models.py
current_strategy_report_20250701_114914.md		current_strategy_report_20250701_114914.md
current_strategy_results_20250701_114914.json		current_strategy_results_20250701_114914.json
debug_v3.py		debug_v3.py
deploy_ensemble.py		deploy_ensemble.py
deploy_ensemble_auto.py		deploy_ensemble_auto.py
deploy_paper_trading.py		deploy_paper_trading.py
deploy_paper_trading_minimal.py		deploy_paper_trading_minimal.py
deploy_phase1_active.py		deploy_phase1_active.py
deploy_phase1_live.py		deploy_phase1_live.py
deploy_phase1_simple.py		deploy_phase1_simple.py
deploy_phase2.py		deploy_phase2.py
deploy_phase3.py		deploy_phase3.py
deploy_phase3_oanda.py		deploy_phase3_oanda.py
diagnose_state_size.py		diagnose_state_size.py
docker-compose.yml		docker-compose.yml
early_stopping_analysis.png		early_stopping_analysis.png
early_stopping_analysis.py		early_stopping_analysis.py
early_stopping_report.md		early_stopping_report.md
enhanced_paper_trader.py		enhanced_paper_trader.py
enhanced_paper_trading_state.json		enhanced_paper_trading_state.json
ensemble_framework.py		ensemble_framework.py
extract_research.py		extract_research.py
fetch		fetch
intraday_config.yaml		intraday_config.yaml
intraday_rl_trainer.py		intraday_rl_trainer.py
intraday_short_trader.py		intraday_short_trader.py
intraday_trainer_fast.py		intraday_trainer_fast.py
knowledg1		knowledg1
knowledg1:Zone.Identifier		knowledg1:Zone.Identifier
launch_v2_training.sh		launch_v2_training.sh
long_short_strategy.py		long_short_strategy.py
manual_setup.sh		manual_setup.sh
mean_reversion_agent.py		mean_reversion_agent.py
minimal_aat_trainer.py		minimal_aat_trainer.py
minimal_test.py		minimal_test.py
monitor_phase3_training.py		monitor_phase3_training.py
monitor_sharpe_milestone.py		monitor_sharpe_milestone.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

BTC RL Trader

Overview

Key Features

Architecture

Performance Targets

Quick Start

Prerequisites

Installation

Training

Deployment

Project Structure

Configuration

API Endpoints

Testing

Monitoring

Risk Management

Performance Analysis

Contributing

License

Disclaimer

Acknowledgments

About

Uh oh!

Releases

Packages

Languages

wlshlad85/btc-rl-trader

Folders and files

Latest commit

History

Repository files navigation

BTC RL Trader

Overview

Key Features

Architecture

Performance Targets

Quick Start

Prerequisites

Installation

Training

Deployment

Project Structure

Configuration

API Endpoints

Testing

Monitoring

Risk Management

Performance Analysis

Contributing

License

Disclaimer

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages