Quantitative Market Analysis Toolkit

A professional-grade system for market state analysis, prediction, and investment strategy evaluation. This toolkit uses advanced machine learning techniques to classify market conditions, predict future states, and backtest multiple investment strategies.

Read our detailed research paper for methodology, findings and complete analysis.

Core Capabilities

Market Classification

Bear Market: Negative trend with significant drawdown
Bull Market: Positive trend with sustained growth
Static Market: Sideways movement or consolidation

Predictive Models

Traditional ML: Gradient boosting with feature engineering
Neural Networks: LSTM and Attention-based models for time series
Ensemble Methods: Combined models for robust prediction

Strategy Backtesting

Buy-and-Hold: Baseline strategy for comparison
Prediction-Based: Asset allocation based on market predictions
Dynamic Allocation: Adaptive allocation using probability signals
Combined Strategy: Multi-factor approach with risk management
Anomaly-Aware: Strategy that responds to detected market anomalies

Getting Started

Installation

# Clone the repository
git clone https://github.com/SadeekFarhan21/Quantathon.git
cd Quantathon

# Install dependencies
pip install -r requirements.txt

Basic Usage

Run a complete market analysis with default settings:

python main.py --data data/market_data.xlsx --output results

Custom Analysis Period

Analyze a specific market period:

python main.py --data data/market_data.xlsx --start_date 2018-01-01 --end_date 2022-12-31 --train_start_date 2008-01-01 --train_end_date 2017-12-31

Advanced Neural Network Models

Use deep learning capabilities:

python main.py --data data/market_data.xlsx --advanced --model_type attention

Input Data Format

The system expects an Excel file with the following structure:

Price Sheet

Date: Trading dates
SP500: S&P 500 index values
BondRate: Interest rates for bond alternatives

Probability Sheet

Date: Trading dates
PrDec: Probability of significant decrease
PrInc: Probability of significant increase

Key Components

Market State Classification

Identifies market regimes based on price trends, volatility, and drawdown characteristics. This classification forms the foundation for prediction and strategy development.

Anomaly Detection

Identifies unusual market behavior through multiple methods:

Isolation forest for outlier detection
Statistical analysis of volatility spikes
Price jump identification
DBSCAN clustering for pattern detection

Risk Analysis

Advanced risk metrics including:

Value at Risk (VaR) calculations
Expected Shortfall (ES)
Maximum drawdown analysis
Stress testing of extreme scenarios

Performance Evaluation

Comprehensive metrics for strategy evaluation:

Risk-adjusted returns (Sharpe, Sortino)
Drawdown characteristics
Win rate and recovery periods
Comparative performance visualization

Implementation Details

The codebase follows a modular architecture:

Quantathon/
│
├── src/                          # Core functionality
│   ├── data_loader_market.py     # Data import and preprocessing
│   ├── market_classifier.py      # Market state identification
│   ├── prediction_model.py       # ML prediction models
│   ├── advanced_models.py        # Neural network models
│   ├── market_anomaly.py         # Anomaly detection
│   ├── risk_management.py        # Risk assessment tools
|   ├── advanced_models.py        # Neural network and deep learning models
|   ├── backtest.py               # Standard backtesting framework
|   ├── enhanced_backtester.py    # Advanced backtesting with portfolio analytics
|   ├── enhanced_strategies.py    # Sophisticated trading strategy implementations │
|   ├── markov_chain.py           # Market state transition modeling 
|   ├── markov_strategy.py        # Strategies based on Markov predictions
|   ├── yield_analyzer.py         # Bond yield analysis and modeling
│
├── config/                       # Configuration files
│   └── strategy_optimizations.py # Strategy optimization config
│
├── utils/       
│       ├── financial_utils.py    # financial utilities
│       └── logger.py             # Logging configuration
├── scripts/                      # Analysis utilities
│   ├── analyze_probabilities.py  # Probability signal analysis
│   ├── bond_rate_analysis.py     # Bond rate studies
│
├── main.py                       # Main execution script
├── requirements.txt              # Dependencies
└── README.md                     # Project documentation

System Architecture

The system is designed with a modular architecture to ensure flexibility and scalability. Here is an overview of how the components interact with each other:

Data Loader: Imports and preprocesses market data from various sources.
Market Classifier: Identifies market states (Bear, Bull, Static) based on historical data.
Prediction Model: Utilizes machine learning models to predict future market states.
Backtesting Engine: Simulates investment strategies based on historical data and model predictions.
Anomaly Detection: Identifies unusual market behaviors that may impact strategy performance.
Risk Analysis: Evaluates the risk associated with different strategies using advanced metrics.
Performance Evaluation: Assesses the performance of strategies using various financial metrics.

The following diagram illustrates the interaction between these components:

+------------------+       +------------------+       +------------------+
|   Data Loader    | ----> | Market Classifier| ----> | Prediction Model |
+------------------+       +------------------+       +------------------+
        |                        |                        |
        v                        v                        v
+------------------+       +------------------+       +------------------+
| Backtesting Engine| <----| Anomaly Detection| <----| Risk Analysis    |
+------------------+       +------------------+       +------------------+
        |                        |                        |
        v                        v                        v
+---------------------------------------------------------------+
|                    Performance Evaluation                     |
+---------------------------------------------------------------+

Strategy Descriptions

Buy and Hold

Simple benchmark strategy that invests in S&P 500 and holds for the entire period.

Prediction-Based Strategy

Binary allocation model that invests 100% in stocks during predicted Bull markets and 100% in bonds during Bear or Static markets.

Dynamic Allocation

Weighted allocation based on prediction probabilities, allowing for more nuanced positioning.

Combined Strategy

Multi-signal approach using:

Market prediction signals
Trend indicators
Volatility measures
Probability metrics

Anomaly-Aware Strategy

Adaptive strategy that:

Reduces equity exposure when anomalies are detected
Gradually returns to normal allocation as market stabilizes
Uses specialized risk management during volatile periods

Advanced Usage

Fine-Tuning Predictions

Adjust model hyperparameters:

python main.py --data data/market_data.xlsx --advanced \
    --model_type ensemble --train_start_date 2008-01-01

Custom Model Integration

The system is designed for extensibility. You can add custom models by:

Creating a new model class in src/custom_models.py
Implementing the required train() and predict() methods
Importing and initializing your model in main.py

Command-Line Arguments and Usage

The main.py script provides a flexible command-line interface to control the market analysis and backtesting pipeline. Here's a detailed breakdown of the available arguments:

General Arguments

--data: (Required) Path to the market data file (Excel format).
- Example: --data data/market_data.xlsx
--output: (Optional) Directory to save the results and analysis. Default: results.
- Example: --output my_analysis_results
--verbose: (Optional) Enable verbose output for detailed logging.
- Example: --verbose
--enhanced: (Optional) Enable enhanced backtesting strategies.
- Example: --enhanced
--markov: (Optional) Enable Markov chain prediction strategy.
- Example: --markov

Date Range Arguments

These arguments control the date ranges used for analysis and training. It's crucial to set these correctly to ensure meaningful results.

--start_date: (Optional) Start date for the analysis period (YYYY-MM-DD). If not specified, the analysis will start from the beginning of the available data.
- Example: --start_date 2019-01-01
--end_date: (Optional) End date for the analysis period (YYYY-MM-DD). If not specified, the analysis will end at the end of the available data. Default: 2022-12-31.
- Example: --end_date 2022-12-31
--train_start_date: (Optional) Start date for the training data (YYYY-MM-DD). If not specified, the training data will start from the beginning of the available data.
- Example: --train_start_date 2008-01-01
--train_end_date: (Optional) End date for the training data (YYYY-MM-DD). If not specified, the training data will end before the analysis period.
- Example: --train_end_date 2018-12-31

Important Notes on Date Ranges:

The start_date and end_date define the period over which the backtesting and performance analysis are conducted.
The train_start_date and train_end_date define the period used to train the prediction model.
The training period should generally precede the analysis period to avoid lookahead bias.
If no training period is specified, the system will use a default approach: either all data before the analysis period or the first 80% of the available data.
The system will automatically adjust the date ranges to fit within the available data. Warnings will be logged if the specified dates are outside the available range.

Advanced Model Arguments

These arguments control the use of advanced neural network models.

--advanced: (Optional) Enable the use of advanced PyTorch-based models.
- Example: --advanced
--model_type: (Optional) Type of advanced model to use. Choices: attention, tcn, ensemble. Default: attention.

Example Commands

Analyzing a Specific Period with a Trained Model

To analyze the market from January 1, 2019, to December 31, 2022, using a model trained on data up to December 31, 2018, run the following command:

python main.py --data data/market_data.xlsx --output results --advanced --model_type attention --start_date 2019-01-01 --end_date 2022-12-31 --train_end_date 2018-12-31 --enhanced```

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
.cph		.cph
config		config
data		data
models		models
research_paper		research_paper
results		results
scripts		scripts
slides		slides
src		src
utils		utils
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

jalenfran/Quantathon25

Folders and files

Latest commit

History

Repository files navigation