# AI Research Agent - Comprehensive Dependencies
# Install with: pip install -r requirements.txt
# ===== CORE WEB FRAMEWORK =====
streamlit>=1.28.0
fastapi>=0.104.0
uvicorn>=0.24.0
# ===== AI & LANGUAGE MODELS =====
openai>=1.14.3
sentence-transformers
faiss-cpu
tiktoken>=0.5.0
# ===== BROWSER AUTOMATION =====
# Selenium (for DocSend processing with stealth capabilities)
selenium>=4.15.0
webdriver-manager>=4.0.0
# Playwright (for advanced web scraping)
playwright>=1.50.0
# ===== OCR & IMAGE PROCESSING =====
pytesseract>=0.3.10
Pillow>=10.0.0
# ===== DOCUMENT PROCESSING =====
PyMuPDF # For PDF processing (fitz)
python-docx>=1.1.0 # For DOCX processing
PyPDF2>=3.0.1 # Alternative PDF processing
# ===== HTTP CLIENTS & WEB =====
aiohttp>=3.9.0
httpx>=0.27.0 # For async HTTP requests
requests>=2.31.0
certifi>=2023.11.17 # For SSL certificate verification
beautifulsoup4
validators
brotli>=1.1.0 # For Brotli compression support in sitemaps
# ===== DATA PROCESSING =====
pandas>=2.1.0
numpy>=1.24.0
# ===== FINANCIAL DATA (OpenBB Platform) =====
# OpenBB Platform for unified equity data access (prices, fundamentals, filings, news)
# Requires provider API keys to be set in environment (FMP_API_KEY, etc.)
openbb>=4.5.0
# ===== MCP & CRYPTO ANALYSIS =====
# MCP (Model Context Protocol) Dependencies
aiofiles>=23.2.1
websockets>=12.0 # For MCP SSE connections
pydantic>=2.0.0 # For data validation and models
# Visualization & Charting
plotly>=5.17.0 # Interactive charts
altair>=5.0.0 # Statistical visualizations
bokeh>=3.0.0 # Alternative plotting library
# ===== AUTHENTICATION & SECURITY =====
bcrypt>=4.1.0
cryptography>=41.0.0
# ===== CONFIGURATION & ENVIRONMENT =====
python-dotenv>=1.0.1
pyyaml>=6.0.1
# ===== CACHING & STORAGE =====
redis
# ===== NOTION INTEGRATION =====
notion-client>=2.2.1
# ===== FIRECRAWL & WEB SCRAPING =====
firecrawl-py>=2.4.0
# ===== ASYNC UTILITIES =====
asyncio-throttle>=1.0.2
tenacity>=8.0.0
# ===== DATE & TIME =====
python-dateutil>=2.9.0
# ===== CLI & UI UTILITIES =====
prompt-toolkit>=3.0.0
rich>=13.7.0
typer>=0.9.0
# ===== SEARCH & RESEARCH =====
duckduckgo-search==7.5.0
google-api-python-client
# ===== OPEN DEEP RESEARCH (ODR) INTEGRATION =====
langgraph>=0.2.55
langchain-community>=0.3.9
langchain-openai>=0.3.7
langchain-anthropic>=0.3.15
langchain-mcp-adapters>=0.1.6
langchain-deepseek>=1.0.0
langchain-tavily
langchain-groq>=0.2.4
tavily-python>=0.5.0
arxiv>=2.1.3
xmltodict>=0.14.2
linkup-sdk>=0.2.3
exa-py>=1.8.8
markdownify>=0.11.6
azure-identity>=1.21.0
azure-search-documents>=11.5.2
langsmith>=0.3.37
langchain-google-vertexai>=2.0.25
langchain-google-genai>=2.1.5
supabase>=2.15.3
mcp>=1.9.4
# ===== SPECIALIZED LIBRARIES =====
kitchenai-whisk
# ===== ENTITY EXTRACTION =====
# Pin to specific version to avoid API drift (prompt_description vs instructions, etc.)
langextract==0.1.0
# ===== TESTING =====
pytest>=7.4.0
pytest-asyncio>=0.23.0
# ===== E2E BROWSER TESTING =====
browser-use>=0.11.4
# ===== SYSTEM DEPENDENCIES NOTES =====
#
# The following system dependencies are required:
#
# BROWSERS (at least one required):
# - Chrome/Chromium (recommended for DocSend)
# - Firefox
# - Microsoft Edge
#
# OCR ENGINE:
# - macOS: brew install tesseract
# - Windows: Download from https://github.com/UB-Mannheim/tesseract/wiki
# - Linux: sudo apt-get install tesseract-ocr (Ubuntu/Debian)
#          sudo yum install tesseract (CentOS/RHEL)
#
# LIBMAGIC (required for langextract entity extraction):
# - macOS: brew install libmagic
# - Windows: pip install python-magic-bin (includes bundled libmagic)
# - Linux: sudo apt-get install libmagic1 (Ubuntu/Debian)
#          sudo yum install file-libs (CentOS/RHEL)
#
# PLAYWRIGHT BROWSERS (for web scraping):
# After pip install, run: playwright install
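# Optional variants (a sketch, not part of the original setup notes):
#   playwright install chromium       # download only Chromium instead of every browser
#   playwright install --with-deps    # Linux: also install the OS libraries the browsers need
# Which browser the scraping code launches is not stated here, so confirm that
# before limiting the download to a single browser.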
#
# SUPPORTED PLATFORMS:
# - macOS (Intel/Apple Silicon)
# - Windows 10/11
# - Linux (Ubuntu, CentOS, etc.)
#
# ===== QUICK START =====
#
# 1. Create a virtual environment (Python 3.13 required for OpenBB):
#    macOS/Linux: python3.13 -m venv venv && source venv/bin/activate
#    Windows:     py -3.13 -m venv venv, then venv\Scripts\activate
#
# 2. Install system dependencies (macOS example):
#    brew install tesseract libmagic
#
# 3. Install Python packages:
#    pip install -r requirements.txt
#
# 4. Install Playwright browsers:
#    playwright install
#
# 5. Copy .env.example to .env and configure API keys:
#    cp .env.example .env
#
# 6. Run the app:
#    streamlit run main.py