@@ -6,6 +6,61 @@ Deep research loop for finding elite-tier open-source AI projects. It cycles thr
66
77This spec is ** schedule-agnostic** — implementers (cron jobs, manual runs, other agents) handle timing and persistence.
88
9+ ## Discovery API: GitDB
10+
11+ The primary discovery mechanism is the ** GitDB API** (` https://p.gitdb.net/api/v1 ` ), which provides semantic search and related-project discovery.
12+
13+ ### Endpoints
14+
15+ | Endpoint | Purpose | Example |
16+ | ----------| ---------| ---------|
17+ | ` GET /projects/search ` | Search projects by query | ` https://p.gitdb.net/api/v1/projects/search?q=agent+framework&limit=50&offset=0&sort_by=relevance ` |
18+ | ` GET /{owner}/{repo}/related ` | Find related/similar projects | ` https://p.gitdb.net/api/v1/lobehub/lobe-chat/related?limit=10&offset=0 ` |
19+
20+ ### Query Parameters (Search)
21+
22+ | Param | Type | Description |
23+ | -------| ------| -------------|
24+ | ` q ` | string | Search query (keywords, topics) |
25+ | ` limit ` | int | Max results (default: 50) |
26+ | ` offset ` | int | Pagination offset |
27+ | ` sort_by ` | string | ` relevance ` , ` stars ` , ` updated ` , ` created ` |
28+
29+ ### Response Fields (Key)
30+
31+ ``` json
32+ {
33+ "projects" : [{
34+ "id" : 12345 ,
35+ "full_name" : " owner/repo" ,
36+ "url" : " https://github.com/owner/repo" ,
37+ "description" : " ..." ,
38+ "language" : " Python" ,
39+ "stars" : 15000 ,
40+ "forks" : 1200 ,
41+ "last_push" : " 2026-04-20T12:00:00+00:00" ,
42+ "is_archived" : false ,
43+ "license" : " mit" ,
44+ "topics" : " [\" ai\" , \" llm\" , \" agent\" ]" ,
45+ "similarity_score" : 0.85 // only in /related
46+ }],
47+ "pagination" : { "total" : 150 , "has_next" : true }
48+ }
49+ ```
50+
51+ ### Discovery Strategy
52+
53+ 1 . ** Start with search** : Use category keywords (e.g., "RAG vector database", "agent framework", "LLM inference")
54+ 2 . ** Drill down with related** : For any promising project, fetch related projects to discover competitors/alternatives
55+ 3 . ** Filter by thresholds** : Apply qualification criteria (stars ≥1000, activity, license)
56+ 4 . ** Cross-reference** : Verify against existing README entries to avoid duplicates
57+
58+ ### Rate Limits
59+
60+ - No authentication required for read endpoints
61+ - Be respectful: add small delays between requests
62+ - Cache results locally during a research session
63+
964## Clean Start Requirement
1065
1166Before doing any research, edits, branch creation, or PR work, the runner/agent must start from a clean, current base:
0 commit comments