-
Notifications
You must be signed in to change notification settings - Fork 6
Expand file tree
/
Copy pathCITATION.cff
More file actions
89 lines (77 loc) · 2.81 KB
/
Copy pathCITATION.cff
File metadata and controls
89 lines (77 loc) · 2.81 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
cff-version: 2.2.0
type: software
title: "A Three-Layer Caching Architecture for Low-Latency LLM Web Search on Commodity CPU Hardware"
abstract: "A production-grade, horizontally-scalable search and Retrieval-Augmented Generation (RAG) system featuring multi-layer caching, Playwright-based search agents, and Huffman-compressed session archival on commodity CPU hardware."
authors:
- family-names: "Bhattacharya"
given-names: "Ayushman"
email: "ayushman@myceli.ai"
affiliation: "Pollinations.ai"
contact:
- family-names: "Bhattacharya"
given-names: "Ayushman"
email: "ayushman@myceli.ai"
date-released: 2026-02-25
identifiers:
- type: "url"
value: "https://github.com/pollinations/lixsearch"
keywords:
- "search"
- "RAG"
- "retrieval-augmented-generation"
- "caching"
- "LLM"
- "web-search"
- "Redis"
- "Huffman-coding"
- "session-management"
- "embeddings"
- "CPU-inference"
- "python"
license: "MIT"
license-url: "https://opensource.org/licenses/MIT"
repository-code: "https://github.com/pollinations/lixsearch"
url: "https://search.elixpo.com"
version: "1.0.0"
preferred-citation:
type: "article"
title: "A Three-Layer Caching Architecture for Low-Latency LLM Web Search on Commodity CPU Hardware"
authors:
- family-names: "Bhattacharya"
given-names: "Ayushman"
email: "ayushman@myceli.ai"
affiliation: "Pollinations.ai"
year: 2026
url: "https://github.com/pollinations/lixsearch/blob/main/docs/paper/lix_cache_paper.pdf"
license: "CC-BY-NC-ND-4.0"
abstract: "We present a three-layer caching architecture for scaling LLM-powered web search on commodity CPU hardware. Deployed on a single 8-vCPU server, the system achieves an 89.3% cache hit rate with 0.1ms Redis read latency, reducing per-query cost by 1,000x compared to commercial search APIs."
references:
- type: "software"
title: "Chroma: AI-native open-source vector database"
authors:
- family-names: "Chroma"
given-names: "Team"
url: "https://github.com/chroma-core/chroma"
license: "Apache-2.0"
- type: "software"
title: "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks"
authors:
- family-names: "Reimers"
given-names: "Nils"
- family-names: "Gurevych"
given-names: "Iryna"
repository-code: "https://github.com/UKPLab/sentence-transformers"
- type: "software"
title: "Quart: Async Python web microframework"
authors:
- family-names: "Quart"
given-names: "Contributors"
repository-code: "https://github.com/pallets/quart"
license: "MIT"
- type: "software"
title: "Playwright: Browser automation library"
authors:
- family-names: "Microsoft"
given-names: "Playwright Team"
repository-code: "https://github.com/microsoft/playwright"
license: "Apache-2.0"