You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+6-2Lines changed: 6 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -25,15 +25,17 @@ A GenAI-powered catalog enrichment system that transforms basic product images i
25
25
-**Cultural Image Generation**: Create culturally-appropriate product backgrounds (Spanish courtyards, Mexican family spaces, British formal settings)
26
26
-**Quality Evaluation**: Automated VLM-based quality assessment of generated images with detailed scoring
27
27
-**3D Asset Generation**: Transform 2D product images into interactive 3D GLB models using Microsoft TRELLIS
28
-
-**Product FAQ Generation**: Automatically generate 3-5 product FAQs from enriched catalog data
28
+
-**Product FAQ Generation**: Automatically generate product FAQs from enriched catalog data, with optional product manual PDF upload for richer FAQs (up to 10) via stateless targeted RAG
29
29
-**Policy Compliance**: Upload policy PDFs and automatically check product listings against them using RAG + Milvus
30
30
-**Modular API**: Separate endpoints for VLM analysis, FAQ generation, image generation, and 3D asset generation
31
31
32
32
## Documentation
33
33
34
34
-**[API Documentation](docs/API.md)** - Detailed API endpoints, parameters, and examples
35
35
-**[Docker Deployment Guide](docs/DOCKER.md)** - Docker and Docker Compose setup instructions
36
-
-**[Product Requirements (PRD)](PRD.md)** - Product requirements and feature specifications
36
+
-**[Product Requirements (PRD)](docs/PRD.md)** - Product requirements and feature specifications
37
+
-**[Policy Compliance](docs/POLICY_COMPLIANCE.md)** - How policy compliance checking works
38
+
-**[Product Manual for FAQs](docs/PRODUCT_MANUAL_FAQS.md)** - How product manual PDFs enrich FAQ generation
37
39
-**[AI Agent Guidelines](AGENTS.md)** - Instructions for AI assistants working on this project
38
40
39
41
## Tech Stack
@@ -221,6 +223,8 @@ For complete Docker deployment instructions, see the **[Docker Deployment Guide]
**4) 3D Asset Generation (POST `/generate/3d`)** - Generate 3D models on demand
42
43
@@ -276,7 +277,10 @@ curl -X POST \
276
277
277
278
## 3️⃣ FAQ Generation: `/vlm/faqs`
278
279
279
-
Generate 3-5 frequently asked questions and answers for a product based on its enriched catalog data. Designed to be called after `/vlm/analyze` completes, using the enriched result as input.
280
+
Generate frequently asked questions and answers for a product based on its enriched catalog data. Designed to be called after `/vlm/analyze` completes, using the enriched result as input.
281
+
282
+
Without a product manual: generates 3-5 basic FAQs from the product data.
283
+
With manual knowledge (from `/vlm/manual/extract`): generates up to 10 richer FAQs that draw from both the product data and the manual, surfacing details that go beyond the description.
280
284
281
285
**Endpoint**: `POST /vlm/faqs`
282
286
**Content-Type**: `multipart/form-data`
@@ -291,6 +295,7 @@ Generate 3-5 frequently asked questions and answers for a product based on its e
Extract structured knowledge from a product manual PDF using targeted RAG. The endpoint processes the PDF, generates product-type-specific queries via the LLM (using title + categories, not description, to avoid duplicating what the description already covers), and retrieves relevant chunks from the manual for each topic.
374
+
375
+
This endpoint is **stateless** — all embeddings are computed in-memory and freed after the response. It can handle concurrent requests for different products.
"battery_life": "The speaker provides up to 12 hours of continuous playback...",
397
+
"waterproof_rating": "IPX7 rated, can be submerged up to 1 meter for 30 minutes...",
398
+
"care_instructions": "Clean with a damp cloth. Do not use abrasive cleaners..."
399
+
}
400
+
}
401
+
```
402
+
403
+
The `knowledge` object contains topic keys (dynamically generated by the LLM based on product type) mapped to the relevant text extracted from the manual. Topics with no relevant content are empty strings.
0 commit comments