Make sure you have:
- the models folder; if you don't have it, get it from: https://drive.google.com/drive/folders/11BEK-gFWjFB1Qb3mxHMg1OCYQn-QtV29?usp=drive_link
  /models
    /Layout
    /MFD
    /MFR
    README.MD
- 2.EngineeringHistory3Books_text.parquet; if you don't have it, get it from: https://drive.google.com/file/d/1DwXRLUqc7W4fLAtZR3XWiLva0Dc2VBAY/view?usp=sharing
- the conda environment used for this part: LMMRAGwithGPU (from computer 391)
- a .env file; for testing you can use .env_for_testing (see the loading sketch below)
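This README does not spell out how the .env file is consumed; a minimal sketch, assuming the notebooks load it with python-dotenv and with SOME_API_KEY standing in for whatever variables they actually read:

```python
# Minimal sketch: load the test environment file with python-dotenv.
# "SOME_API_KEY" is a hypothetical variable name, not one defined by this repo.
import os
from dotenv import load_dotenv

load_dotenv(dotenv_path=".env_for_testing")  # use ".env" for a real run
print(os.environ.get("SOME_API_KEY"))        # sanity-check that the file was picked up
```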
Part 1 includes:
- image extraction
- caption generation
- text OCR
These three steps use the same environment; run them in order 1 -> 2 -> 3:
- imageextract.ipynb -> produces a cropped-image folder and a full-page folder, plus a .json file pairing each image with its page number
- captiongeneration.ipynb -> produces a .json file of images and their associated captions
- textOCR.ipynb -> produces a .json file of the OCR'd text
After these three steps you will have the following outputs (a quick inspection sketch follows the list):
- imagecaption.json
- text.json
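The exact schemas of these files are whatever the notebooks write; a minimal sketch for loading and inspecting them, with assumed shapes, looks like this:

```python
# Minimal sketch: inspect the Part 1 outputs.
# The layouts described in the comments are assumptions; check the real files.
import json

with open("imagecaption.json") as f:
    imagecaption = json.load(f)  # assumed: image file -> caption / page info

with open("text.json") as f:
    text = json.load(f)          # assumed: per-page OCR text

print(f"{len(imagecaption)} captioned images, {len(text)} OCR text entries")
```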
Part 2 includes:
- embed.ipynb
- rag.ipynb
You may reuse the conda env from part 1.
- After Part 1 you have one .json file for the image dataset and one for the text dataset.
- Parquet files: run embed.ipynb to read the two .json files above, embed both, and store the results in xxx_text.parquet and xxx_image.parquet (a sketch of this step follows below).
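A minimal sketch of what this step might look like, assuming sentence-transformers for the embeddings, that imagecaption.json maps image paths to captions (embedded here as a text proxy for the images), and that text.json is a list of per-page records; the model name, field names, and "xxx" prefix are placeholders, not necessarily what embed.ipynb actually uses:

```python
# Minimal sketch of the embed step; schemas, model choice, and file names are assumptions.
import json
import pandas as pd
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model

# text.json: assumed list of {"page": ..., "text": ...} records
with open("text.json") as f:
    df_text = pd.DataFrame(json.load(f))
df_text["embedding"] = model.encode(df_text["text"].tolist()).tolist()

# imagecaption.json: assumed mapping of image path -> caption; the caption
# is embedded as a stand-in for the image itself.
with open("imagecaption.json") as f:
    pairs = json.load(f)
df_image = pd.DataFrame([{"image": k, "caption": v} for k, v in pairs.items()])
df_image["embedding"] = model.encode(df_image["caption"].tolist()).tolist()

# "xxx" stands for the dataset prefix used by the notebooks.
df_text.to_parquet("xxx_text.parquet")
df_image.to_parquet("xxx_image.parquet")
```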
- RAG: run rag.ipynb to perform the vector search and get the RAG results (a retrieval sketch follows below).
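A minimal sketch of the retrieval step, assuming brute-force cosine similarity over the text parquet; rag.ipynb may instead use a proper vector index or also search the image embeddings:

```python
# Minimal sketch of vector search over the embedded parquet; not the exact rag.ipynb logic.
import numpy as np
import pandas as pd
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # must match the model used in embed.ipynb
df = pd.read_parquet("xxx_text.parquet")         # "xxx" is the dataset prefix
emb = np.vstack(df["embedding"].to_numpy())      # shape: (n_chunks, dim)

def search(query: str, k: int = 5) -> pd.DataFrame:
    """Return the top-k rows most similar to the query by cosine similarity."""
    q = model.encode([query])[0]
    scores = emb @ q / (np.linalg.norm(emb, axis=1) * np.linalg.norm(q) + 1e-12)
    top = np.argsort(-scores)[:k]
    return df.iloc[top].assign(score=scores[top])

# Example query against the engineering-history text chunks.
print(search("early steam engine development", k=3))
```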