From raw footage to concise insights: Build an adaptive video-understanding pipeline that probes each video and automatically chooses the right mix of ASR, OCR, and a vision-language model. Process in chunks (Decord), transcribe speech (Whisper), read on-screen text (Tesseract), describe frames (Ovis2.5), then condense everything into a clear summary (Qwen2.5).
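Before the setup steps, here is a minimal sketch of what the probe-then-process loop can look like. It uses Decord for frame access, Whisper for ASR, and pytesseract for OCR; `describe_frame` and `summarize` are hypothetical stand-ins for the Ovis2.5 and Qwen2.5 calls, and the chunk length and probe thresholds are illustrative values, not ones taken from this project.

```python
"""Minimal sketch of the adaptive probe-then-process loop (illustrative only)."""
import numpy as np
import pytesseract
import whisper
from decord import VideoReader, cpu
from PIL import Image

CHUNK_SECONDS = 30   # illustrative chunk length
PROBE_FRAMES = 5     # frames sampled when deciding whether OCR is worthwhile
MIN_OCR_CHARS = 20   # below this, on-screen text is treated as noise


def describe_frame(frame: np.ndarray) -> str:
    # Hypothetical stand-in for the Ovis2.5 vision-language call.
    return "<frame description>"


def summarize(notes: str) -> str:
    # Hypothetical stand-in for the Qwen2.5 summarization call.
    return notes


def probe(path: str) -> dict:
    """Cheaply inspect the video to decide which models to run on it."""
    vr = VideoReader(path, ctx=cpu(0))
    idx = np.linspace(0, len(vr) - 1, PROBE_FRAMES).astype(int).tolist()
    frames = vr.get_batch(idx).asnumpy()

    # OCR probe: enable Tesseract only if sampled frames actually carry text.
    chars = sum(len(pytesseract.image_to_string(Image.fromarray(f)).strip())
                for f in frames)

    # ASR probe: run Whisper once; a near-empty transcript means no usable speech.
    transcript = whisper.load_model("base").transcribe(path)["text"].strip()

    return {
        "fps": vr.get_avg_fps(),
        "n_frames": len(vr),
        "use_ocr": chars >= MIN_OCR_CHARS,
        "transcript": transcript,
    }


def run_pipeline(path: str) -> str:
    plan = probe(path)
    vr = VideoReader(path, ctx=cpu(0))
    step = max(1, int(plan["fps"] * CHUNK_SECONDS))
    notes = []
    # Walk the video chunk by chunk, describing one mid-chunk frame each time.
    for start in range(0, plan["n_frames"], step):
        frame = vr[min(start + step // 2, plan["n_frames"] - 1)].asnumpy()
        notes.append("Frame: " + describe_frame(frame))
        if plan["use_ocr"]:
            notes.append("OCR: " + pytesseract.image_to_string(Image.fromarray(frame)))
    if plan["transcript"]:
        notes.append("Transcript: " + plan["transcript"])
    return summarize("\n".join(notes))
```

Probing first is what keeps costs down: a silent screen recording never touches Whisper's output, and talking-head footage with no on-screen text skips per-chunk OCR entirely.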
Set up an isolated environment and install the dependencies (Windows-style paths; PyTorch wheels built for CUDA 12.1):

```
conda create --prefix D:\conda_env\video_understanding python=3.11 -y
conda activate D:\conda_env\video_understanding
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
pip install git+https://github.com/huggingface/transformers
pip install git+https://github.com/huggingface/accelerate
```
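As an optional sanity check after the PyTorch install, you can confirm the CUDA build is active (this assumes a driver compatible with CUDA 12.1):

```python
import torch
print(torch.__version__, torch.cuda.is_available())  # expect True on a working CUDA setup
```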
Then install the remaining project requirements and launch the app:

```
pip install -r requirements.txt
streamlit run app.py
```

To delete the environment when you no longer need it:

```
conda remove --prefix D:\conda_env\video_understanding --all
```