Skip to content

Latest commit

 

History

History
20 lines (17 loc) · 877 Bytes

README.md

File metadata and controls

20 lines (17 loc) · 877 Bytes

CS2270-Video-Captioning

Overall Structure

  • `models' : contains all submodules or model classes for captioning, text to SQL and text to table + config files
  • `datasets': contains reference to all datasets (raw data) stored
  • `data_processing': code for preprocessing and sampling videos + config files
  • `outputs': contains the Postgre SQL table generated from our system
  • `frontend': contains code to setup and run front-end for video uploading
  • `main.py': the initial entrypoint for code backend containing logic to select models and run pipeline
  • `baselines': contains all submodules and code for baseline models e.g. VIVA and ZELDA

Request GPU from OSCAR

srun --partition=gpu --gres=gpu:2 --cpus-per-task=4 --mem=16G --time=02:00:00 --pty bash

Profile latency using Pyinstrument

pyinstrument -o test_async_latency_text2table.html pipeline.py