SAIM-ADS

Repository for experiments and preprocessing related to advertisement videos analysis

Environment creation:

Create a conda environment using the following command:

conda create -n ads-env python=3.8

For creating conda enviroment with specific path:

conda create --prefix <path> python=3.8

Install pytorch using the following command:

conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 cudatoolkit=11.3 -c pytorch

Install additional requirements using the following:

pip install -r requirements.txt

Shot Extraction

Install PySceneDetect using the following command:

pip install scenedetect[opencv] --upgrade

Install CLIP

pip install ftfy regex tqdm
pip install git+https://github.com/openai/CLIP.git

Trancripts extraction

Extracting transcripts using Whisper

Install whisper using the following command:
```
pip install -U openai-whisper
```

While instantiating the model provide the download root path for the model:

   import whisper 
   model=whisper.load_model("large", download_root="path/to/download/model")

Extracting transcripts using whisper-X

Follow the instructions listed in Whisper-X for installation:
```
pip install git+https://github.com/m-bain/whisperx.git
```

Feature extraction

Go to folder feature_extraction and for extracting shot level features using vision transformers use the following

CUDA_VISIBLE_DEVICES=3 python extract_vit_features.py --feature_folder <destination vit features> --video_folder <base folder containing the shots> --   
 model_name google/vit-base-patch16-224 --video_type shot --shot_subfolder <type of shot here>

The previous command has been modified to extract features for files that have not been processed in the list of files provided in the file_list.txt file.

CUDA_VISIBLE_DEVICES=3 python extract_vit_features.py --feature_folder <destination vit features>  --video_folder <base folder containing the shots> --model_name google/vit-base-patch16-224 --video_type shot --shot_subfolder <type of shot here> --shot_file_list <path to file list>

Visual caption extraction

Create a new conda environment using the following:

conda create -n lavis python=3.8
conda activate lavis

Install lavis using the following:
```
pip install salesforce-lavis
```

Associated citation:

@article{Bose2023MMAUTowardsMU,
  title={MM-AU: Towards Multimodal Understanding of Advertisement Videos},
  author={Digbalay Bose and Rajat Hebbar and Tiantian Feng and Krishna Somandepalli and Anfeng Xu and Shrikanth S. Narayanan},
  journal={Proceedings of the 31st ACM International Conference on Multimedia},
  year={2023},
  url={https://dl.acm.org/doi/10.1145/3581783.3612371}
}

Name		Name	Last commit message	Last commit date
Latest commit History 99 Commits
caption_extraction		caption_extraction
configs		configs
data		data
datasets		datasets
feature_extraction		feature_extraction
figures		figures
llama @ 57b0eb6		llama @ 57b0eb6
losses		losses
models		models
notebooks		notebooks
optimizers		optimizers
plot_figs		plot_figs
plot_scripts		plot_scripts
post_process_results		post_process_results
preprocess_scripts		preprocess_scripts
scripts		scripts
transcript_modeling		transcript_modeling
utils		utils
zero_shot_experiments		zero_shot_experiments
.gitmodules		.gitmodules
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SAIM-ADS

Environment creation:

Shot Extraction

Install CLIP

Trancripts extraction

Extracting transcripts using Whisper

Extracting transcripts using whisper-X

Feature extraction

Visual caption extraction

About

Releases

Packages

Contributors 3

Languages

usc-sail/SAIM-ADS

Folders and files

Latest commit

History

Repository files navigation

SAIM-ADS

Environment creation:

Shot Extraction

Install CLIP

Trancripts extraction

Extracting transcripts using Whisper

Extracting transcripts using whisper-X

Feature extraction

Visual caption extraction

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages