
dsa4266-project

Repository for DSA4266 project.

Setup

  1. Install pipenv:

    pip install pipenv
  2. Create a virtual environment and install dependencies:

    pipenv install --dev
  3. Adding dependencies:

    If you need to add a new dependency, use the following command:

    pipenv install <package-name>
    • Do not add the package directly to the Pipfile. The Pipfile and Pipfile.lock should be generated by pipenv.
    • You can specify if the package is a development dependency by adding the --dev flag.
    • Remember to commit the Pipfile and Pipfile.lock after adding a new dependency.

Project description

The objective of this project is to develop machine learning models capable of accurately classifying videos as either real or deepfake, a binary classification task. We train and compare several models on the Deepfake Detection Challenge (DFDC) dataset, including:

  • A convolutional neural network (CNN)
  • A CNN encoder followed by an LSTM
  • An RCNN
  • A ResNet-based model
  • VideoMAE

The models leverage visual features extracted from video frames to distinguish deepfake videos from real ones. We evaluate each model using the Area Under the Receiver Operating Characteristic Curve (AUC-ROC) to determine the optimal threshold to use when making predictions, then produce a classification report and a confusion matrix. The models should also achieve high precision and recall, ensuring minimal false positives (incorrectly labelling real videos as fake) and false negatives (failing to detect a deepfake).
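The evaluation procedure above can be sketched as follows. This is a minimal illustration using synthetic labels and scores (the project's real pipeline reads model predictions from CSV files); the use of Youden's J statistic to pick the threshold is an assumption, not necessarily the project's exact criterion.

```python
import numpy as np
from sklearn.metrics import (roc_curve, roc_auc_score,
                             classification_report, confusion_matrix)

# Synthetic per-video labels (1 = deepfake) and model scores,
# standing in for a model's real predictions.
rng = np.random.default_rng(0)
y_true = rng.integers(0, 2, size=200)
y_score = np.clip(y_true * 0.6 + rng.normal(0.2, 0.25, size=200), 0.0, 1.0)

# AUC-ROC summarises ranking quality independently of any threshold.
auc = roc_auc_score(y_true, y_score)

# One common way to pick an operating threshold: maximise
# Youden's J = TPR - FPR along the ROC curve.
fpr, tpr, thresholds = roc_curve(y_true, y_score)
best_threshold = thresholds[np.argmax(tpr - fpr)]

# Apply the threshold, then report precision/recall and the confusion matrix.
y_pred = (y_score >= best_threshold).astype(int)
cm = confusion_matrix(y_true, y_pred)
print(f"AUC = {auc:.3f}, threshold = {best_threshold:.3f}")
print(classification_report(y_true, y_pred, target_names=["real", "deepfake"]))
print(cm)
```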

Dataset

The Deepfake Detection Challenge (DFDC) dataset, a collaborative initiative by AWS, Facebook, Microsoft, the Partnership on AI’s Media Integrity Steering Committee, and academics.

Instructions for Dataset

  1. Organise by Model Name: Inside the /data folder, create subdirectories for each model you want to run, named exactly as specified:
  • /data/cnn
  • /data/cnn_encoder_lstm
  • /data/rcnn
  • /data/resnet
  • /data/videomae
  2. Unzip Data Files: Unzip each model's dataset into its respective subdirectory.
  3. Verify File Paths: Each model script automatically refers to its dedicated folder under /data/<model-name>. Ensure that any required subfolders or files are placed correctly to avoid path errors.
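The directory setup above can be scripted as follows. This is a minimal sketch: the archive naming convention (<model-name>.zip in the repository root) is an assumption, so adjust the paths to match the files you actually downloaded.

```python
from pathlib import Path
import zipfile

DATA_DIR = Path("data")
# Subdirectory names must match exactly what the model scripts expect.
MODEL_NAMES = ["cnn", "cnn_encoder_lstm", "rcnn", "resnet", "videomae"]

for name in MODEL_NAMES:
    target = DATA_DIR / name
    target.mkdir(parents=True, exist_ok=True)

    # Hypothetical archive location: "<model-name>.zip" next to /data.
    archive = Path(f"{name}.zip")
    if archive.exists():
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(target)
```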

Repository structure

.
├── LICENSE
├── Pipfile
├── Pipfile.lock
├── README.md
├── __init__.py
├── data
├── eda
│   ├── README.md
│   └── jh-eda.ipynb
├── evaluation
│   ├── README.md
│   ├── cnn_plots.ipynb
│   ├── evaluate_model.py
│   ├── evaluation-results
│   │   ├── cnn_auc_roc_curve.png
│   │   ├── cnn_classification_report.png
│   │   ├── cnn_confusion_matrix.png
│   │   ├── cnn_encoder_lstm_auc_roc_curve.png
│   │   ├── cnn_encoder_lstm_classification_report.png
│   │   ├── cnn_encoder_lstm_confusion_matrix.png
│   │   ├── cnn_model_diagram.png
│   │   ├── rcnn_auc_roc_curve.png
│   │   ├── rcnn_classification_report.png
│   │   ├── rcnn_confusion_matrix.png
│   │   ├── resnet_auc_roc_curve.png
│   │   ├── resnet_classification_report.png
│   │   ├── resnet_confusion_matrix.png
│   │   ├── resnet_model_diagram.png
│   │   ├── videomae_auc_roc_curve.png
│   │   ├── videomae_classification_report.png
│   │   └── videomae_confusion_matrix.png
│   └── run_evaluation.py
├── explainability
│   ├── Gradcam_CNN.ipynb
│   ├── Gradcam_Resnet.ipynb
│   ├── README.md
│   └── sample_frames
│       ├── frame_0008_adgb_deepfake.jpg
│       └── frame_0028_afbg_real.jpg
├── models
│   ├── README.md
│   ├── __init__.py
│   ├── cnn
│   │   ├── README.md
│   │   ├── cnn_dev.ipynb
│   │   ├── cnn_dev.py
│   │   ├── cnn_dev_predictions.py
│   │   ├── diagrams
│   │   │   └── cnn_loss.png
│   │   ├── loss.py
│   │   └── results
│   │       ├── cnn_loss.csv
│   │       ├── results.csv
│   │       ├── video_classification_results.csv
│   │       └── video_classification_results_rd_2.csv
│   ├── cnn_encoder_lstm
│   │   ├── README.md
│   │   ├── cnn_encoder_lstm.py
│   │   ├── diagrams
│   │   │   ├── cnn_encoder_lstm
│   │   │   ├── cnn_encoder_lstm.png
│   │   │   └── cnn_encoder_lstm_loss.png
│   │   ├── loss.py
│   │   ├── model_arch.ipynb
│   │   ├── results
│   │   │   ├── predictions_cnn_encoder_lstm.csv
│   │   │   ├── predictions_cnn_encoder_lstm_transformed.csv
│   │   │   └── progress.csv
│   │   ├── test_specific.py
│   │   ├── train.py
│   │   ├── train_specific.py
│   │   ├── train_specific_specific.py
│   │   └── transform_results.py
│   ├── rcnn
│   │   ├── README.md
│   │   ├── __init__.py
│   │   ├── diagrams
│   │   │   ├── rcnn_diagram
│   │   │   ├── rcnn_diagram.png
│   │   │   └── rcnn_loss.png
│   │   ├── loss.py
│   │   ├── model_arch.ipynb
│   │   ├── rcnn.py
│   │   ├── results
│   │   │   ├── predictions_rcnn.csv
│   │   │   ├── predictions_rcnn_transformed.csv
│   │   │   ├── progress.csv
│   │   │   └── transform_rcnn_results.py
│   │   ├── test_specific.py
│   │   ├── train.py
│   │   └── train_specific.py
│   ├── resnet
│   │   ├── README.md
│   │   ├── diagrams
│   │   │   └── resnet_loss.png
│   │   ├── loss.py
│   │   ├── resnet.ipynb
│   │   └── results
│   │       ├── resnet_results.csv
│   │       ├── training_loss.csv
│   │       └── tuning_loss.csv
│   └── videomae
│       ├── README.md
│       ├── diagrams
│       │   └── videomae_loss.png
│       ├── loss.py
│       ├── results
│       │   ├── results-32-frames.csv
│       │   └── videomae_loss.csv
│       └── videomae.ipynb
├── preprocessing
│   ├── Data flow pipeline.png
│   ├── README.md
│   ├── __init__.py
│   ├── augment.py
│   ├── create_balanced_dataset.ipynb
│   ├── dct.py
│   ├── dft.py
│   ├── frames.py
│   ├── label.py
│   ├── lbp.py
│   ├── split.py
│   ├── yolo.py
│   └── yolov11n-face.pt
└── utils
    ├── __init__.py
    ├── augmentation.py
    ├── checking_dataset.ipynb
    ├── dataset.py
    ├── dataset_vitmae.py
    ├── rough_work.ipynb
    ├── types.py
    ├── utils.py
    └── video_proc.py

Team members

In alphabetical order:

  1. Au Jun Hui
  2. Nixon Widjaja
  3. Pong Yi Zhen
  4. Sum Hung Yee
  5. Tan Hui Xuan Valerie
  6. Wilson Widyadhana
