TAPIR training time stats #98

Open
shivanimall opened this issue Jun 12, 2024 · 2 comments
Comments

@shivanimall

shivanimall commented Jun 12, 2024

I am using TAPIR as a backbone architecture, starting from the checkpoint, and I need to train it further for my use case. Do you have any suggestions for what training times to expect when starting from the checkpoint, or any fine-tuning stats?

For example, it seems that BootsTAP uses pre-trained TAPIR as its backbone. I wonder what the training time stats for it were, in case I missed them in the paper.

Thank you!

(I also noted #59.)

@shivanimall shivanimall changed the title training time stats TAPIR training time stats Jun 12, 2024
@cdoersch
Collaborator

Sorry, we haven't been releasing much on the training loss curves because it's difficult to maintain them. BootsTAPIR took about 2 weeks to train on YouTube data on 256 A100 GPUs, but that's due to the diversity of YouTube. Further fine-tuning it on Libero took about 3 days (50K steps) on 128 GPUs, but it only took that long because we were jointly training on YouTube and Kubric as well. I suspect that a much shorter training run would produce similar results, but we haven't done rigorous experiments.

Unfortunately, we haven't yet released the BootsTAP training code, and it's unclear when we'll be able to get to it considering how busy the team is. I'm hoping it'll be before ECCV, but I can't make any guarantees.
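
For anyone budgeting compute from the figures quoted above, here is a rough back-of-envelope sketch. It assumes "about 2 weeks" means 14 days and "about 3 days" means exactly 3 days of continuous wall-clock training, which is only an approximation; the helper names are made up for illustration and are not part of the TAPIR codebase. The GPU type for the 128-GPU run is not stated in the thread.

```python
# Back-of-envelope compute estimates from the approximate figures in this thread.
# All durations are assumptions: "about 2 weeks" -> 14 days, "about 3 days" -> 3 days.

def gpu_hours(days: float, num_gpus: int) -> float:
    """Total GPU-hours for a run of `days` wall-clock days on `num_gpus` GPUs."""
    return days * 24 * num_gpus

def seconds_per_step(days: float, steps: int) -> float:
    """Average wall-clock seconds per training step."""
    return days * 24 * 3600 / steps

# BootsTAPIR training on YouTube data: ~2 weeks on 256 A100 GPUs.
bootstapir_gpu_hours = gpu_hours(14, 256)

# Fine-tuning on Libero (jointly with YouTube and Kubric): ~3 days, 50K steps, 128 GPUs.
libero_gpu_hours = gpu_hours(3, 128)
libero_sec_per_step = seconds_per_step(3, 50_000)

print(f"BootsTAPIR training: ~{bootstapir_gpu_hours:,.0f} GPU-hours")
print(f"Libero fine-tuning:  ~{libero_gpu_hours:,.0f} GPU-hours")
print(f"Libero fine-tuning:  ~{libero_sec_per_step:.2f} s/step")
```

These numbers are upper bounds for a single-dataset fine-tune, since the Libero run was slowed by joint training on other datasets and a shorter run may give similar results.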

@shivanimall
Author

shivanimall commented Jun 17, 2024

Thank you for sharing these! I will leave this issue open, as I may follow up here with further questions and share my own training time stats, in case that helps anyone.
