PyTorch implementation for "ECO: Efficient Convolutional Network for Online Video Understanding", ECCV 2018 https://github.com/zhang-can/ECO-pytorch

Frame-Recurrent Video Super-Resolution https://github.com/msmsajjadi/frvsr https://ei.is.tuebingen.mpg.de/~msajjadi

用AI分析足球视频，AI在体育这块的应用还有很大的前景 https://www.bilibili.com/video/BV1A64y1M78H/

R-C3D: Region Convolutional 3D Network for Temporal Activity Detection https://github.com/VisionLearningGroup/R-C3D

PyTorch 行为识别模型库 https://github.com/coderSkyChen/Action_Recognition_Zoo

Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition https://github.com/kevin-ssy/Optical-Flow-Guided-Feature

Multi-Fiber Networks for Video Recognition https://github.com/cypw/PyTorch-MFNet

Code and models of paper " ECO: Efficient Convolutional Network for Online Video Understanding", ECCV 2018 https://github.com/mzolfaghari/ECO-efficient-video-understanding

SlowFast-Networks Reproduce slowfast networks in pytorch(unofficial) https://github.com/Guocode/SlowFast-Networks

This repo holds the codes of paper: "BSN: Boundary Sensitive Network for Temporal Action Proposal Generation", which is accepted in ECCV 2018. You can also find pytorch-version implementation in [BSN.pytorch]. https://github.com/wzmsltw/BSN-boundary-sensitive-network

Unofficial example code for using a pre-trained Distilled 3D Network (D3D) for video classification. https://github.com/princeton-vl/d3dhelper

X-Temporal：开源视频分类/理解代码包 https://github.com/Sense-X/X-Temporal

PyTorch implementation of "SlowFast Networks for Video Recognition". https://github.com/r1ch88/SlowFastNetworks

Stochastic Adversarial Video Prediction 预测生成视频下面的内容，可以做视频插值 https://github.com/alexlee-gk/video_prediction

Deep Video Analytics是个视频分析平台，能够从视频和图像中抽取并索引信息。可以用于视频搜索、视频识别检测、OCR等。 https://github.com/AKSHAYUBHAT/DeepVideoAnalytics

深度学习NBA球员行为识别/统计 https://github.com/neeilan/DeepPlayByPlay

ECO: Efficient Convolutional Network for Online Video Understanding https://github.com/mzolfaghari/ECO-pytorch

视频研究常用方法、数据集和任务汇总 https://github.com/gsig/PyVideoResearch

Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101 https://github.com/HHTseng/video-classification

深度学习视频高效加载器 https://github.com/zhreshold/decord

Decoupling Localization and Classification in Single Shot Temporal Action Detection https://github.com/HYPJUDY/Decouple-SSAD

R-C3D pytorch implementation https://github.com/sunnyxiaohu/R-C3D.pytorch

Python/OpenCV实现的视频场景检测与分析 https://github.com/Breakthrough/PySceneDetect

MMAction：PyTorch开源视频行为识别工具包 https://github.com/open-mmlab/mmaction

视频分析/多模态学习论文、代码、数据集大列表 https://github.com/HuaizhengZhang/Awsome-Deep-Learning-for-Video-Analysis

VideoGraph: Recognizing Minutes-Long Human Activities in Videos, ICCV 2019 Workshop https://github.com/noureldien/videograph

STCN: Stochastic Temporal Convolutional Networks https://github.com/emreaksan/stcn

PyTorch实现的C3D, R3D, R2Plus1D视频行为识别 https://github.com/jfzhang95/pytorch-video-recognition

Resources about activity recognition-行为识别资料汇总’ https://github.com/jindongwang/activityrecognition

【视频架构搜索】《Video Architecture Search | Google AI Blog》 https://github.com/google-research/google-research/tree/master/evanet

Facebook 开源 SlowFast：双帧率分析视频识别 https://github.com/facebookresearch/SlowFast

This is the official repo for "S3D: Single Shot multi-Span Detector via Fully 3D Convolutional Network" https://github.com/dazhang-cv/S3D

code for learning trajectory dependencies for human motion prediction https://github.com/wei-mao-2019/LearnTrajDep

Code and models for our CVPR'19 paper "Representation Flow for Action Recognition" https://github.com/piergiaj/representation-flow-cvpr19

"Attention in Convolutional LSTM for Gesture Recognition" in NIPS 2018 https://github.com/GuangmingZhu/AttentionConvLSTM

Code and models for our ICCV'19 paper "Evolving Space-Time Neural Architectures for Videos" https://github.com/piergiaj/evanet-iccv19

Unofficial PyTorch implementation of the CVPR'19 paper "Skeleton-Based Action Recognition with Directed Graph Neural Networks". https://github.com/kenziyuliu/DGNN-PyTorch

This repo contains a PyTorch S3D Text-Video model trained from scratch on HowTo100M using MIL-NCE https://github.com/antoine77340/S3D_HowTo100M

A Faster Pytorch Implementation of R-C3D https://github.com/sunnyxiaohu/R-C3D.pytorch

Pytorch code for End-to-End Audiovisual Speech Recognition https://github.com/mpc001/end-to-end-lipreading

NSD: Neural Scene Decomposition for Human Motion Capture https://github.com/hrhodin/NeuralSceneDecomposition

I3D implemetation in Keras + video preprocessing + visualization of results https://github.com/oanaignat/i3d_keras

CVPR2019 STEP: Spatio-Temporal Progressive Learning for Video Action Detection

https://github.com/NVlabs/STEP

Repository for the paper "Adversarial Framing for Image and Video Classification" https://github.com/zajaczajac/adv_framing

'IG-65M PyTorch - PyTorch 3D video classification models pre-trained on 65 million Instagram videos' https://github.com/moabitcoin/ig65m-pytorch

Keras深度学习视频分类 https://www.pyimagesearch.com/2019/07/15/video-classification-with-keras-and-deep-learning/

简易快速的视频深度特征抽取 https://github.com/antoine77340/video_feature_extractor

Code for the paper "High Speed and High Dynamic Range Video with an Event Camera" (arXiv, 2019). http://rpg.ifi.uzh.ch/E2VID.html https://github.com/uzh-rpg/rpg_e2vid

Timeception for Complex Action Recognition https://github.com/noureldien/timeception

MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation https://github.com/yabufarha/ms-tcn

Deep Spatio-Temporal Neural Network (DSTN) https://github.com/oywtece/dstn

Learning to align and match videos with kernelized temporal layers https://github.com/facebookresearch/videoalignment

Self-supervised Spatiotemporal Learning via Video Clip Order Prediction https://github.com/xudejing/video-clip-order-prediction

Real-time Hand Gesture Recognition with PyTorch on EgoGesture, NvGesture and Jester https://arxiv.org/abs/1901.10323

https://github.com/ahmetgunduz/Real-time-GesRec

【MMAction2：下一代行为理解工具箱和基准】 https://github.com/open-mmlab/mmaction2

Temporal Pyramid Network for Action Recognition https://github.com/decisionforce/TPN

Code repository for the paper: 'Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks' https://github.com/joaanna/something_else

Zero-shot video classification by end-to-end training of 3D convolutional neural networks https://github.com/bbrattoli/ZeroShotVideoClassification

Spatio-Temporal Action Localization System https://github.com/MVIG-SJTU/AlphAction

Gate-Shift Networks for Video Action Recognition - CVPR 2020 https://github.com/swathikirans/GSM

Learn to cycle: Time-consistent feature discovery for action recognition https://github.com/alexandrosstergiou/Squeeze-and-Recursion-Temporal-Gates

We present a new action tubelet detection framework, termed as MovingCenter Detector (MOC-detector), by treating an action instance as a trajectory of moving points.

https://github.com/MCG-NJU/MOC-Detector

Repository for PREDICT & CLUSTER: Unsupervised Skeleton Based Action Recognition https://github.com/shlizee/Predict-Cluster

The code for our paper 《Self-Supervised Temporal-Discriminative Representation Learning for Video Action Recognition》

https://github.com/FingerRec/Self-Supervised-Temporal-Discriminative-Representation-Learning-for-Video-Action-Recognition

The Pytorch code of the TEA module (Temporal Excitation and Aggregation for Action Recognition) https://github.com/Phoenix1327/tea-action-recognition

Official repo for ECCV 2020 paper - RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition https://github.com/StanfordVL/RubiksNet

基于PyTorch的电影/视频研究工具集 https://github.com/movienet/movienet-tools

Kubric：数据生成管道，用于创建带有丰富标注的半真实合成多目标视频，如实例分割掩码、深度图和光流等 https://github.com/google-research/kubric

PaddleVideo：飞桨视频模型开发套件 https://github.com/PaddlePaddle/PaddleVideo

Temporal-Relational CrossTransformers for Few-Shot Action Recognition

https://github.com/tobyperrett/trx

An Efficient PointLSTM for Point Clouds Based Gesture Recognition https://github.com/Blueprintf/pointlstm-gesture-recognition-pytorch

Boundary-Aware Cascade Networks for Temporal Action Segmentation https://github.com/MCG-NJU/BCN

Official repo for ECCV 2020 paper - RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition https://github.com/StanfordVL/RubiksNet

This repository contains the code implementation used in the paper Temporally Coherent Embeddings for Self-Supervised Video Representation Learning (TCE). https://github.com/csiro-robotics/TCE

MotionSqueeze: Neural Motion Feature Learning for Video Understanding https://github.com/arunos728/MotionSqueeze

Pytorch implementation of [Late Temporal Modeling in 3D CNN Architectures with BERT for Action Recognition] https://github.com/artest08/LateTemporalModeling3DCNN

This repository contains a PyTorch implementation for "X3D: Expanding Architectures for Efficient Video Recognition models" https://github.com/kkahatapitiya/X3D-Multigrid

Efficient 3D Backbone Network for Temporal Modeling https://github.com/youngwanLEE/VoV3D

MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation (TPAMI 2020) https://github.com/sj-li/MS-TCN2

Motion capture from internet videos https://github.com/zju3dv/EasyMocap

A Closer Look at Temporal Sentence Grounding in Videos: Datasets and Metrics https://github.com/yytzsy/grounding_changing_distribution

TimeSformer：视频理解新架构 TimeSformer: A new architecture for video understanding https://ai.facebook.com/blog/timesformer-a-new-architecture-for-video-understanding

TimeSformer：视频理解新架构 https://arxiv.org/abs/2102.05095

《TDN: Temporal Difference Networks for Efficient Action Recognition》(CVPR 2021)

github.com/MCG-NJU/TDN

《Monocular Real-time Hand Shape and Motion Capture using Multi-modal Data》(2021) github.com/MengHao666/Minimal-Hand-pytorch

DeepLabCut: 用于执行多种任务的动物无标记姿态估计工具箱 github.com/DeepLabCut/DeepLabCut

MVFNet: Multi-View Fusion Network for Efficient Video Recognition (AAAI 2021) github.com/whwu95/MVFNet

《TransNet V2: Shot Boundary Detection Neural Network》(2020) github.com/soCzech/TransNetV2

PyTorchVideo：深度学习视频理解库 github.com/facebookresearch/pytorchvideo

外科视频分析相关文献集 github.com/Finspire13/Awesome-Surgical-Video-Analysis

Learning Representational Invariances for Data-Efficient Action Recognition

github.com/vt-vl-lab/video-data-aug

《Global2Local: Efficient Structure Search for Video Action Segmentation》(CVPR 2021)

github.com/ShangHua-Gao/G2L-search

《Learning Salient Boundary Feature for Anchor-free Temporal Action Localization》(CVPR 2021) github.com/TencentYoutuResearch/ActionDetection-AFSD

《Vid2Doppler: Synthesizing Doppler Radar Data from Videos for Training Privacy-Preserving Activity Recognition》(2021) github.com/FIGLAB/Vid2Doppler

Keras实例：CNN-RNN架构实现视频分类 https://keras.io/examples/vision/video_classification/

《Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks》(2021) github.com/MenghaoGuo/EANet

DMVR: DeepMind视频数据读取模块 github.com/deepmind/dmvr

Video Contrastive Learning with Global Context github.com/amazon-research/ video-contrastive-learning

Token Shift Transformer for Video Classification github.com/VideoNetworks/TokShift-Transformer

AutoVideo: 自动视频动作识别系统

github.com/datamllab/autovideo

Revisiting 3D ResNets for Video Recognition github.com/tensorflow/models/tree/master/official

EssentialMC2：视频理解系统 github.com/alibaba/EssentialMC2

用视频和图像共同训练的Transformer改善动作识别

https://ai.googleblog.com/2022/03/co-training-transformer-with-videos-and.html

'video-transformers - Easiest way of fine-tuning HuggingFace video classification models' by fatih GitHub: github.com/fcakyon/video-transformers

提出一种新的流视频模型，用于统一处理基于帧和序列的视频理解任务。 https://arxiv.org/abs/2303.17228 [CV]《Streaming Video Model》Y Zhao, C Luo, C Tang, D Chen, N Codella, Z Zha [Microsoft Research Asia] (2023)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

VideoC.md

VideoC.md

CVPR2019 STEP: Spatio-Temporal Progressive Learning for Video Action Detection

Real-time Hand Gesture Recognition with PyTorch on EgoGesture, NvGesture and Jester https://arxiv.org/abs/1901.10323

We present a new action tubelet detection framework, termed as MovingCenter Detector (MOC-detector), by treating an action instance as a trajectory of moving points.

The code for our paper 《Self-Supervised Temporal-Discriminative Representation Learning for Video Action Recognition》

Temporal-Relational CrossTransformers for Few-Shot Action Recognition

《TDN: Temporal Difference Networks for Efficient Action Recognition》(CVPR 2021)

Learning Representational Invariances for Data-Efficient Action Recognition

《Global2Local: Efficient Structure Search for Video Action Segmentation》(CVPR 2021)

AutoVideo: 自动视频动作识别系统

用视频和图像共同训练的Transformer改善动作识别

Files

VideoC.md

Latest commit

History

VideoC.md

File metadata and controls

CVPR2019 STEP: Spatio-Temporal Progressive Learning for Video Action Detection

Real-time Hand Gesture Recognition with PyTorch on EgoGesture, NvGesture and Jester https://arxiv.org/abs/1901.10323

We present a new action tubelet detection framework, termed as MovingCenter Detector (MOC-detector), by treating an action instance as a trajectory of moving points.

The code for our paper 《Self-Supervised Temporal-Discriminative Representation Learning for Video Action Recognition》

Temporal-Relational CrossTransformers for Few-Shot Action Recognition

《TDN: Temporal Difference Networks for Efficient Action Recognition》(CVPR 2021)

Learning Representational Invariances for Data-Efficient Action Recognition

《Global2Local: Efficient Structure Search for Video Action Segmentation》(CVPR 2021)

AutoVideo: 自动视频动作识别系统

用视频和图像共同训练的Transformer改善动作识别