I3D論文解讀(Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset)

阿新 • • 發佈：2019-01-10

論文：Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

期刊：CVPR2017

papar:https://arxiv.org/pdf/1705.07750v1.pdf

相關工作：

相關工作就是下面這個圖

文章兩個重大貢獻：1 提出了kinetics資料集。2 提出了雙流3D卷積模型

3D ConvNet

模型細節：是原論文中C3D的變種。8層卷積、5層pooling、2層全連線。與C3D的區別在於這裡的卷積和全連線層後面加BN；且在第一個pooling層使用stride=2，這樣使得batch_size可以更大。輸入是16幀，每幀112*112。

Two-Stream Networks

LSTM缺點：能model高層變化卻不能捕捉低層運動(因為在低層，每個幀都是獨立地被CNN提取特徵)，有些低層運動可能是重要的；訓練很昂貴
Two-Stream Networks: 將單獨的一張RGB圖片和一疊計算得到的光流幀分別送入在ImageNet上預訓練的ConvNet中，再把兩個通道的score取平均

New*: Two-Stream Inflated 3D ConvNets

Implementation Details

模型：

實驗結果，可以看到I3D的準確率提高了許多：

參考文章：

https://blog.csdn.net/paranoid_cnn/article/details/77933316

https://blog.csdn.net/Gavinmiaoc/article/details/81208997

https://blog.csdn.net/zzmshuai/article/details/84936338

I3D論文解讀(Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset)

論文：Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset 期刊：CVPR2017 papar:https://arxiv.org/pdf/1705.07750v1.pdf 相關工作：相關工作就是

【論文閱讀】Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

【論文閱讀】Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset 這是一篇2017CVPR的論文，我感覺這篇論文最大的貢獻就是提出了kinetics資料集，這個資料集與之前的行為識別資料集相比有質的飛躍。同

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

本文是deepmind出品，目的，就一個，放出個關於視訊方面的訓練集kinetics，一個四百個類，每個類有至少四百個clips，每個clips十秒鐘，屬於從youtube上剪下的視訊，然後對比了幾種現在存在的用於行為識別的幾種框架，具體如下圖：其中，a，b

Good Bye 2018 A. New Year and the Christmas Ornament

傳送門 https://www.cnblogs.com/violet-acmer/p/10201535.html 題解：　　這題沒什麼好說的，讀懂題意就會了。比賽程式碼： 1 #include<iostream> 2 using namespac

論文解讀：Ask Your Neurons: A Neural-based Approach to Answering Questions about Images

這是關於VQA問題的第三篇系列文章，這篇文章是一篇比較經典的文章，所以跟大家分享。本篇文章將介紹論文：主要思想；模型方法；主要貢獻。有興趣可以檢視原文：Ask Your Neurons: A Neural-based Approach to Answering Questions abo

【論文閱讀】Human Action Recognition using Factorized Spatio-Temporal Convolutional Networks

【論文閱讀】Human Action Recognition using Factorized Spatio-Temporal Convolutional Networks 這是一篇15年ICCV的論文，在15年的時候，3D卷積網路剛剛興起，但是因為3D卷積網路的引數量較多，而且訓練資料

論文解讀：DeLiGAN: Generative Adversarial Networks for Diverse and Limited Data

前言：DeLiGAN是計算機視覺頂會CVPR2017發表的一篇論文，本文將結合Python原始碼學習DeLiGAN中的核心內容。DeLiGAN最大的貢獻就是將生成對抗網路（GANs）的輸入潛空間編碼為混合模型（高斯混合模型），從而使得生成對抗網路（GANs）在數量有限但具有多樣性的訓練資料上表現出較

Antarctic Site Dome A Promises to Open a New Window on the Remote Automation Cosmos

www.inhandnetworks.com Equipment deployed at Dome A in Antartica, a site as high as Maunakea and 10 times drier, showed that it would be an idea

Physicists Present a New Theory on the Origin of D remote connectivity ark Matter

Calculations for the new dark matter model developed at Mainz University Physicists have now come up with a new theory on how dark

BigGAN: A New State of the Art in Image Synthesis

“Best GAN samples ever yet? Very impressive ICLR submission! BigGAN improves Inception Scores by >100.”The above Tweet is from renowned Google DeepMind

A Mathematical Model Captures the Political Impact of Fake News

This story is for Medium members.Continue with FacebookContinue with GoogleMedium curates expert stories from leading publishers exclusively for members (w

The cart before the horse: A new model of cause and effect

But in many cases, this one-way relationship between cause and effect fails to accurately describe reality. In a recent paper in Nature Communications, sc

【轉】How to create a new user and grant permissions in MySQL

MySQL is one of the most popular database management systems. In this tutorial we will cover the steps needed to create new MySQL user and grant permission

Xcode No account for team "". Add a new account in the Accounts preference pane or verify that your accounts have valid credenti

問題背景 Xcode報錯誤資訊：No account for team "QMP96B5DPW". Add a new account in the Accounts preference pane or verify that your accounts have valid credentials.

I3D論文解讀(Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset)

I3D論文解讀(Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset)

【論文閱讀】Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

Good Bye 2018 A. New Year and the Christmas Ornament

論文解讀：Ask Your Neurons: A Neural-based Approach to Answering Questions about Images

【論文閱讀】Human Action Recognition using Factorized Spatio-Temporal Convolutional Networks

論文解讀：DeLiGAN: Generative Adversarial Networks for Diverse and Limited Data

Antarctic Site Dome A Promises to Open a New Window on the Remote Automation Cosmos

Physicists Present a New Theory on the Origin of D remote connectivity ark Matter

BigGAN: A New State of the Art in Image Synthesis

A Mathematical Model Captures the Political Impact of Fake News

The cart before the horse: A new model of cause and effect

【轉】How to create a new user and grant permissions in MySQL

Xcode No account for team "". Add a new account in the Accounts preference pane or verify that your accounts have valid credenti

Optical Flow Guided Feature A Fast and Robust Motion Representation for Video Action Recognition論文解讀

論文筆記 | A Closer Look at Spatiotemporal Convolutions for Action Recognition

CVPR2016之A Key Volume Mining Deep Framework for Action Recognition論文閱讀（視訊關鍵幀選取）

【論文閱讀】A Closer Look at Spatiotemporal Convolutions for Action Recognition

MSCNN論文解讀-A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection

【CV論文閱讀】Two stream convolutional Networks for action recognition in Vedios

I3D論文解讀(Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset)

相關推薦