[CVPR2015] Is object localization for free? – Weakly-supervised learning with convolutional neural networks論文筆記

阿新 • • 發佈：2018-04-03

sed pooling was 技術分享 sco 評測 5.0 ict highest

亮點

一個好名字給了讓讀者開始閱讀的理由
global max pooling over sliding window的定位方法值得借鑒

方法

本文的目標是：設計一個弱監督分類網絡，註意本文的目標主要是提升分類。因為是2015年的文章，方法比較簡單原始。

Following three modifications to a classification network.

Treat the fully connected layers as convolutions, which allows us to deal with nearly arbitrary-sized images as input.

The aim is to apply the network to bigger images in a sliding window manner thus extending its output to n×m× K, where n and m denote the number of sliding window positions in the x- and y- direction in the image, respectively.
3xhxw —> convs —> kxmxn (k: number of classes)

Explicitly search for the highest scoring object position in the image by adding a single global max-pooling layer at the output.

kxmxn —> kx1x1
The max-pooling operation hypothesizes the location of the object in the image at the position with the maximum score

Use a cost function that can explicitly model multiple objects present in the image.

因為圖中可能有很多物體，所以多類的分類loss不適用。作者把這個任務視為多個二分類問題，loss function和分類的分數如下

技術分享圖片

training

技術分享圖片

muti-scale test

技術分享圖片

實驗

classification

mAP on VOC 2012 test: ＋3.1% compared with [56]
mAP on VOC 2012 test: ＋7.6% compared with kx1x1 output and single scale training
mAP on VOC: ＋2.6% compared with RCNN
mAP on COCO 62.8%

Localisation

Metric: if the maximal response across scales falls within the ground truth bounding box of an object of the same class within 18 pixels tolerance, we label the predicted location as correct. If not, then we count the response as a false positive (it hit the background), and we also increment the false negative count (no object was found).
metric on VOC 2012 val: -0.3% compared with RCNN
mAP on COCO 41.2%

缺點

定位評測的metric不具有權威性
max pooling改為average pooling會不會對於多個instance的情況更好一些

[CVPR2015] Is object localization for free? – Weakly-supervised learning with convolutional neural networks論文筆記

sed pooling was 技術分享 sco 評測 5.0 ict highest p.p1 { margin: 0.0px 0.0px 0.0px 0.0px; font: 15.0px "Helvetica Neue"; color: #323333 } p.p2

論文筆記：Is object localization for free?

Is object localization for free? Weakly-supervised learning with convolutional neural networks 摘要提出一個弱監督卷積神經網路for 分類。主要貢獻有：

【論文閱讀】Learning Dual Convolutional Neural Networks for Low-Level Vision

論文閱讀（【CVPR2018】Jinshan Pan - Learning Dual Convolutional Neural Networks for Low-Level Vision）本文針對低層視覺問題，提出了一般性的用於解決低層視覺問題的對偶卷積神經網路。作者認為，低層視覺問題，如常見的有

Bag of Tricks for Image Classification with Convolutional Neural Networks

Bag of Tricks for Image Classification with Convolutional Neural Networks，李沐大神18年12月的新作，用卷積神經網路進行影象分類的一些技巧。論文：Bag of Tricks for Image Classific

Machine Learning is Fun! Part 3: Deep Learning and Convolutional Neural Networks

We can train this kind of neural network in a few minutes on a modern laptop. When it’s done, we’ll have a neural network that can recognize pictures of “8

【論文閱讀】Bag of Tricks for Image Classification with Convolutional Neural Networks

Bag of Tricks for Image Classification with Convolutional Neural Networks 論文：https://arxiv.org/pdf/1812.01187.pdf 本文作者總結了模型訓練過程中可以提高準確率的方法,如題，

Effective Use ofWord Order for Text Categorization with Convolutional Neural Networks（閱讀理解）

一篇公開在2014年的文章，從現在的角度來看這篇文章的話，我們發現作者提出的方法很難算是主流方法，但在當時也有一定的啟發意義。這裡我們就簡單介紹一下這篇文章。本文提出了將CNN直接應用於高維度的文字資料上，為我們提供了兩者CNN網路Seq-CNNAs a running to

[CVPR 2016] Weakly Supervised Deep Detection Networks論文筆記

del found score feature 圖片 http spl span 根據 p.p1 { margin: 0.0px 0.0px 0.0px 0.0px; font: 13.0px "Helvetica Neue"; color: #323333 } p.p2

Ask HN: What is your advice for a technical founder learning sales?

Think less about pushing your product, and more about aligning your product to a stated need.Instead of "how do I get Product X into Company Y," ask "how c

課程四(Convolutional Neural Networks)，第三周（Object detection） —— 0.Learning Goals

member 數據定位 finding dataset pre intersect sta nal Learning Goals: Understand the challenges of Object Localization, Object Detection a

課程四(Convolutional Neural Networks)，第三周（Object detection） —— 1.Practice questions：Detection algorithms

car mage 分享圖片 nbsp blog obj 分享圖片 pos 【解釋】 tree的兩個bounding boxes 都要保留，因為交並比小於0.5；car 0.73保留；pedestrain 0.98保留；motor

Understanding Convolutional Neural Networks for NLP

n) rnn eas published previous depend tput parameter www. When we hear about Convolutional Neural Network (CNNs), we typically think of Co

EffNet: An Efficient Structure for Convolutional Neural Networks

EffeNet對MoblieNet網路進行改進,主要思想為: 首先,將MoblieNet的 3×3 3\times3的depthwise convolution層分解為兩個 3×1 3\times1, 1×3 1\ti

深層CNN的調參經驗 | A practical theory for designing very deep convolutional neural networks

A practical theory for designing very deep convolutional neural networks 兩個前提假設： 1.對於每一個卷積層，其學習更復雜表示的能力應該被保證 2.最高層的感受野應該不大於影象範圍

A Sensitivity Analysis of Convolutional Neural Networks for Sentence Classification

引言 Ye Zhang在2016年掛在arXiv上的論文，從名字大概可以看出來，這是一篇CNN調參指南。概述模型方面用的是單層CNN，主要是CNN用做文字分類方面的研究，模型結構如下所示：上述模型來自Convolutional Neural Networks for

Fast and accurate object detection in high resolution 4K and 8K video using GPUs 論文筆記

文章目錄一、基本資訊二、研究背景三、創新點 3.1 概述 3.2 詳解 3.2.1 問題分析 3.2.2 Attention pipeline 3.2.3 Implementation

《Convolutional Neural Networks for Sentence Classification》論文結構解讀

1.資料以某一雙鞋子為例，評論結果作為標籤（2分類：好評，差評）【穿了一段時間，不錯，喜歡的下單吧；好評】【鞋子收到了，不是很滿意。沒有吊牌，一直都是還是隻有我這一雙是；差評】資料處理步驟：把所有評論資料集分詞，去除停用詞，然後構建word2index，然後表示“句子”，以

[深度學習] Image Classification影象分類之Bag of Tricks for Image Classification with Convolutional Neural Net

論文全稱：《Bag of Tricks for Image Classification with Convolutional Neural Networks》論文地址：https://arxiv.org/pdf/1812.01187.pdf 這篇文章主要討論最近這些訓練神經網路的tric

Paper Review: fpgaConvNet--A Framework for Mapping Convolutional Neural Networks on FPGAs

注：本文中所有的圖片均擷取自原文作者的論文和講稿。基本資訊題目：fpgaConvNet：一個將CNN對映到FPGA上的平臺作者：Stylianos I. Venieris， Christos-Savvas Bouganis 機構：Imperial College Londo

學習筆記之Supervised Learning with scikit-learn | DataCamp

Supervised Learning with scikit-learn | DataCamp https://www.datacamp.com/courses/supervised-learning-with-scikit-learn At the end of day, the value of D

[CVPR2015] Is object localization for free? – Weakly-supervised learning with convolutional neural networks論文筆記

相關推薦