Finding Tiny Faces in the Wild with Generative Adversarial Network 論文學習

阿新 • • 發佈：2019-01-04

Finding Tiny Faces in the Wild with Generative Adversarial Network

2018年的cvpr

論文地址：http://openaccess.thecvf.com/content_cvpr_2018/papers/Bai_Finding_Tiny_Faces_CVPR_2018_paper.pdf

Abstract

挑戰：在無限制條件尋找低解析度的人臉

方法：採用GAN從低解析度人臉生成高解析度人臉

一. Introduction

低解析度面臨的挑戰：

1. 缺乏細節用於區分

2. 目前CNN模型卷積核步長太長，對於低解析度人臉識別容易丟失大部分資訊

本文做出的貢獻：
1. 提出了一種新的人臉檢測的統一端到端卷積神經網路結構，採用超解析度和細化網路生成真實清晰的高解析度影象，並引入判別網路對人臉與非人臉進行分類。

2. 引入新的判別器loss

二. Related Work

2.1 face detection

介紹了下過去人臉識別的發展，從手工構建特徵人臉識別，到RCNN，再到本文。

2.2 Super-resolution and Refinement Network

2.3. Generative Adversarial Networks

三. Proposed Method

3.1 GAN

其中y是標籤（區分是臉or不是臉）

3.2. Network Architecture

簡介：生成器包含兩個子網路：超解析度和細化網路

判別器中加入分支網路，用於區分人臉非人臉和生成圖片和真實圖片

Generator network：

Discriminator network：

可以看到右邊有兩個分支

3.3. Loss Function

Pixel-wise loss：

其中G1是上取樣子網，G2是細化網路

Adversarial loss：

Classification loss：

yn＝1或yn＝0表示影象是人臉還是非人臉

聯合起來

G_loss:

包含

畫素loss+GAN_loss+分類loss

D_loss:

包含GAN_loss+分類loss

四. Experiments

4.1. Training and Validation Datasets

有兩個資料集，包括WIDER FACE dataset和FDDB，用WINDER FACE訓練GAN

4.2. Implementation Details

α = 0.001 and β = 0.01，用β1=0.9的Adam優化器

從零開始，對生成器網路進行訓練，用標準偏差為0.02的零均值高斯分佈初始化各層的權重，用0初始化偏差。

為了避免不希望的區域性最優，首先訓練一個基於MSE的SR網路去初始化生成網路

採用在ImageNet上預先訓練的VGG19模型作為骨幹網路，用兩個並行的f c層替換所有的fc層。用標準偏差為0.1的零均值高斯分佈對f c層進行初始化，所有偏置用0初始化。

作者的基線MB-FCN檢測器基於RESNET50網路，它是對ImageNet進行預訓練的

通過它從WINDER FACE中獲取人臉與非人臉，通過使用因子為4的雙三次插值對高解析度影象進行下采樣來生成相應的低解析度影象。

所有GAN變異體均以10^-4的學習速率訓練前3個階段，以10^-5的較低學習速率訓練後3個階段。

4.3. Ablation Studies

(消融研究通常指去除模型或演算法的一些“特徵”，並觀察這些特徵如何影響效能)

refinement network的效果圖（可以減少光照和模糊的影響）

4.4. Comparison with the State-of-the-Art

Evaluation on WIDER FACE.：

提升效果的原因：

（1）上取樣子網

（2）細化網路

（3）GAN的分類損失

Evaluation on FDDB

4.5. Qualitative Results

5. Conclusion

就到總結了，結尾作者說了下本文的貢獻，基本都是重複內容。

p.s. 還有點疑問啊，到底是怎麼檢測圖片中人臉的呢？？用判別器？

Finding Tiny Faces in the Wild with Generative Adversarial Network 論文學習

Finding Tiny Faces in the Wild with Generative Adversarial Network 2018年的cvpr 論文地址：http://openaccess.thecvf.com/content_cvpr_2018/papers

Finding Tiny Faces in the Wild With Generative Adversarial Network 感想

你看今年cvpr的這篇文章，提供了一種寫文章的思路：基本上算是拿GAN在face super resolution上應用，自然要在低解析度的tiny face上做。這種工作意義大不大還真不好說，但是做的performance好了發cvpr還是沒問題的，因為它看起來很新：做人臉

Labeled Faces in the Wild 人臉識別數據集

sig pop detection labs not ins recommend size hal http://blog.csdn.net/garfielder007/article/details/51480525 New (draft) survey paper:La

論文速讀（Jiaming Liu——【2019】Detecting Text in the Wild with Deep Character Embedding Network ）

整體 text one ext red more show 檢測 another Jiaming Liu——【2019】Detecting Text in the Wild with Deep Character Embedding Network 論文 Jiaming L

《Recursive Recurrent Nets with Attention Modeling for OCR in the Wild》筆記

該文提出了一個基於注意力模型的遞迴迴圈神經網路模型（R2AM),解決在在無字典約束的條件下，對自然場景文字進行識別.提出的模型主要有以下幾個優點：（1）採用了迴圈的CNN網路，可以更加有效和準確地提取影象特徵；（2）在一個隱式的字元級別識別模型中嵌入一個R

DensePose: Dense Human Pose Estimation In The Wild（理解）

0 - 背景　　Facebook AI Research（FAIR）開源了一項將2D的RGB影象的所有人體畫素實時對映到3D模型的技術（DensePose）。支援戶外和穿著寬鬆衣服的物件識別，支援多人同時識別，並且實時性良好。 1 - 思路 1.1 - 標註資料集　　對於一般的姿態識別（骨骼追蹤）

DensePose:Dense Human Pose Estimation In The Wild 論文閱讀筆記

一、本文主要是Facebook AI 和INRIA 聯合出品，基於RCNN架構，以及Mask RCNN的多工結構，開源http://densepose.org 二、主要工作分為三點 1：標註了一個新的資料集，基於coco資料集，增加了u

自然場景文字處理論文整理（5）Detecting Curve Text in the Wild: New Dataset and New Solution

這篇文章是在自然場景文字處理中針對彎曲問題做的非常好的一篇文章。後面打算先用這篇論文來做實驗。 paper：https://arxiv.org/abs/1712.02170 github:https://github.com/Yuliang-Liu/Curve-Text-Detect

深度補全（Single-Image Depth Perception in the Wild）

Single-Image Depth Perception in the Wild arXiv:1604.03901v2 [cs.CV] 6 Jan 2017 Abstract 本文研究了戶外的深度感知，即從無約束設定下單個影象恢復深度。本文介紹了一種新的戶外資料集深度，由戶

ICDAR2017 Competition on Reading Chinese Text in the Wild(RCTW-17) 介紹

閱讀文章：《ICDAR2017 Competition on Reading Chinese Text in the Wild(RCTW-17)》　　這篇文章是對一項中文檢測和識別比賽專案（RCTW）的介紹和總結，這是一項新的專注於中文識別的競賽。這項競賽的特點在於，包含12263張標註過的中文資料集，有

Chinese Text in the Wild 學習筆記

CTW資料集下載地址： CTW dataset Download from one of the following links. 騰訊微雲https://share.weiyun.com/50hF1Cc OneDrivehttps://1drv.ms/f/s!Al-inEPeCze

AFLW:Annotated Facial Landmarks in the Wild: A large-scale, real-world database for facial landmark

簡單翻譯了一下AFLW的論文（解釋說明書）。 AFLW是一個人臉庫，一共有25993張人臉影象，它最突出的特點是在人臉關鍵點上定位了21個點，更容易被檢測。其次圖片質量比較高，不僅僅是室內，還有室外，側臉等難於檢測的情況都涵蓋在它的人臉庫中。 AFLW提供alw.sqlite，資料

[CSS3] Target HTML Elements not Explicitly set in the DOM with CSS Pseudo Elements

border lose lac imp close election flex size selection Pseudo elements allow us to target elements that are not explicitly set in the

Paper Reading: Pose-Aware Face Recognition in the wild

Pose-Aware Face Recognition in the wild (CVPR 2016) paper link: https://www.cv-foundation.org/openaccess/content_cvpr_2016/papers/Masi_Pose-Awar

Summary——DensePose: Dense Human Pose Estimation In The Wild

Research question：在一張RGB圖片和一個曲面模型上建立對應。RGB圖片來自COCO資料集（本文篩選出含有人物的圖片），除此之外，由一個人體的表面模型（這個模型應該是立體的）為24個體塊分別找到六張不同角度（當呈現在標註者面前的時候也是平面的圖片），本文就

Linear Regression in the Wild

In one of my job interviews for a data scientist position, I was given a home assignment I'd like to share with you. The interviewer sent me a CSV file con

Finding Tiny Faces 解讀

提出三個部分針對影象中小臉的尋找：1。尺度不變性，2影象解析度，3上下文推理。提出了尺度在預訓練深度網路中的作用，提供一種調整網路的方法將有限的尺度推廣到極端的尺度，論證出在大規模的基準人臉資料集上（FDDB和WIDER FACE）上均有較好的結果。尺度不變性幾乎是所有當前識

Create a personal video watch list in the cloud with PHP and the Movie Database API Part 1

Up until a few years ago, I’d turn on the TV and find myself humming Springsteen’s “57 Channels and Nothin’ On” as I flipped through

Create a personal video watchlist in the cloud with PHP and the Movie Database API Part 2

If you have been following along with Part 1, you are half-way through building a web-based PHP application to store your personal wa

Faulty Reward Functions in the Wild

Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we'll explore one failure mode, which is where you mi

Finding Tiny Faces in the Wild with Generative Adversarial Network 論文學習