《AdaptSegNet：Learning to Adapt Structured Output Space for Semantic Segmentation》論文筆記

阿新 • • 發佈：2020-10-14

參考程式碼：AdaptSegNet

1. 概述

導讀：這篇文章著力於解決模型未見過資料的適應性，一般來講模型對於與訓練集中資料類似的資料表現較好，但是對於未知場景的資料就表現較差了，這也是domain-adaptation需要解決的問題。這篇文章在分割任務下進行了研究，提出在output space（分割softmax輸出）上使用GAN網路去擬合兩種資料（合成數據與真實資料）分佈，此外還提出使用多層GAN監督的形式優化特徵的分佈。

之前的一些domain adaptation的工作是在feature層次上進行的，但是在分割任務中就顯得不是很適合了，這是由於分割任務中的特徵編碼了高維度的形狀/紋理等資訊，因而相當複雜，不易adapt。文章通過觀察已知資料和未知資料的特點，觀察到兩種資料在分割結果上更加具有視覺上的一致性，因而在網路的輸出（output space）上進行domain adaptation。下圖表示的就是這種空間下的相似性：

文中將整個網路劃分成兩個部分：分割網路組成的生成器和判別網路。並提出了兩個分佈擬合策略：

1）使用分割輸出（softmax概率圖）的結果去擬合兩個資料的分佈；
2）使用多層資料（在多個特徵上得到softmax output space）之後再使用GAN去拉近兩個分佈；

2. 方法設計

2.1 網路結構

文章的網路結構見下圖所示：
在這裡插入圖片描述
在上圖中可以看到文章的網路由兩部分組成：分割網路構成的生成器 G G G與判別器 D i D_i Di，輸入的真實影象與合成影象是 I t , I s ∈ R ( H ∗ W ∗ C ) I_t,I_s\in R^{(H*W*C)} It,Is∈R(H∗W∗

C)，之後得到兩個影象的softmax分割概率輸出 P t , P s P_t,P_s Pt,Ps，之後將這兩個概率圖輸入到判別器網路 D i D_i Di拉近這兩個資料的分佈。

2.2 單層GAN結構

判別器的訓練：
通過生成器得到的概率圖為 P t , P s P_t,P_s Pt,Ps，其過程為 P = G ( I ) ∈ R ( H ∗ W ∗ C ) P=G(I)\in R^{(H*W*C)} P=G(I)∈R(H∗W∗C)，那麼擬合這兩個分佈的GAN損失可以描述為：
L d ( P s , P t ) = − ∑ h , w l o g ( D ( P t ) ) + l o g ( D ( P s ) ) L_d(P_s,P_t)=-\sum_{h,w}log(D(P_t))+log(D(P_s))

Ld(Ps,Pt)=−h,w∑log(D(Pt))+log(D(Ps))

生成器的訓練：
此外，還存在合成數據的分割損失：
L s e g ( I s ) = − ∑ h , w ∑ c ∈ C Y s ( h , w , c ) l o g ( P s ( h , w , c ) ) L_{seg}(I_s)=-\sum_{h,w}\sum_{c\in C}Y_s^{(h,w,c)}log(P_s^{(h,w,c)}) Lseg(Is)=−h,w∑c∈C∑Ys(h,w,c)log(Ps(h,w,c))
再加上真實資料在判別網路下的損失：
L a d v ( I t ) = − ∑ h , w l o g ( D ( P t ) ) L_{adv}(I_t)=-\sum_{h,w}log(D(P_t)) Ladv(It)=−h,w∑log(D(Pt))

2.3 多層GAN結構

這裡的多層是在單層分割輸出基礎上使用多層特徵進行分割，之後再在這些分割結果上進行與單層結構類似的損失計算，因而這裡的損失函式可以描述為：
L I s , I t = ∑ i λ s e g i L s e g i ( I s ) + λ a d v i L a d v i ( I t ) L_{I_s,I_t}=\sum_i\lambda_{seg}^iL_{seg}^i(I_s)+\lambda_{adv}^iL_{adv}^i(I_t) LIs,It=i∑λsegiLsegi(Is)+λadviLadvi(It)
整體的優化過程為：
max ⁡ D min ⁡ G L ( I s , I t ) \max_D\min_GL(I_s,I_t) DmaxGminL(Is,It)

2.4 損失函式

文章的損失函式由分割損失與GAN損失兩部分組成，可以其使用的組成形式為：
L ( I s , I t ) = L s e g ( I s ) + λ a d v L a d v ( I t ) L(I_s,I_t)=L_{seg}(I_s)+\lambda_{adv}L_{adv}(I_t) L(Is,It)=Lseg(Is)+λadvLadv(It)

3. 實驗結果

GTA5-CityScapes：
在這裡插入圖片描述
SYNTHIA-CityScapes

《AdaptSegNet：Learning to Adapt Structured Output Space for Semantic Segmentation》論文筆記

參考程式碼：AdaptSegNet 1. 概述導讀：這篇文章著力於解決模型未見過資料的適應性，一般來講模型對於與訓練集中資料類似的資料表現較好，但是對於未知場景的資料就表現較差了，這也是domain-adaptation需

K8S-kubelet報錯： failed to get c ontainer info for "/system.slice/docker.service": unknown container "/system.slice/docker.service"

K8S版本：1.17.11 今天檢視kubelet日誌的時候，發信一堆報錯：檢視kubelet日誌：]# journalctl -f -u kubelet

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation 論文筆記

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation PointNet：三維分類與分割中點集的深度學習論文地址：https://arxiv.org/pdf/1612.00593.pdf 程式碼地址：https://github

FCN論文解讀：FCN-Fully Convolutional Networks for Semantic Segmentation

FCN原文作為語義分割領域的開山之作，對其進行研究和閱讀幾乎是入門語義分割領域的基礎，這篇部落格整理了自己閱讀該論文的一些心得感悟和收穫。

論文筆記3：SegFormer Simple and Efficient Design for Semantic Segmentation with Transformers

論文地址：https://arxiv.org/abs/2105.15203 1 引言文章提出了一種基於transformer的語義分割網路，不同於ViT模型，SegFormer使用一種分層特徵表示的方法，每個transformer層的輸出特徵尺寸逐層遞減，通過這種方式

論文筆記4：Segmenter: Transformer for Semantic Segmentation

論文地址：https://arxiv.org/abs/2105.05633 1 引言影象語義分割在單個影象塊級別通常表現得比較模糊，文章提出了一種基於tansformer的語義分割模型，可以在網路傳播過程中建模全域性上下文資訊。其網路結構是在V

A Unified Deep Model of Learning from both Data and Queries for Cardinality Estimation 論文解讀（SIGMOD 2021 UAE）

A Unified Deep Model of Learning from both Data and Queries for Cardinality Estimation 論文解讀（SIGMOD 2021）

app登陸到離線的時候，報錯：資料格式轉換錯誤n I/O error has occurred while writing a response message entity to the container output stream. org.glassfish.jersey.server.internal.process.MappableException

1. tomcat報錯 EVERE: An I/O error has occurred while writing a response message entity to the container output stream.

L2M-GAN: Learning to Manipulate Latent Space Semantics for Facial Attribute Editing閱讀筆記

L2M-GAN: Learning to Manipulate Latent Space Semantics for Facial Attribute Editing 2021 CVPR　　L2M-GAN: Learning To Manipulate Latent Space Semantics for Facial Attribute Editing (thecvf.com)

flyway maven plugin無法正常使用： Unable to connect to the database. Configure the url, user and password!

在pom.xml依賴中新增configuration內容，如下： <plugin> <groupId>org.flywaydb</groupId>

SLF4J ：Failed to load class "org.slf4j.impl.StaticLoggerBinder".

錯誤提示 SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder". SLF4J: Defaulting to no-operation (NOP) logger implementation

原創: druid配置及解決：Failed to bind properties under 'spring.datasource' to javax.sql.DataSource

如何沒有新增依賴log4依賴包會報錯：Failed to bind properties under \'spring.datasource\' to javax.sql.DataSource

通過Maven匯出war包時報錯：Failed to execute goal org.apache.maven.plugins:maven-war-plugin:2.2:war (default-war) on project

通過Maven匯出war包時報錯：Failed to execute goal org.apache.maven.plugins:maven-war-plugin:2.2:war (default-war) on project Ocr: Error assembling WAR: webxml attribute is required (or pre-existing WEB

關於表格的文字生成：Table-to-Text

我研究了3個例子：北京大學的wiki2bio、谷歌的ToTTo、微軟的WIKITABLETEXT 北京大學的wiki2bio

debian_linux系統_訪問真實環境rancher_證書問題相關_https相關_使用kubectl命令列檢視資源時報錯：Unable to connect to the server: x509: certificate signed by unknown authority

　　前言：近日在windows10上使用debian_linux虛擬系統使用kubectl命令列工具，訪問真實環境rancher時，無法訪問資源，丟擲異常：Unable to connect to the server: x509: certificate signed by unknown authority。

SpringBoot 啟動報錯：Failed to configure a DataSource

今天在開啟專案時遇到下面的問題，於是開啟baidu，進行一頓搜尋，發現解決方法都差不多，都是類似於在springBoot啟動類上加上

Error-IIS-ASP.NET：Unable to make the session state request to the session state server. Please ensure that the ASP.NET State service is started and that the client and server ports are the same.

ylbtech-Error-IIS-ASP.NET：Unable to make the session state request to the session state server. Please ensure that the ASP.NET State service is started and that the client and server ports are the

解決https負載報錯：unable to find valid certification path to requested target

一.錯誤原因 Java在訪問SSL加密的網站時，需要從JDK的KeyStore 裡面去查詢相對應得可信證書，如果不能從預設或者指定的KeyStore 中找到可信證書，就會報這個錯誤。另外，Java所使用的證書倉庫並不是Windows系統自帶的

Zookeeper超級使用者使用案例：How to remove ACL protected ZK Node

Problem There are time we would want to remove a ZK node in a secure cluster which is ACL protected. Something as below ACLs

《AdaptSegNet：Learning to Adapt Structured Output Space for Semantic Segmentation》論文筆記

1. 概述

2. 方法設計

2.1 網路結構

2.2 單層GAN結構

2.3 多層GAN結構

2.4 損失函式

3. 實驗結果

相關推薦