APPLYING DEEP LEARNING TO ANSWER SELECTION: A STUDY AND AN OPEN TASK: Paper Reading Notes

阿新 • Published: 2019-02-03

Paper: APPLYING DEEP LEARNING TO ANSWER SELECTION: A STUDY AND AN OPEN TASK
The authors are from the IBM Watson team.

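Approach

The paper recasts question answering as a text matching and text selection problem. Given a question q and a set of candidate answers A, the goal is to select from A the answer a that best fits q.

The question q is scored for relevance against every answer a in A, and the answer with the highest score is selected.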
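The selection step can be sketched as follows. This is a minimal illustration, not the paper's code: it assumes the question and every candidate have already been encoded as fixed-length vectors and that relevance is the cosine similarity between them. The function names `cosine` and `select_answer` and the toy vectors are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_u * norm_v)


def select_answer(q_vec, candidate_vecs):
    """Return (index, score) of the candidate most similar to the question."""
    scores = [cosine(q_vec, a_vec) for a_vec in candidate_vecs]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores[best]


# Toy vectors (assumed): the second candidate points almost the same way as q.
q = [1.0, 0.0]
candidates = [[0.0, 1.0], [2.0, 0.1], [-1.0, 0.0]]
best_index, best_score = select_answer(q, candidates)
```

Because cosine similarity ignores vector length, the second candidate wins here even though its norm differs from that of q.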

The model cannot answer questions that require reasoning.

Model Architecture

[Figure: model architecture diagram]
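HL is a non-linear transformation g(W*x+b), P is max pooling, and T is a tanh activation. The question Q and the answer A each pass through the model, producing two vectors. The cosine similarity of the two vectors is then computed.

In the model, Q and A share the parameters of the HL transformation and of the CNN.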
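As a rough sketch of the pipeline described above, the code below runs a question and an answer through one shared encoder (HL, then a 1-D convolution, then max pooling P, then tanh T) and compares the results with cosine similarity. Several details are assumptions for illustration only: g is taken to be tanh, the dimensions and filter width are arbitrary, weights are random rather than trained, and all names (`make_params`, `encode`, and so on) are hypothetical. Each input must contain at least `width` tokens.

```python
import math
import random


def make_params(embed_dim, hidden_dim, num_filters, width, seed=0):
    """Random (untrained) parameters, shared by the question and answer sides."""
    rng = random.Random(seed)

    def mat(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

    return {
        "W": mat(hidden_dim, embed_dim),
        "b": [0.0] * hidden_dim,
        "filters": [mat(width, hidden_dim) for _ in range(num_filters)],
        "width": width,
    }


def hidden_layer(params, x):
    """HL: g(W*x + b) applied to one token vector, with g assumed to be tanh."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(params["W"], params["b"])]


def encode(params, token_vecs):
    """HL -> convolution over token windows -> max pooling (P) -> tanh (T)."""
    h = [hidden_layer(params, x) for x in token_vecs]
    width = params["width"]
    pooled = []
    for filt in params["filters"]:
        # Response of this filter at every window of `width` consecutive tokens.
        responses = [
            sum(filt[k][d] * h[t + k][d]
                for k in range(width) for d in range(len(h[0])))
            for t in range(len(h) - width + 1)
        ]
        pooled.append(max(responses))  # max pooling over positions
    return [math.tanh(p) for p in pooled]


def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


params = make_params(embed_dim=4, hidden_dim=5, num_filters=3, width=2)
rng = random.Random(1)
question = [[rng.uniform(-1, 1) for _ in range(4)] for _ in range(6)]  # 6 tokens
answer = [[rng.uniform(-1, 1) for _ in range(4)] for _ in range(9)]    # 9 tokens
q_vec = encode(params, question)  # same `params` for both sides
a_vec = encode(params, answer)
score = cosine(q_vec, a_vec)
```

Max pooling is what lets a 6-token question and a 9-token answer map to vectors of the same length (one entry per filter), so they can be compared directly.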

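Model Training

Training minimizes a ranking loss. Concretely:
Each training sample consists of a question Q, a correct answer A+, and an incorrect answer A-. The cosine similarities cos(Q, A+) and cos(Q, A-) are computed. If cos(Q, A+) - cos(Q, A-) < m, where m is a margin threshold, the model has not ranked A+ far enough above A-, so the weights are updated. If cos(Q, A+) - cos(Q, A-) >= m, no update is needed; a different A- is sampled instead, and this repeats until cos(Q, A+) - cos(Q, A-) < m.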
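The update condition above matches a hinge-style ranking loss, max(0, m - cos(Q, A+) + cos(Q, A-)), which is positive exactly when cos(Q, A+) - cos(Q, A-) < m. A minimal sketch, with the function name `ranking_loss` and the margin value 0.1 chosen only for illustration:

```python
def ranking_loss(cos_pos, cos_neg, margin):
    """Hinge ranking loss: zero once A+ beats A- by at least `margin`."""
    return max(0.0, margin - cos_pos + cos_neg)


m = 0.1  # illustrative margin, not a value taken from the paper

# A+ is far ahead of A-: the margin is satisfied, so no update is needed.
satisfied = ranking_loss(cos_pos=0.9, cos_neg=0.2, margin=m)

# A+ is only 0.05 ahead of A-: the margin is violated, so the loss is positive.
violated = ranking_loss(cos_pos=0.5, cos_neg=0.45, margin=m)
```

A loss of zero produces a zero gradient, which is why samples that already satisfy the margin contribute nothing to the weight update.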

To reduce computation time, a maximum number of A- resampling attempts must be set; the paper uses 50.
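The capped resampling loop might look like the sketch below. As a simplification, it assumes the similarities cos(Q, A-) for a pool of negatives are already available in a list; in real training each would be computed by the model when the negative is drawn. The function name `pick_violating_negative` and the toy scores are hypothetical.

```python
import random


def pick_violating_negative(cos_pos, negative_scores, margin, max_tries=50, rng=None):
    """Draw negatives at random until one violates the margin.

    Returns the index of a negative with cos_pos - cos_neg < margin,
    or None if no such negative is drawn within `max_tries` attempts.
    """
    rng = rng or random.Random()
    for _ in range(max_tries):
        idx = rng.randrange(len(negative_scores))
        if cos_pos - negative_scores[idx] < margin:
            return idx  # this (Q, A+, A-) triple triggers a weight update
    return None  # give up on this sample and move on


# Only the last negative (0.75) is within the margin of cos(Q, A+) = 0.8.
found = pick_violating_negative(0.8, [0.1, 0.2, 0.75], margin=0.1,
                                rng=random.Random(0))

# No negative violates the margin, so the loop stops after 50 tries.
skipped = pick_violating_negative(0.8, [0.1, 0.2], margin=0.1,
                                  rng=random.Random(0))
```

The cap bounds the cost of a sample whose correct answer already outranks nearly every negative, where the search for a violating A- would otherwise run for a long time.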

Model Implementation
