
Fear the REAPER: A System for Automatic Multi-Document Summarization with Reinforcement Learning

Cody Rioux, Sadid A. Hasan, Yllias Chali

##Abstract

  • Achieve the largest coverage of the documents' content.
  • Concentrate distributed information into hidden units layer by layer.
  • The whole deep architecture is fine-tuned by minimizing the information loss of reconstruction validation.
  • According to the concentrated information, dynamic programming is used to seek the most informative set of sentences as the summary.
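The last bullet mentions dynamic programming for picking the most informative sentence set. As a rough illustration only, here is a minimal knapsack-style DP that selects sentences under a word budget; the per-sentence informativeness scores, word counts, and budget are hypothetical stand-ins, not the formulation used in the paper.

```python
from typing import List, Tuple

def select_sentences(scores: List[float], lengths: List[int], budget: int) -> List[int]:
    """Knapsack-style DP: choose the sentence subset with the highest total
    informativeness score without exceeding the word budget."""
    best: List[Tuple[float, List[int]]] = [(0.0, []) for _ in range(budget + 1)]
    for i, (score, length) in enumerate(zip(scores, lengths)):
        # Iterate budgets backwards so each sentence is used at most once.
        for b in range(budget, length - 1, -1):
            candidate = best[b - length][0] + score
            if candidate > best[b][0]:
                best[b] = (candidate, best[b - length][1] + [i])
    return best[budget][1]

# Toy usage: three sentences with scores and word counts, and a 20-word budget.
print(select_sentences([3.0, 2.0, 2.5], [12, 8, 10], 20))  # -> [0, 1]
```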
##Related Work
  • We explore the use of SARSA, which is a derivative of TD(λ) that models the action space in addition to the state space modelled by TD(λ). Furthermore, we explore the use of an algorithm that is not based on temporal difference methods, but instead on policy iteration techniques.
  • REAPER (Relatedness-focused Extractive Automatic summary Preparation Exploiting Reinforcement learning)
##Motivation
TD(λ) is relatively old as far as reinforcement learning (RL) algorithms are concerned, and the optimal ILP did not outperform ASRL using the same reward function.
Reinforcement learning therefore still has a lot of room for improvement here.
Query-focused summarization has received widespread attention.
The effect of sentence compression is not explored any further.
##Model
  • TD(λ)
    Temporal difference (TD) learning is a prediction-based machine learning method. It is mainly used for reinforcement learning problems and has been described as "a combination of Monte Carlo ideas and dynamic programming (DP) ideas".[1] TD resembles Monte Carlo methods in that it learns by sampling the environment according to some policy, and it is related to dynamic programming techniques in that it approximates its current estimate from previously learned estimates (known as bootstrapping). TD learning algorithms are also related to the temporal difference model of animal learning.[2]
    (Source: temporal difference methods, Wikipedia. A minimal tabular update sketch follows after this list.)
  • Approximate Policy Iteration
    Approximate policy iteration (API) follows a different paradigm: it iteratively improves the policy of a Markov decision process until the policy converges. (A tabular policy-iteration sketch follows after this list.)
  • SARSA
    When choosing the next step, Q-learning looks for the best action to take (the one with the largest Q value), whereas SARSA uses the action that the same policy actually takes at the next step; in both cases the estimate for the previous step is then updated, which is how learning takes place.
    (Reference: comparison of on-policy SARSA and off-policy Q-learning. A minimal SARSA update sketch follows after this list.)
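To make the TD(λ) item above concrete, here is a minimal tabular TD(λ) value update with accumulating eligibility traces. The state representation, rewards, and hyper-parameters below are generic placeholders, not REAPER's actual feature space or reward function.

```python
from collections import defaultdict

def td_lambda_episode(episode, V, alpha=0.1, gamma=1.0, lam=0.9):
    """Run one episode of tabular TD(lambda) with accumulating eligibility traces.

    episode: list of (state, reward, next_state) transitions.
    V: defaultdict(float) mapping state -> estimated value, updated in place.
    """
    traces = defaultdict(float)
    for state, reward, next_state in episode:
        delta = reward + gamma * V[next_state] - V[state]  # one-step TD error
        traces[state] += 1.0                                # bump the eligibility of the visited state
        for s in list(traces):
            V[s] += alpha * delta * traces[s]               # spread the error over recently visited states
            traces[s] *= gamma * lam                        # decay all traces
    return V

# Toy usage: a 3-step episode over string-labelled states ending in a terminal state.
V = defaultdict(float)
td_lambda_episode([("s0", 0.0, "s1"), ("s1", 0.0, "s2"), ("s2", 1.0, "end")], V)
print(dict(V))
```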
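For the Approximate Policy Iteration item, the classic tabular policy iteration loop (evaluation followed by greedy improvement, repeated until the policy stops changing) conveys the paradigm. The toy MDP below is an assumption for illustration; the paper's API variant works with approximation and sampled rollouts rather than a fully known model.

```python
def policy_iteration(states, actions, P, R, gamma=0.9, theta=1e-6):
    """Tabular policy iteration on a fully known MDP.

    P[s][a]: list of (probability, next_state) pairs; R[s][a]: immediate reward.
    """
    V = {s: 0.0 for s in states}
    policy = {s: actions[0] for s in states}
    while True:
        # Policy evaluation: apply the Bellman expectation update until convergence.
        while True:
            delta = 0.0
            for s in states:
                v = R[s][policy[s]] + gamma * sum(p * V[s2] for p, s2 in P[s][policy[s]])
                delta = max(delta, abs(v - V[s]))
                V[s] = v
            if delta < theta:
                break
        # Policy improvement: act greedily with respect to the evaluated values.
        stable = True
        for s in states:
            best = max(actions, key=lambda a: R[s][a] + gamma * sum(p * V[s2] for p, s2 in P[s][a]))
            if best != policy[s]:
                policy[s], stable = best, False
        if stable:
            return policy, V

# Toy 2-state MDP: "go" jumps to s1 for reward 1, "stay" keeps the current state for reward 0.
S, A = ["s0", "s1"], ["stay", "go"]
P = {s: {"stay": [(1.0, s)], "go": [(1.0, "s1")]} for s in S}
R = {s: {"stay": 0.0, "go": 1.0} for s in S}
print(policy_iteration(S, A, P, R))
```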
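And for the SARSA item, a minimal on-policy tabular update over state-action pairs. The epsilon-greedy selection, the `env_step` interface, and the tabular Q dictionary are illustrative assumptions rather than the paper's summary-construction state/action space.

```python
import random
from collections import defaultdict

def sarsa_episode(env_step, start_state, actions, Q,
                  alpha=0.1, gamma=1.0, epsilon=0.1, max_steps=100):
    """Run one episode of tabular SARSA.

    env_step(state, action) -> (reward, next_state, done)
    Q: defaultdict(float) keyed by (state, action), updated in place.
    """
    def choose(state):
        # Epsilon-greedy action selection over the current Q estimates.
        if random.random() < epsilon:
            return random.choice(actions)
        return max(actions, key=lambda a: Q[(state, a)])

    state = start_state
    action = choose(state)
    for _ in range(max_steps):
        reward, next_state, done = env_step(state, action)
        next_action = choose(next_state)
        # On-policy target: uses the action actually taken next (Q-learning would use the max instead).
        target = reward + gamma * (0.0 if done else Q[(next_state, next_action)])
        Q[(state, action)] += alpha * (target - Q[(state, action)])
        if done:
            break
        state, action = next_state, next_action
    return Q
```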
##Experiment
  • Feature space: depends on the presence of top bigrams rather than on tf*idf words (see the feature sketch after this list).
  • Reward function: based on the n-gram co-occurrence score metric and the longest-common-subsequence recall metric (see the reward sketch after this list).
  1. Immediate Rewards
  2. Query Focused Rewards
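The feature-space bullet says the feature vector depends on the presence of top bigrams rather than tf*idf words. A rough sketch of extracting the most frequent document bigrams and checking which of them a candidate summary covers; the whitespace tokenization and the cutoff of 100 bigrams are assumptions, not the paper's exact preprocessing.

```python
from collections import Counter
from typing import List, Tuple

def top_bigrams(documents: List[str], k: int = 100) -> List[Tuple[str, str]]:
    """Most frequent word bigrams across the document cluster."""
    counts = Counter()
    for doc in documents:
        tokens = doc.lower().split()
        counts.update(zip(tokens, tokens[1:]))
    return [bigram for bigram, _ in counts.most_common(k)]

def bigram_presence_features(summary: str, bigrams: List[Tuple[str, str]]) -> List[int]:
    """Binary feature vector: 1 if a top bigram appears in the candidate summary, else 0."""
    tokens = summary.lower().split()
    present = set(zip(tokens, tokens[1:]))
    return [1 if bigram in present else 0 for bigram in bigrams]

# Toy usage on two short "documents" and one candidate summary.
docs = ["the agent builds the summary from the documents", "the agent selects sentences"]
features = bigram_presence_features("the agent selects the summary", top_bigrams(docs, k=5))
print(features)
```

The reward bullet references the n-gram co-occurrence score and the longest-common-subsequence recall metric, i.e. ROUGE-N-style and ROUGE-L-style recall against a reference summary. A simplified, hedged version of both on whitespace tokens; the real metrics involve stemming, stopword handling, and multiple references.

```python
from collections import Counter

def ngram_recall(candidate: str, reference: str, n: int = 2) -> float:
    """ROUGE-N-style recall: clipped n-gram overlap divided by the reference n-gram count."""
    def ngrams(text):
        tokens = text.lower().split()
        return Counter(zip(*[tokens[i:] for i in range(n)]))
    cand, ref = ngrams(candidate), ngrams(reference)
    if not ref:
        return 0.0
    overlap = sum(min(cand[g], count) for g, count in ref.items())
    return overlap / sum(ref.values())

def lcs_recall(candidate: str, reference: str) -> float:
    """ROUGE-L-style recall: longest common subsequence length over the reference length."""
    a, b = candidate.lower().split(), reference.lower().split()
    if not b:
        return 0.0
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, wa in enumerate(a, 1):
        for j, wb in enumerate(b, 1):
            dp[i][j] = dp[i - 1][j - 1] + 1 if wa == wb else max(dp[i - 1][j], dp[i][j - 1])
    return dp[len(a)][len(b)] / len(b)

# Toy usage: a candidate summary scored against a single reference.
print(ngram_recall("the cat sat on the mat", "the cat is on the mat"))  # bigram recall
print(lcs_recall("the cat sat on the mat", "the cat is on the mat"))
```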
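In the query-focused setting, a natural (assumed) extension of the sketch above is to mix the reference-based recall with a term-overlap score between the summary and the query, which is one common way such rewards are built; the paper's exact weighting is not reproduced here.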