Device Placement Optimization with Reinforcement Learning

阿新 • • 發佈：2018-06-19

規模一起專家 AR 運行 CP ear device 過去

摘要

過去許多年的神經網絡計算量規模擴大了許多，現在的應對方法是使用異質的CPU和GPU混合在一起組成的系統。問題是決定現在系統中哪個神經模型放置到哪個節點上是由專家根據其經驗和直覺來決定的。在本篇論文中，我們提出一個方法來優化TensorFlow的計算圖譜，方法的關鍵是使用一個序列模型來預測TensorFlow圖譜應該使用設備。預測的放置方法的運行時間是我們模型的反饋值。得到的結果顯示在Inception-V3的圖片分類算法，RNN LSTM和語言模型上相比於傳統的手動調整方法運行的時間更短。

2. 介紹

Device Placement Optimization with Reinforcement Learning

規模一起專家 AR 運行 CP ear device 過去摘要過去許多年的神經網絡計算量規模擴大了許多，現在的應對方法是使用異質的CPU和GPU混合在一起組成的系統。問題是決定現在系統中哪個神經模型放置到哪個節點上是由專家根據其經驗和直覺來決定的。在本篇論文中，我們

CS294-112 深度強化學習秋季學期（伯克利）NO.19 Guest lecture: Igor Mordatch (Optimization and Reinforcement Learning in Multi-Agent Settings)

nbsp setting TP for agent image learn ctu Go

Device Placement Optimization with Reinforcement Learning

摘要

2. 介紹

Device Placement Optimization with Reinforcement Learning

CS294-112 深度強化學習秋季學期（伯克利）NO.19 Guest lecture: Igor Mordatch (Optimization and Reinforcement Learning in Multi-Agent Settings)

論文筆記系列-Neural Architecture Search With Reinforcement Learning

Fear the REAPER A System for Automatic Multi-Document Summarization with Reinforcement Learning

網路結構搜尋（1）—— NAS（Neural architecture search with reinforcement learning）論文筆記

Playing Atari with Deep Reinforcement Learning

解讀continuous control with deep reinforcement learning（DDPG）

Deep Reinforcement Learning with Double Q-learning

Playing Atari with Deep Reinforcement Learning論文解讀

Learning to Communicate with Deep Multi-Agent Reinforcement Learning

Reinforcement Learning: Playing Doom with PyTorch

Reinforcement Learning with Q tables

Reinforcement Learning with Prediction

17-11-22 Deep Reinforcement Learning-based Image Captioning with Embedding Reward論文隨筆

NOTE:Deep Reinforcement Learning with a Natural Language Action Space

Continuous control with deep reinforcement learning

Reinforcement Learning Q-learning 算法學習-2

增強學習Reinforcement Learning經典算法梳理3：TD方法

how to study reinforcement learning(answered by Sergio Valcarcel Macua on Quora)

看DeepMind如何用Reinforcement learning玩遊戲

Device Placement Optimization with Reinforcement Learning

摘要

2. 介紹

相關推薦