Transformer-Attention is all you need

阿新 • • 發佈：2022-05-27

Attention（注意力機制）
圖片展示的Encoder-Decoder框架沒有體現“注意力模型”，可以把它看做是注意力不集中分心模型。因為在生成目標句子的單詞時，不論生成哪個單詞，它們使用的輸入句子的語義編碼C都是一樣的，沒有任何區別。而語義編碼C是由原句子中的每個單詞經過Encoder編碼產生的，這意味著原句子中任意單詞對生成某個目標單詞來說影響力都是相同的，這就是模型沒有體現出注意力的緣由。

計算重要程度e常用的有以下三種方式:

計算Encoder的序列h與Decoder的序列h的餘弦相似度.
在1的基礎上，乘上一個Wa，Wa是需要學習的引數，從學習到Encoder和Decoder的隱藏的打分e。

設計一個前饋神經網路，前饋神經網路的輸入是Encoder和Decoder的兩個隱藏狀態，Va、Wa都是需要學習的引數。

再將e使用softmax進行歸一化就得到權重分數α

將得分分別除以一個特定數值8（K向量的維度的平方根，通常K向量的維度是64）這能讓梯度更加穩定

多頭注意力機制
作用：

第一個方面，他擴充套件了模型關注不同位置的能力，這對翻譯一下句子特別有用，因為我們想知道“it”是指代的哪個單詞。
第二個方面，他給了自注意力層多個“表示子空間”。對於多頭自注意力機制，我們不止有一組Q/K/V權重矩陣，而是有多組（論文中使用8組），所以每個編碼器/解碼器使用8個“頭”（可以理解為8個互不干擾自的注意力機制運算），每一組的Q/K/V都不相同。然後，得到8個不同的權重矩陣Z，每個權重矩陣被用來將輸入向量投射到不同的表示子空間。
參考資料：

十分鐘理解Transformer
NLP中的RNN、Seq2Seq與attention注意力機制

Transformer-Attention is all you need

Attention（注意力機制）圖片展示的Encoder-Decoder框架沒有體現“注意力模型”，可以把它看做是注意力不集中分心模型。因為在生成目標句子的單詞時，不論生成哪個單詞，它們使用的輸入句子的語義編碼C都是一樣的，

Attention Is All You Need

目錄概主要內容Positional Encodingauto_regressive額外的細節程式碼 Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A. N., and Kaiser L. Attention is all you need. In Advances in Neural

paper 4：Attention is all you need

原博連結:論文解讀:Attention is All you need - 知乎 (zhihu.com) Attention用於計算“相關程度”。

筆記：讀Attention Is All You Need

筆記：Attention Is All You Need 作者：Ashish Vaswani et al.,NIPS 2017. 目錄 Motivation Model Attention

Attension Is All You Need

attention機制將整個句子作為輸入，從中抽取有用的資訊。每個輸出都跟整個句子優化，輸出的值為輸入的句子的詞向量的一個加權求和值。

One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL草讀

One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL（增強學習，針對小樣本魯棒性場景）

執行react專案，npm run start/build, 報錯 There might be a problem with the project dependency tree. It is likely not a bug in Create React App, but something you need to fix locally.

如題：這個問題困擾了我半天，網上搜索各種解決方法，都沒能解決，最後仔細讀一遍原因才發現問題很簡單，就是版本不一致

新增MySq出現The ‘InnoDB‘ feature is disabled； you need MySQL built with ‘InnoDB‘ to have it working

在使用navicat建立資料庫的時候，報錯提示：The ‘InnoDB’ feature is disabled; you need MySQL built with ‘InnoDB’ to have it working，自己取巧解決了這個問題，來這裡分享一下。

django.core.exceptions.ImproperlyConfigured: mysqlclient 1.3.13 or newer is required; you have 0.9.2的最佳處理方法，親測可用

django.core.exceptions.ImproperlyConfigured: mysqlclient 1.3.13 or newer is required; you have 0.9.3.遷移檔案時問題

(env) D:\\python_learn\\meiduo_project\\meiduo_mall>python manage.py makemigrations Traceback (most recent call last):

（資料遷移老問題）django.core.exceptions.ImproperlyConfigured: mysqlclient 1.3.13 or newer is required; you have 0.9.2

I、將你的Django降低到2.14以下即可：這個不用想，就要用最新的 II、升級的mysql客戶端版本至更高：電腦同時執行的還有php等其他語言，懶得折騰

ACM International Collegiate Programming Contest, Egyptian Collegiate Programming Contest (ECPC 2015) G. It is all about wisdom (二分,單源最短路)

題意:有\\(n\\)個點,\\(m\\)條邊,只有當你的智力值大於這條邊的\\(w\\)才能走,問在花費不超過\\(k\\)的情況下,從\\(1\\)走到\\(n\\)的所需的最小智力值.

laradock下mysql You need to specify one of MYSQL_ROOT_PASSWORD, MYSQL_ALLOW_EMPTY_PASSWORD and MYS...

上圖異常報錯 mysql You need to specify one of MYSQL_ROOT_PASSWORD, MYSQL_ALLOW_EMPTY_PASSWORD and MYSQL_RANDOM_ROOT_PASSWORD

ERROR 1419 (HY000) at line 9: You do not have the SUPER privilege and binary logging is enabled (you might want to use the less safe log_bin_trust_function_creators variable)

報錯原因在將函式或觸發器匯入MySQL資料庫時，會出現以下錯誤：“您沒有SUPER特權，並且啟用了二進位制日誌記錄（您*可能*想要使用不太安全的log_bin_trust_function_creators變數）”。

Transformer-Attention is all you need

Transformer-Attention is all you need

Attention Is All You Need

paper 4：Attention is all you need

筆記：讀Attention Is All You Need

Attension Is All You Need

One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL草讀

執行react專案，npm run start/build, 報錯 There might be a problem with the project dependency tree. It is likely not a bug in Create React App, but something you need to fix locally.

新增MySq出現The ‘InnoDB‘ feature is disabled； you need MySQL built with ‘InnoDB‘ to have it working

django.core.exceptions.ImproperlyConfigured: mysqlclient 1.3.13 or newer is required; you have 0.9.2的最佳處理方法，親測可用

django.core.exceptions.ImproperlyConfigured: mysqlclient 1.3.13 or newer is required; you have 0.9.3.遷移檔案時問題

（資料遷移老問題）django.core.exceptions.ImproperlyConfigured: mysqlclient 1.3.13 or newer is required; you have 0.9.2

ACM International Collegiate Programming Contest, Egyptian Collegiate Programming Contest (ECPC 2015) G. It is all about wisdom (二分,單源最短路)

laradock下mysql You need to specify one of MYSQL_ROOT_PASSWORD, MYSQL_ALLOW_EMPTY_PASSWORD and MYS...

ERROR 1419 (HY000) at line 9: You do not have the SUPER privilege and binary logging is enabled (you might want to use the less safe log_bin_trust_function_creators variable)

React Hooks: everything you need to know! 🚀（譯）

《The Matrix Calculus You Need For Deep Learning》讀書筆記

django.core.exceptions.ImproperlyConfigured: mysqlclient 1.4.0 or newer is required; you have 0.9.3.

CentOS 7.3 安裝 Redis 報錯“You need tcl 8.5 or newer in order to run the Redis test”

區分MT中常用的解碼策略——Decoding Strategies that You Need to Know for Response Generation

WARNING: You are using pip version 20.2.3； however, version 20.2.4 is available. You should consider

Transformer-Attention is all you need

相關推薦