PyTorch: Custom Forward and Backward Functions
阿新 • Published: 2021-01-12
torch.autograd.Function
- Given random `x`, `y`, `w1`, `w2`, we predict `y` from the input `x` with the model `y_pred = ReLU(x · w1) · w2`, trained by gradient descent to minimize the squared Euclidean distance between `y_pred` and `y`.
- We redefine ReLU as a custom `torch.autograd.Function`, implementing both its forward pass and its backward pass (the gradients it must produce are hand-derived in the sketch below).
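Before the full program, here is a minimal self-contained sketch (not from the original post; the sizes and seed are made up for illustration) of the chain rule that `loss.backward()` evaluates for this model, with the hand-derived gradients checked against autograd:

```python
import torch

torch.manual_seed(0)
x = torch.randn(4, 6)
y = torch.randn(4, 3)
w1 = torch.randn(6, 5, requires_grad=True)
w2 = torch.randn(5, 3, requires_grad=True)

# Forward pass (built-in ReLU here; the custom MyReLU below computes the same thing).
h = x.mm(w1)             # pre-activation
h_relu = h.clamp(min=0)  # ReLU
y_pred = h_relu.mm(w2)
loss = (y_pred - y).pow(2).sum()
loss.backward()          # autograd's gradients, for comparison

# The same gradients by hand, via the chain rule.
with torch.no_grad():
    grad_y_pred = 2.0 * (y_pred - y)      # dL/dy_pred
    grad_w2 = h_relu.t().mm(grad_y_pred)  # dL/dw2
    grad_h = grad_y_pred.mm(w2.t())       # dL/dh_relu
    grad_h[h < 0] = 0                     # ReLU backward: zero where input < 0
    grad_w1 = x.t().mm(grad_h)            # dL/dw1

print(torch.allclose(grad_w1, w1.grad, atol=1e-5),
      torch.allclose(grad_w2, w2.grad, atol=1e-5))  # True True
```

The `grad_h[h < 0] = 0` line is exactly what `MyReLU.backward` has to implement below: pass the incoming gradient through where the input was positive and zero it elsewhere.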
```python
import torch


class MyReLU(torch.autograd.Function):
    """
    We can implement our own custom autograd Functions by subclassing
    torch.autograd.Function and implementing the forward and backward
    passes which operate on Tensors.
    """

    @staticmethod
    def forward(ctx, input):
        """
        In the forward pass we receive a Tensor containing the input and
        return a Tensor containing the output. ctx is a context object that
        can be used to stash information for backward computation. You can
        cache arbitrary objects for use in the backward pass using the
        ctx.save_for_backward method.
        """
        ctx.save_for_backward(input)
        return input.clamp(min=0)

    @staticmethod
    def backward(ctx, grad_output):
        """
        In the backward pass we receive a Tensor containing the gradient of
        the loss with respect to the output, and we need to compute the
        gradient of the loss with respect to the input.
        """
        input, = ctx.saved_tensors
        grad_input = grad_output.clone()
        grad_input[input < 0] = 0
        return grad_input


dtype = torch.float
device = torch.device("cpu")
# device = torch.device("cuda:0")  # Uncomment this to run on GPU
# torch.backends.cuda.matmul.allow_tf32 = False  # Uncomment this when running on GPU
# The above line disables TensorFloat32. This is a feature that allows
# networks to run at a much faster speed while sacrificing precision.
# Although TensorFloat32 works well on most real models, for our toy model
# in this tutorial, the sacrificed precision causes convergence issues.
# For more information, see:
# https://pytorch.org/docs/stable/notes/cuda.html#tensorfloat-32-tf32-on-ampere-devices

# N is batch size; D_in is input dimension;
# H is hidden dimension; D_out is output dimension.
N, D_in, H, D_out = 64, 1000, 100, 10

# Create random Tensors to hold input and outputs.
x = torch.randn(N, D_in, device=device, dtype=dtype)
y = torch.randn(N, D_out, device=device, dtype=dtype)

# Create random Tensors for weights.
w1 = torch.randn(D_in, H, device=device, dtype=dtype, requires_grad=True)
w2 = torch.randn(H, D_out, device=device, dtype=dtype, requires_grad=True)

lr = 1e-6
relu = MyReLU.apply  # use Function.apply, never instantiate the class directly

for i in range(500):
    # Forward pass: compute predicted y using our custom ReLU.
    y_pred = relu(x.mm(w1)).mm(w2)

    # Compute and print loss.
    loss = (y_pred - y).pow(2).sum()
    if i % 100 == 99:
        print(i, loss.item())

    # Backward pass: autograd calls MyReLU.backward for us.
    loss.backward()

    # Parameters are normally updated with `optim.step()`, which updates the
    # `model.parameters()` registered in `optim`. Since we don't use an
    # optimizer here, we update the weights manually. Note that no new
    # gradients are computed in this block; we only apply the gradients
    # that have already been computed.
    with torch.no_grad():
        w1 -= lr * w1.grad
        w2 -= lr * w2.grad

        # Manually zero the gradients after updating the weights.
        w1.grad.zero_()
        w2.grad.zero_()
```
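A quick way to validate a custom backward (a sanity check not in the original post) is `torch.autograd.gradcheck`, which compares the analytical gradient returned by `MyReLU.backward` against numerical finite differences. It requires double-precision inputs, and random inputs almost surely avoid ReLU's non-differentiable point at 0:

```python
# Numerically verify MyReLU.backward (assumes the MyReLU class defined above).
inp = torch.randn(20, dtype=torch.double, requires_grad=True)
print(torch.autograd.gradcheck(MyReLU.apply, (inp,), eps=1e-6, atol=1e-4))  # True
```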
Output:
```
99 952.6715087890625
199 6.376166820526123
299 0.06997707486152649
399 0.0012868450721725821
499 0.00012174161383882165
```
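The comment in the training loop notes that parameters are normally updated via `optim.step()`. For comparison, here is a minimal sketch (assuming the same `x`, `y`, `w1`, `w2`, and `relu` as above) of the loop rewritten around `torch.optim.SGD`; plain tensors with `requires_grad=True` can be passed to an optimizer directly:

```python
# Hypothetical variant of the loop above, letting the optimizer do the update.
optimizer = torch.optim.SGD([w1, w2], lr=1e-6)

for i in range(500):
    y_pred = relu(x.mm(w1)).mm(w2)
    loss = (y_pred - y).pow(2).sum()

    optimizer.zero_grad()  # replaces w1.grad.zero_() / w2.grad.zero_()
    loss.backward()
    optimizer.step()       # replaces the manual `w -= lr * w.grad` updates
```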