循环神经网络前向传播

2023年4月8日上午12:33 • 循环神经网络

循环神经网络(Recurrent Neural Networks ，以下简称RNN)是一类输出和模型间有反馈的神经网络，它广泛的用于自然语言处理中的语音识别，手写书别以及机器翻译等领域。在DNN和CNN中，训练样本的输入和输出是比较的确定的。但是有一类问题DNN和CNN不好解决，就是训练样本输入是连续的序列,且序列的长短不一，比如基于时间的序列：一段段连续的语音，一段段连续的手写文字。这些序列比较长，且长度不一，比较难直接的拆分成一个个独立的样本来通过DNN/CNN进行训练。

而对于这类问题，RNN则比较的擅长。RNN假设我们的样本是基于序列的。比如是从序列索引1到序列索引τ的。对于这其中的任意序列索引号t,它对应的输入是对应的样本序列中的x(t)。而模型在序列索引号t位置的隐藏状态h(t)，则由x(t)和在t−1位置的隐藏状态h(t−1)共同决定。在任意序列索引号t，我们也有对应的模型预测输出o(t)。通过预测输出o(t)和训练序列真实输出y(t),以及损失函数L(t)，我们就可以用DNN类似的方法来训练模型，接着用来预测测试序列中的一些位置的输出。

循环神经网络前向传播

下面是利用Tensorflow搭建的RNN前向传播：

import os
import tensorflow as tf
import numpy as np
os.environ['TF_CPP_MIN_LOG_LEVEL']='2'
#定义RNN的参数
X = [1,2]
state = [0.0, 0.0]
w_cell_state = np.asarray([[0.1, 0.2], [0.3, 0.4]])
w_cell_input = np.asarray([0.5, 0.6])
b_cell = np.asarray([0.1, -0.1])
w_output = np.asarray([[1.0], [2.0]])
b_output = 0.1
#执行前向传播过程
for i in range(len(X)):
before_activation = np.dot(state, w_cell_state) + X[i] * w_cell_input + b_cell
state = np.tanh(before_activation)
final_output = np.dot(state, w_output) + b_output
print("before activation: ", before_activation)
print("state: ", state)
print("output: ", final_output)

运算结果：

循环神经网络前向传播

本站文章如无特殊说明，均为本站原创，如若转载，请注明出处：循环神经网络前向传播 - Python技术站

人工智能循环神经网络

赞 (0)

微信扫一扫

微信扫一扫

支付宝扫一扫

支付宝扫一扫

GRU循环神经网络

上一篇 2023年4月8日上午12:33

自然语言处理之循环神经网络

下一篇 2023年4月8日上午12:34

机器学习网址归纳

Machine Learning developers.google tensorflow 人工智能各种技术与算法 ***************机器学习实战**************** by 修行的猫_zq 机器学习实战python3 机器学习实战-python3-github ***************机器学习实战**************…

机器学习 2023年4月13日
000
目标检测

【目标检测大集合】R-FCN、SSD、YOLO2、faster-rcnn和labelImg实验笔记

转自：https://ask.julyedu.com/question/7490 R-FCNpaper:https://arxiv.org/abs/1605.06409作者代码：https://github.com/daijifeng001/R-FCN #matlab版本这里使用python版本的代码：https://github.com/Orpine/py…

2023年4月6日
000
tensorflow1.0 构建神经网络做图片分类

import tensorflow as tf from tensorflow.examples.tutorials.mnist import input_data mnist = input_data.read_data_sets(“MNIST_data”,one_hot=True) def add_layer(inputs,in_size,out_siz…

tensorflow 2023年4月8日
000
Caffe学习一网络参数和自定义网络基于theano的深度卷积神经网络

网络参数 # 测试总数/batchsize test_iter: 100 # 测试间隔 test_interval: 500 # 开始的学习率 base_lr: 0.01 # 冲量单元，用于加速收敛，v(t+1)=momentum*v(t)-lr*grad ; w(t+1)=w(t)+v(t+1） momentum: 0.9 # 权值衰减，用于惩罚项 wei…

Caffe 2023年4月8日
000
PyTorch实例：房价预测

import torch from torch.autograd import Variable # 构造0-100之间的均匀数字作为时间变量x x = Variable(torch.linspace(0,100).type(torch.FloatTensor)) # 时间点上的历史房价数据 rand = Variable(torch.randn(100))…

PyTorch 2023年4月7日
000
tensorflow去掉warning的方法

运行tensorflow程序时，提示： I tensorflow/core/platform/cpu_feature_guard.cc:141] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA 去掉提示的方法： v…

tensorflow 2023年4月8日
000
tensorflow

tensorflow常见问题

1. sess.run() hangs when called / sess.run() get stuck / freeze that ctrl+c can’t kill process 解决： 1 coord = tf.train.Coordinator() 2 threads = tf.train.start_queue_runners(sess=…

2023年4月6日
000
卷积神经网络卷积层后一定要跟激活函数吗？

The reason why neural network is more powerful than linear function is because neural network use the nonlinear function to map the dataset which is difficult to separate to separ…

卷积神经网络 2023年4月8日
000

合作推广

合作推广

返回顶部