Keras Layer 的 call(x) 和 input_shape

2023年4月8日上午9:11 • Keras

今天用Keras编程的时候发现一个问题，

···
input_layer = K.layers.Input(shape=(10,))

x = K.layers.Dense(20)(input_layer)
x = K.layers.Dense(20)(x)
···
以上写法是可行的，但是以下写法却不行

L = K.layers.Dense(20)
y = L(input_layer)
y = L(y)

前两个表达式正常，到了第3个表达式y=L(y)的时候就报input_shape错误。百思不得其解，按照Python编程的原则，一切皆对象

L = K.layers.Dense(20)
L(x)

和

K.layers.Dense(20)(x)

有何不同？

一番尝试，发现奥妙在于Keras Layers的设计。看一下官方的对于自定义Layer的说明，

call(x): this is where the layer's logic lives. Unless you want your layer to support masking, you only have to care about the first argument passed to call: the input tensor.

也就是说，当layer的callmethod被调用时，layer才实际存在，此时将传入input_shape。如果call没有被调用，此时layer中并没有input_shape的信息。

举例说明，

L = K.layers.Dense(20)
L.input_shape

此时编译器报错`AttributeError: The layer has never been called and thus has no defined input shape. 再看以下代码，

L = K.layers.Dense(20)
y = L(input_layer)
L.input_shape

此时编译器不报错，输出(None, 10)。照理说第二段代码并没有对L做任何操作，只是将L(input_layer)赋给了y，但是此时L确获得了input_shape这个参数。

结合call(x)的定义，一开始遇到的问题也就明白了。表达式y = L(input_layer)调用了L的callmethod，此时L这个layer才正式被初始化，其input_shape也根据传入的input_layer被赋值。因此，此时的L其实已经跟表达式K.layers.Dense(20)不一样了，后者未被调用，input_shape不存在。

以下这段代码之所以报input_shape错误，就是因为y = L(input_layer)使得L的input_shape被初始化为(10,)。因次当第三个表达式y=L(y)中y被传入L时，由于y的shape并不是(10,)，而是(20,)，与L中input_shape的值不一致，编译器报错。

L = K.layers.Dense(20)
y = L(input_layer)
y = L(y)

本站文章如无特殊说明，均为本站原创，如若转载，请注明出处：Keras Layer 的 call(x) 和 input_shape - Python技术站

Keras 人工智能

赞 (0)

微信扫一扫

微信扫一扫

支付宝扫一扫

支付宝扫一扫

【Python】keras神经网络识别mnist

上一篇 2023年4月8日上午9:10

Keras 构建DNN 对用户名检测判断是否为非法用户名（从数据预处理到模型在线预测）

下一篇 2023年4月8日上午9:11

积性函数求和：构造狄利克雷卷积将值域限定于powerful number

前情提要：$O(n^{0.75}/\log n)$ 时间的积性函数求和。当 $n \ge 10^{12}$ 的时候需要十几秒出解。如果积性函数的性质更好，那么我们可以更快地求和。假设积性函数 $f$ 和易于求和的积性函数 $g$ 满足 $f(p)=g(p)$，且 $f=g*h$, $g*h$ 表示 $g, h$ 的狄利克雷卷积，也就是 $f(n)=\su…

卷积神经网络 2023年4月7日
000
卷积交织/解交织C++程序

交织基数为M,交织深度为I的卷积交织/解交织程序，延时为I*(I-1)*M. 1 #include <iostream> 2 #include <vector> 3 #include <list> 4 #include <cstdint> 5 6 using namespace std; 7 8 vector&…

卷积神经网络 2023年4月7日
000
循环神经网络

循环神经网络RNN基本介绍

这篇文章很多内容是参考：http://www.wildml.com/2015/09/recurrent-neural-networks-tutorial-part-1-introduction-to-rnns/，在这篇文章中，加入了一些新的内容与一些自己的理解。循环神经网络(Recurrent Neural Networks，RNNs)已经在众多…

2023年4月8日
000
Caffe

caffe笔记之例程学习（三）

原文链接：caffe.berkeleyvision.org/tutorial/layers.html 创建caffe模型，首先要在protocol buffer 定义文件(prototxt)中定义结构。在caffe环境中，图像的明显特征是其空间结构。主要layers 主要功能主要类型其他卷积层提取特征 CONVOLUTION 学习率、数据维度池…

2023年4月5日
000
程序员初学机器学习的四种方式【转】

学习机器学习有很多方法，大多数人选择从理论开始。如果你是个程序员，那么你已经掌握了把问题拆分成相应组成部分及设计小项目原型的能力，这些能力能帮助你学习新的技术、类库和方法。这些对任何一个职业程序员来说都是重要的能力，现在它们也能用在初学机器学习上。要想有效地学习机器学习你必须学习相关理论，但是你可以利用你的兴趣及对知识的渴望，来激励你从实际例子学起，然后…

机器学习 2023年4月15日
000
目标检测

目标检测论文: Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample

Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample SelectionPDF: https://arxiv.org/pdf/1912.02424.pdfPyTorch: https://github.com/shanglian…

2023年4月8日
000
Caffe

ubuntu18+caffe+cuda

昨天安装caffe，因为用的是cuda10.2，遇到各种问题，最终也没有安装成功。使用cmake配置成功、生成成功、编译的时候报错。 1 /usr/local/cuda/include/cuda_runtime_api.h:9580:60: error: ‘cudaGraphExec_t’ was not declared in this scope 2 e…

2023年4月5日
000
卷积+池化+卷积+池化+全连接2

#!/usr/bin/env pythonimport tensorflow as tffrom tensorflow.examples.tutorials.mnist import input_data# In[2]:mnist = input_data.read_data_sets(‘MNIST_data’, one_hot=True)# 每个批次的大小…

卷积神经网络 2023年4月8日
000

合作推广

合作推广

返回顶部