keras使用多GPU并行训练模型 | keras multi gpu training

2023年4月8日上午2:31 • Keras

本文首发于个人博客https://kezunlin.me/post/95370db7/，欢迎阅读最新内容！

keras multi gpu training

multi_gpu_model

import tensorflow as tf
from keras.applications import Xception
from keras.utils import multi_gpu_model
import numpy as np

G = 8 
batch_size_per_gpu = 32
batch_size = batch_size_per_gpu * G

num_samples = 1000
height = 224
width = 224
num_classes = 1000

# Instantiate the base model (or "template" model).
# We recommend doing this with under a CPU device scope,
# so that the model's weights are hosted on CPU memory.
# Otherwise they may end up hosted on a GPU, which would
# complicate weight sharing.
with tf.device('/cpu:0'):
    model = Xception(weights=None,
                     input_shape=(height, width, 3),
                     classes=num_classes)

# Replicates the model on 8 GPUs.
# This assumes that your machine has 8 available GPUs.
parallel_model = multi_gpu_model(model, gpus=G)
parallel_model.compile(loss='categorical_crossentropy',
                       optimizer='rmsprop')

# Generate dummy data.
x = np.random.random((num_samples, height, width, 3))
y = np.random.random((num_samples, num_classes))

# This `fit` call will be distributed on 8 GPUs.
# Since the batch size is 256, each GPU will process 32 samples.
parallel_model.fit(x, y, epochs=20, batch_size=batch_size)

# Save model via the template model (which shares the same weights):
model.save('my_model.h5')

results

results from Multi-GPU training with Keras, Python, and deep learning on Onepanel.io
To validate this, we trained MiniGoogLeNet on the CIFAR-10 dataset with 4 V100 GPU.

Using a single GPU we were able to obtain 63 second epochs with a total training time of 74m10s.
However, by using multi-GPU training with Keras and Python we decreased training time to 16 second epochs with a total training time of 19m3s.
4x times speedup!

Reference

History

20190910:: created.

Copyright

Post author: kezunlin
Post link: https://kezunlin.me/post/95370db7/
Copyright Notice: All articles in this blog are licensed under CC BY-NC-SA 3.0 unless stating additionally.

本站文章如无特殊说明，均为本站原创，如若转载，请注明出处：keras使用多GPU并行训练模型 | keras multi gpu training - Python技术站

Keras 人工智能

赞 (0)

微信扫一扫

微信扫一扫

支付宝扫一扫

支付宝扫一扫

tf.keras 模型多个输入 tf.data.Dataset

上一篇 2023年4月8日

linux服务器上配置进行kaggle比赛的深度学习tensorflow keras环境详细教程

下一篇 2023年4月8日

Caffe

caffe小问题汇总（持续更新）

PS：所有问题均在caffe-windows下产生 1、为什么AlexNet中，InnerProduct_Layer(fc8)层的输出可以直接作为Accuracy_Layer层的输出？答：首先，我们要搞清楚，全连接层的输出是什么。全连接层的操作其实也是卷积操作，只不过要求卷积核的尺寸与输入进来的FeatureMap相同，因此全连接层输出的向量大小为1*1。…

2023年4月8日
000
keras中loss与val_loss的关系

loss是训练集的损失值，val_loss是测试集的损失值以下是loss与val_loss的变化反映出训练走向的规律总结： train loss 不断下降，test loss不断下降，说明网络仍在学习;（最好的） train loss 不断下降，test loss趋于不变，说明网络过拟合;（max pool或者正则化） train loss 趋于不变，te…

Keras 2023年4月6日
000
tensorflow

TeanorBoard可视化Tensorflow计算图步骤

或者显示No dashboards are active for the current data set.表示路径不对，不是计算图所在的文件夹，或者说没有生成日志文件。 1.写入一段代码 %matplotlib notebook import tensorflow as tf import matplotlib.pyplot as plt import n…

2023年4月8日
000
目标检测

大盘点｜YOLO 系目标检测算法总览

点击上方“3D视觉工坊”，选择“星标” 干货第一时间送达 YOLO目标检测算法诞生于2015年6月，从出生的那一天起就是“高精度、高效率、高实用性”目标检测算法的代名词。在原作者Joseph Redmon博士手中，YOLO经历了三代到YOLOv3，今年初Joseph Redmon宣告退出计算机视觉研究界后，YOLOv4、YOLOv5相继而出，且不论谁是正统…

2023年4月8日
000
Caffe

Caffe 单独测试添加的layer

之前那个博客记录了如何实现一个自己的层，这篇教你如何进行层的调试。首先把你在caffe/src/caffe/layers中你自己层的cpp代码copy到caffe/src/caffe/test中然后改名（因为我看那个目录里面命名都是这样命名的）：接着就按照这篇博客的做：http://www.cnblogs.com/louyihang-loves-bai…

2023年4月8日
000
机器学习笔记：Gradient Descent – 李小宝

机器学习笔记：Gradient Descent 　　最近掉进了Machine Learning的坑里，暑期听完了龙星计划的机器学习课程，走马观花看了一些书。最近找了Stanford的Machine Learning的公开课（http://v.163.com/special/opencourse/machinelearning.html），想系统地学习一遍，而…

机器学习 2023年4月12日
000
机器学习python实战—-线性回归

一、纲要　　线性回归的正规方程解法　　局部加权线性回归二、内容详述　　1、线性回归的正规方程解法　　线性回归是对连续型的数据进行预测。这里讨论的是线性回归的例子，对于非线性回归先不做讨论。这部分内容我们用的是正规方程的解法，理论内容在之前已经解释过了，正规方程为θ = (XT·X)-1·XT·y。值得注意的是这里需要对XT·X求逆矩阵，因此这个方程…

机器学习 2023年4月11日
000
使用keras时input_shape的维度表示问题说明

下面是关于“使用Keras时input_shape的维度表示问题说明”的完整攻略。 input_shape的维度表示在Keras中，input_shape参数用于指定输入数据的形状。它通常用于定义模型的第一层，以便Keras可以自动推断后续层的输入形状。input_shape参数的形式为(batch_size, input_dim)，其中batch_siz…

Keras 2023年5月15日
000

合作推广

合作推广

返回顶部