A Block Minifloat Representation for Training Deep Neural Networks
Training Deep Neural Networks (DNNs) with high efficiency can be difficult to achieve with native floating point representations and commercially available hardware. Specialized arithmetic with custom acceleration offers perhaps the most promising alternative. Ongoing research is trending towards narrow floating point representations, called minifloats, that pack more operations into a given silicon area and consume less power. In this paper, we introduce Block Minifloat (BM), a new spectrum of minifloat formats capable of training DNNs end-to-end with only 4-8 bit weight, activation and gradient tensors. While standard floating point representations have two degrees of freedom, the exponent and the mantissa, BM exposes the exponent bias as an additional field for optimization. Crucially, this enables training with fewer exponent bits, yielding dense integer-like hardware for fused multiply-add (FMA) operations. For ResNet trained on ImageNet, 6-bit BM achieves almost no degradation in floating point accuracy with FMA units that are $4.1\times$ ($23.9\times$) smaller and consume $2.3\times$ ($16.1\times$) less energy than FP8 (FP32). Furthermore, our 8-bit BM format matches floating point accuracy while delivering higher computational density and faster expected training times.
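As a rough illustration of the idea (not the authors' implementation), the sketch below quantizes a tensor block to a minifloat grid with a shared exponent bias derived from the block's maximum magnitude. The function name `bm_quantize` and the parameters `e_bits` and `m_bits` are hypothetical; rounding mode and subnormal handling are simplifying assumptions.

```python
# Minimal sketch of block minifloat (BM) quantization, assuming:
#  - each block shares one exponent bias chosen from its maximum magnitude,
#  - each element is a (sign, exponent, mantissa) minifloat with `e_bits`
#    exponent bits and `m_bits` mantissa bits,
#  - round-to-nearest on the mantissa.
import numpy as np

def bm_quantize(block, e_bits=2, m_bits=3):
    """Quantize a block to a shared-bias minifloat grid and dequantize back."""
    max_exp = (1 << e_bits) - 1                      # largest biased exponent code
    # Shared exponent bias: align the block's largest value with the top of the range.
    bias = int(np.floor(np.log2(np.max(np.abs(block)) + 1e-30))) - max_exp
    sign = np.sign(block)
    mag = np.abs(block)
    # Per-element exponent, clamped to the representable (biased) range;
    # values below the bias fall into a subnormal-like region (exp code 0).
    exp = np.clip(np.floor(np.log2(mag + 1e-30)) - bias, 0, max_exp)
    scale = 2.0 ** (exp + bias)
    # Round the significand to m_bits fractional bits.
    mant = np.round(mag / scale * (1 << m_bits)) / (1 << m_bits)
    return sign * mant * scale

x = np.random.randn(64).astype(np.float32)
xq = bm_quantize(x, e_bits=2, m_bits=3)              # 6-bit BM-like layout (1+2+3)
print(np.max(np.abs(x - xq)))                        # worst-case quantization error
```

Because the bias is shared across the block, each element needs only a few exponent bits, which is what lets the FMA datapath collapse toward integer-like hardware as described above.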