Scalable and modularized RTL compilation of Convolutional Neural Networks onto FPGA
Authors: Yufei Ma, Yu Cao, Jae-sun Seo, Naveen Suda
Despite their popularity, deploying Convolutional Neural Networks (CNNs) on portable systems remains challenging due to large data volumes, intensive computation, and frequent memory access. Although previous FPGA acceleration schemes generated by high-level synthesis tools (e.g., HLS, OpenCL) have allowed for fast design optimization, hardware inefficiency still exists when allocating FPGA resources to maximize parallelism and throughput. A direct hardware-level design (i.e., RTL) can improve efficiency and achieve greater acceleration, but it requires an in-depth understanding of both the algorithm structure and the FPGA system architecture. In this work, we present a scalable solution that integrates the flexibility of high-level synthesis with the finer-grained optimization of an RTL implementation. The cornerstone is a compiler that analyzes the CNN structure and parameters, and automatically generates a set of modular and scalable computing primitives that can accelerate various deep learning algorithms.
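The abstract's compile-from-parameters idea can be illustrated with a minimal sketch: a pass that reads per-layer CNN parameters and emits parameterized Verilog instantiations of a generic convolution primitive. All names here (`ConvLayer`, `emit_conv_rtl`, `conv_engine`, `NUM_PE`) are hypothetical illustrations, not the authors' actual tool or module interface.

```python
from dataclasses import dataclass

@dataclass
class ConvLayer:
    """Per-layer parameters the compiler would extract from the CNN model."""
    name: str
    in_channels: int
    out_channels: int
    kernel: int   # square kernel size
    stride: int

def emit_conv_rtl(layer: ConvLayer, pe_parallelism: int) -> str:
    """Emit a Verilog instantiation of a generic convolution primitive,
    scaled by the number of parallel processing elements (PEs)."""
    return (
        f"conv_engine #(\n"
        f"    .IN_CH({layer.in_channels}),\n"
        f"    .OUT_CH({layer.out_channels}),\n"
        f"    .K({layer.kernel}),\n"
        f"    .STRIDE({layer.stride}),\n"
        f"    .NUM_PE({pe_parallelism})\n"
        f") u_{layer.name} (.clk(clk), .rst(rst));"
    )

# AlexNet-like layer shapes, used only as example inputs.
layers = [
    ConvLayer("conv1", 3, 96, 11, 4),
    ConvLayer("conv2", 96, 256, 5, 1),
]
rtl = "\n\n".join(emit_conv_rtl(l, pe_parallelism=64) for l in layers)
print(rtl)
```

In this sketch the same RTL primitive (`conv_engine`) serves every layer; only its parameters change, which is what makes the generated hardware both modular and scalable across different networks.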