Embedded-AI-Report

Wechat ID: NeuroMem

Embedded-AI-Report

关注模型压缩、低比特量化、移动端推理加速优化、部署

Awesome-Emebedded-AI

A curated list of awesome A.I. & Embedded/Mobile-devices resources, tools and more.

Looking for contributors. Submit a pull request if you have something to add :)
Please check the contribution guidelines for info on formatting and writing pull requests.

Device Benchmark

高通骁龙处理器排行榜,强大性能一览无余 | Qualcomm
手机CPU性能天梯图 CPU performance of mobile comparison | mydriver
Qualcomm Adreno GPU Performance as below:

Papers

Classic

[1512.03385] Deep Residual Learning for Image Recognition
[1610.02357] Xception: Deep Learning with Depthwise Separable Convolutions
[1611.05431] ResNeXt: Aggregated Residual Transformations for Deep Neural Networks

Overview

Representation

[1707.09926] A Framework for Super-Resolution of Scalable Video via Sparse Reconstruction of Residual Frames
[1608.01409] Faster CNNs with Direct Sparse Convolutions and Guided Pruning

Structure

[CVPR2017] Squeeze-and-Excitation networks (ILSVRC 2017 winner) at CVPR2017
[1707.06342] ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression
[1707.01083] ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
[1704.04861] MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

[1707.06990] Memory-Efficient Implementation of DenseNets

[1706.03912] SEP-Nets: Small and Effective Pattern Networks

Binarization

[CVPR2017] Local Binary Convolutional Neural Networks [code]

Pruning

Learning to Prune: Exploring the Frontier of Fast and Accurate Parsing [code]

[CVPR'17] Designing Energy-Efficient Convolutional Neural Networks using Energy-Aware Pruning
[ICLR'17] Pruning Filters for Efficient ConvNets
[ICLR'17] Pruning Convolutional Neural Networks for Resource Efficient Inference
[ICLR'17] Soft Weight-Sharing for Neural Network Compression
[ICLR'16] Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
[NIPS'16] Dynamic Network Surgery for Efficient DNNs
[NIPS'15] Learning both Weights and Connections for Efficient Neural Networks

Quantization

[ICML'17] The ZipML Framework for Training Models with End-to-End Low Precision: The Cans, the Cannots, and a Little Bit of Deep Learning
[1412.6115] Compressing Deep Convolutional Networks using Vector Quantization
[CVPR '16] Quantized Convolutional Neural Networks for Mobile Devices
[ICASSP'16] Fixed-Point Performance Analysis of Recurrent Neural Networks
[arXiv'16] Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations
[ICLR'17] Loss-aware Binarization of Deep Networks
[ICLR'17] Towards the Limit of Network Quantization
[CVPR'17] Deep Learning with Low Precision by Half-wave Gaussian Quantization
[1706.02393] ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks

LowRankApproximation

Distillation

Joint Compression

[1707.09102] Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional Network with Bayesian Optimization

Kernel Selection

[1703.09746] Coordinating Filters for Faster Deep Neural Networks
[1606.05316] Learning Infinite-Layer Networks: Without the Kernel Trick

Computation Precison/Resolution

Model Split

[ASPLOS’17] Neurosurgeon: Collaborative intelligence between the cloud and mobile edge
[1705.04630] Forecasting using incomplete models

Others

[1606.05316] Learning Infinite-Layer Networks: Without the Kernel Trick
[1608.02893] Syntactically Informed Text Compression with Recurrent Neural Networks
[1608.05148] Full Resolution Image Compression with Recurrent Neural Networks
[1707.09422] Hyperprofile-based Computation Offloading for Mobile Edge Networks
[1707.09855] Convolution with Logarithmic Filter Groups for Efficient Shallow CNN
[1707.09597] ScanNet: A Fast and Dense Scanning Framework for Metastatic Breast Cancer Detection from Whole-Slide Images
[1604.08772] Towards Conceptual Compression

FrameworkPaper

Experience

Codes

Model Compression

Model Encryption

OpenMined/Syft: Homomorphically Encrypted Deep Learning Library

Model Application

AR

Android

madeye/yolo-android: Quantized Tiny Yolo Demo on Android

iOS

Vulkan

Frameworks & Acceleration Library

Benchmark

baidu-research/DeepBench: Benchmarking Deep Learning operations on different hardware

Convertor

Model convertor. More convertors please refer deep-learning-model-convertor

NervanaSystems/caffe2neon: Tools to convert Caffe models to neon's serialization format

Mobile Video Process Library/Player

Other Toolkit

Data Set

HandNet - A dataset of depth images of hands

Course

This part contains related course, guides and tutorials.

Hardware

GPU

Company

News

2017-08-07

2017-07-24

Name		Name	Last commit message	Last commit date
Latest commit History 164 Commits
device_benchmark		device_benchmark
embedded-ai-report		embedded-ai-report
keynotes		keynotes
LICENSE		LICENSE
README.md		README.md
contributing.md		contributing.md
logo.jpg		logo.jpg
wechat_qrcode.jpg		wechat_qrcode.jpg

License

92ypli/awesome-embedded-ai

Folders and files

Latest commit

History

Repository files navigation

Embedded-AI-Report

Awesome-Emebedded-AI

Contents

Device Benchmark

Papers

Classic

Overview

Representation

Structure

Binarization

Pruning

Quantization

LowRankApproximation

Distillation

Joint Compression

Kernel Selection

Computation Precison/Resolution

Model Split

Others

FrameworkPaper

Experience

Codes

Model Compression

Model Encryption

Model Application

AR

Android

iOS

Vulkan

Frameworks & Acceleration Library

Benchmark

Convertor

Mobile Video Process Library/Player

Other Toolkit

Data Set

Course

Hardware

GPU

Company

News

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages