Combining deep learning model compression techniques

Показать описание

In this article, we evaluate the performance of combining several model compression techniques. The techniques assessed were dark knowledge distillation, pruning, and quantization. We found that in the scenario in which we developed the experiments, classification of chest x-rays, the combination of these three techniques yielded a new model capable of aggregating the individual advantages of each one. In the experiments we used a combination of deep models with 95.05% accuracy, a value higher than that reported in some related works but lower than the state of the art, whose accuracy is 96.39%. The accuracy of the compressed model in turn was 90.86%, a small loss compared to the gain obtained from the reduction, in bytes, in relation to the size of the original model. The size has been reduced from 841MB to 40KB, which opens up the possibility for using deep models in edge computing applications.

José Vitor Santos Silva

Рекомендации по теме

Combining deep learning model compression techniques

Combining deep learning model compression techniques

Combining deep learning model compression techniques

Quantization vs Pruning vs Distillation: Optimizing NNs for Inference

Model Compression Techniques for Deep Neural Networks | CS365

2.1 Challenges for TinyML (Part D) - ML Model Compression

VCT: A Video Compression Transformer: George Toederici

DeepSketch: A New Machine Learning-Based Reference Search Technique for Delta Compression - FAST&apo...

Parallel Compression Explained In 40 Seconds 💪

NVIDIA & Mistral AI's Mistral-NeMo-Minitron 8B: The Future of AI Efficiency and Accuracy

ML5G Challenge tutorial #4: Machine Learning Model Optimization and Compression

What is Deep Learning Model Compression?

Lecture 14 - Distributed Training and Gradient Compression (Part II) | MIT 6.S965

The Knowledge Within: Methods for Data-Free Model Compression

Xailient's Sabina Pokhrel Gives an Introduction to DNN Model Compression Techniques (Preview)

DeepCompression in a Nutshell

Reliable and Interpretable Artificial Intelligence -- Lecture 11 (Combining Deep Learning and Logic)

Towards Efficient Model Compression via Learned Global Ranking

Lecture 19 - Efficient Video Understanding and Generative Models | MIT 6.S965

Shield: Fast, Practical Defense and Vaccination for Deep Learning using JPEG Compression

[ICML 2024] LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging

Use this before your drum buss compression🥁

Lecture 14 - Distributed Training and Gradient Compression (Part II) | MIT 6.S965

WWDC23: Use Core ML Tools for machine learning model compression | Apple

MobiSys 2018 - On-Demand Deep Model Compression for Mobile Devices: