What is Model Compression

« Back to Glossary Index

Pruning neural networks for edge deployment.

Synonyms:
Model optimization, Neural network compression
Defnition:
Model compression reduces model size for faster inference and lower costs.

Variations:
Pruning and quantization techniques

Hello popup window