Sorry, you need to enable JavaScript to visit this website.

Predictive Coding For Lossless Dataset Compression

Citation Author(s):
Madeleine Barowsky, Alexander Mariona, Flavio P. Calmon
Submitted by:
Madeleine Barowsky
Last updated:
23 June 2021 - 11:40am
Document Type:
Document Year:
Presenters Name:
Madeleine Barowsky
Paper Code:



Lossless compression of datasets is a problem of significant theoretical and practical interest. It appears naturally in the task of storing, sending, or archiving large collections of information for scientific research. We can greatly improve encoding bitrate if we allow the compression of the original dataset to decompress to a permutation of the data. We prove the equivalence of dataset compression to compressing a permutation-invariant structure of the data and implement such a scheme via predictive coding. We benchmark our compression procedure against state-of-the-art compression utilities on the popular machine-learning datasets MNIST and CIFAR-10 and outperform for multiple parameter sets.

0 users have voted:

Dataset Files

Dataset Compression ICASSP poster.pdf