Compressing and Randomly Accessing Sequences

Abstract: 

In this paper we consider the problem of storing sequences of symbols in
a compressed format while supporting random access to the symbols without
decompression. Although this problem is well studied when the data is
textual, the sequences we consider are not textual. We argue that
traditional compression methods used in the text algorithms community
(such as compressors targeting $k$-th order empirical entropy) do not
perform as well on such sequential data, and that simpler methods, such
as Huffman-coding the deltas between sequence elements, give better
compression performance. We discuss data structures that target such
measures while allowing random access to sequence elements.
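
To make the idea of Huffman-coding deltas with random access concrete, here is a minimal Python sketch. It is not the data structure from the poster: it assumes a simple fixed-rate sampling scheme in which every `sample_rate`-th element is stored verbatim together with a bit offset, so an access decodes at most `sample_rate - 1` deltas. The names `huffman_code`, `DeltaHuffmanSequence`, and `sample_rate` are hypothetical, and the bit stream is kept as a Python string of `0`/`1` characters for readability rather than packed into machine words.

```python
import heapq
from collections import Counter
from itertools import count


def huffman_code(symbols):
    """Return {symbol: bitstring} for a Huffman code built from symbol frequencies."""
    freq = Counter(symbols)
    if not freq:
        return {}
    if len(freq) == 1:
        # A single distinct symbol still needs a one-bit codeword.
        return {next(iter(freq)): "0"}
    tie = count()  # tie-breaker so the heap never has to compare dicts
    heap = [(f, next(tie), {s: ""}) for s, f in freq.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        f1, _, c1 = heapq.heappop(heap)
        f2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + b for s, b in c1.items()}
        merged.update({s: "1" + b for s, b in c2.items()})
        heapq.heappush(heap, (f1 + f2, next(tie), merged))
    return heap[0][2]


class DeltaHuffmanSequence:
    """Illustrative only: Huffman-coded deltas plus sampled absolute values."""

    def __init__(self, seq, sample_rate=4):
        self.n = len(seq)
        self.k = sample_rate
        # Deltas are taken only inside a block; the first element of each
        # block is stored verbatim as a sample.
        deltas = [seq[i] - seq[i - 1] for i in range(self.n) if i % self.k != 0]
        code = huffman_code(deltas)
        self.decode_table = {bits: d for d, bits in code.items()}
        self.samples = []  # absolute value at the start of each block
        self.offsets = []  # bit position where each block's deltas begin
        chunks, pos = [], 0
        for i in range(self.n):
            if i % self.k == 0:
                self.samples.append(seq[i])
                self.offsets.append(pos)
            else:
                bits = code[seq[i] - seq[i - 1]]
                chunks.append(bits)
                pos += len(bits)
        self.bits = "".join(chunks)

    def access(self, i):
        """Return seq[i] by decoding at most k - 1 deltas from the nearest sample."""
        if not 0 <= i < self.n:
            raise IndexError(i)
        block, steps = divmod(i, self.k)
        value = self.samples[block]
        pos = self.offsets[block]
        for _ in range(steps):
            word = ""
            # The code is prefix-free, so the first match is the codeword.
            while word not in self.decode_table:
                word += self.bits[pos]
                pos += 1
            value += self.decode_table[word]
        return value


seq = [100, 102, 101, 101, 105, 110, 110, 109, 120]
ds = DeltaHuffmanSequence(seq, sample_rate=4)
assert [ds.access(i) for i in range(len(seq))] == seq
```

The sampling rate trades space for time in this sketch: a smaller `sample_rate` stores more verbatim values and offsets but shortens the decoding walk per access.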


Comments

The problem dealt with in this poster seems to be the same as in https://sigport.org/documents/towards-better-compressed-representations,
but here you follow a delta-encoding approach.
Since you said that the theorem shown in the poster is unattractive because $S$ may be very large, I wonder how large $S$ actually is on the datasets used.
We know that $S \le n\sigma$, but it could be much smaller.

Paper Details

Authors: Laith Ali Abdulsahib, Diego Arroyuelo, Rajeev Raman
Submitted On: 21 April 2020 - 10:31am
Short Link:
Type: Poster
Event:
Presenter's Name: Rajeev Raman
Paper Code: DCC-197
Session: Posters
Document Year: 2020
Cite

Document Files

main.pdf


[1] Laith Ali Abdulsahib, Diego Arroyuelo, Rajeev Raman, "Compressing and Randomly Accessing Sequences", IEEE SigPort, 2020. [Online]. Available: http://sigport.org/5109. Accessed: Jul. 07, 2020.