Are there any lossless compression methods that can be applied to floating point time-series data, and will significantly outperform, say, writing the data as binary into a file and running it through gzip?
Reduction of precision might be acceptable, but it must happen in a controlled way (i.e. I must be able to set a bound on how many decimal digits are kept).
I am working with some large data files which are series of correlated doubles, describing a function of time (i.e. the values are correlated). I don't generally need the full double precision, but I might need more than float precision.
Since there are specialized lossless methods for images/audio, I was wondering if anything specialized exists for this situation.
Clarification: I am looking for existing practical tools rather than a paper describing how to implement something like this. Something comparable to gzip in speed would be excellent.