knowledge-database (beta)

Current group: comp.compression

PAQ6 AnalogModel question

PAQ6 AnalogModel question  
Jim Leonard
 Re: PAQ6 AnalogModel question  
Matt Mahoney
From:Jim Leonard
Subject:PAQ6 AnalogModel question
Date:19 Jan 2005 12:51:48 -0800
I'm familiar with most lossless audio compression schemes, the basic
operation being the encoding of the output of a predictor. But in
browsing through the PAQ6 code I found the following description of the
AnalogModel context submodel:

"Contexts drop the low order bits, and include the position within the
file modulo 2, 3, or 4."

Forgive my inexperience in this area, but why are the low order bits
dropped? I understand this makes the high order bits easier to
compress, but the low order bits are mostly noise if taken out of
context, right? Can anyone explain why this is done?
From:Matt Mahoney
Subject:Re: PAQ6 AnalogModel question
Date:20 Jan 2005 07:55:05 -0800

Jim Leonard wrote:
> I'm familiar with most lossless audio compression schemes, the basic
> operation being the encoding of the output of a predictor. But in
> browsing through the PAQ6 code I found the following description of
the
> AnalogModel context submodel:
>
> "Contexts drop the low order bits, and include the position within
the
> file modulo 2, 3, or 4."
>
> Forgive my inexperience in this area, but why are the low order bits
> dropped? I understand this makes the high order bits easier to
> compress, but the low order bits are mostly noise if taken out of
> context, right? Can anyone explain why this is done?

Dropping the low order bits uses the approximate value as the context.
The model can be trained faster when there are fewer distinct contexts
and generalize to similar contexts. The exact contexts are still
available in the other models (CharModel, RepeatModel, SparseModel),
and these are blended into the prediction.

-- Matt Mahoney
   

Copyright © 2006 knowledge-database   -   All rights reserved