> how about making a small theme about durilca segmentation
> and seg_file so we can point to it in the future?
Well, here you go:
1. I reformatted and patched some more stuff in seg_file
(got rid of some of its dynamic allocation mostly).
And also added an "unseg_file" utility to reconstruct
the file from segments.
Now its used like this:
2. The seg_file utility is written by Dmitry Shkarin,Code:seg_file.exe acrord32.exe unseg_file acrord32.unp
with original version at http://compression.ru/ds/seg_file.rar.
It splits the file into several "segments", containing data blocks
with similar statistics, while between different segments statistics
(supposedly) don't match.
Its estimation is based on a bytewise frequency model
with simple hashed contexts (only 256 contexts by default, but tunable).
3. This same (or unnoticeably improved maybe) algorithm is used in
Shkarin's durilca compressors http://compression.ru/ds/durilca.rar
for -t1 mode.
Older versions of durilca had a hidden -l option for dumping the
segments, in a way similar to what seg_file does.
However, durilca -t1 -l might generate more segments
due to disasm32 executable filter - the x86-like segments
are processed with it and multiple preprocessed segments are
generated on output.