Results 1 to 3 of 3

Thread: Microsoft Office save XML zipped files

  1. #1
    Member BetaTester's Avatar
    Join Date
    Dec 2010
    Location
    Brazil
    Posts
    43
    Thanks
    0
    Thanked 3 Times in 3 Posts

    Exclamation Microsoft Office save XML zipped files

    Most office backups, are compounds by Microsoft Office files type, as docx and xlsx.

    These files are zipped XML, a standard save similar to the OpenOffice.

    For example, testing 7zip "PPMD 1024MB word32", Rar "Best", and FreeArc "-mx" in my doc.

    LTT2 expand-----148.195.235 bytes
    LTT2.docx----------4.322.991 bytes
    LTT2.docx.7z-------4.311.135 bytes
    LTT2.docx.arc------4.304.844 bytes
    LTT2.docx.rar-------4.288.586 bytes
    LTT2expand.rar----2.224.025 bytes
    LTT2expand.7z-----1.128.672 bytes
    LTT2expand.arc------867.204 bytes

    Expanding file and the recompressing, can be increased compression.

    In the case of backups made ​​in offices, consisting of thousands of Microsoft Office files, this procedure greatly reduce the size of backups.
    Last edited by BetaTester; 13th July 2012 at 21:06.

  2. #2
    Member
    Join Date
    Jun 2009
    Location
    Puerto Rico
    Posts
    164
    Thanks
    62
    Thanked 13 Times in 9 Posts
    I did a research on this a long time ago and I came to the same conclusion as you. In my case, I unzipped the docx and pptx and then compressed them with PAQ. Then, to restore the file I would extract the PAQ file then compress it with ZIP file.

  3. #3
    Member m^2's Avatar
    Join Date
    Sep 2008
    Location
    Ślůnsk, PL
    Posts
    1,612
    Thanks
    30
    Thanked 65 Times in 47 Posts
    It's a well known fact. I did a detailed benchmark some time ago and came to conclusion that 4x4 was faster and way stronger than the default, though my dataset was less than perfect. I think the benchmark should be available somewhere on this forum.

Similar Threads

  1. Microsoft Visual Studio 11
    By encode in forum The Off-Topic Lounge
    Replies: 1
    Last Post: 28th April 2012, 14:40
  2. Need help to migrate libbsc.com from Office Live to github
    By Gribok in forum The Off-Topic Lounge
    Replies: 3
    Last Post: 23rd April 2012, 01:29
  3. (Open) Office help needed
    By m^2 in forum The Off-Topic Lounge
    Replies: 2
    Last Post: 19th August 2011, 21:36
  4. A Microsoft study on deduplication
    By m^2 in forum Data Compression
    Replies: 1
    Last Post: 5th May 2011, 18:15
  5. XWRT 3.2 (former XML-WRT) with LPAQ6 support released
    By Bulat Ziganshin in forum Forum Archive
    Replies: 2
    Last Post: 3rd November 2007, 00:51

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •