User Tools

Site Tools


torrentzip

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
torrentzip [2022/01/30 14:35] johnsanctorrentzip [2022/11/20 11:07] (current) johnsanc
Line 4: Line 4:
 This specification is intended to define the implementation of zip as used by the TorrentZip standard. This specification is intended to define the implementation of zip as used by the TorrentZip standard.
  
-===== Archive Format =====+===== Archive format =====
 ==== General format of a torrentzipped .zip file with n files: ==== ==== General format of a torrentzipped .zip file with n files: ====
  
-^Files - Start^+^Archive Start^
 |[local file header 1]| |[local file header 1]|
 |[file data 1]| |[file data 1]|
Line 22: Line 22:
 ^Central Directory - End (EOCD file offset)^ ^Central Directory - End (EOCD file offset)^
 |[end of central directory record]| |[end of central directory record]|
 +^Archive End^
  
  
Line 27: Line 28:
 ^Type ^Attribute ^Value ^Description ^  ^Type ^Attribute ^Value ^Description ^ 
 |UInt32|Local file header signature|(0x04034b50)| | |UInt32|Local file header signature|(0x04034b50)| |
-|UInt16|Version needed to extract|20|File is compressed using Deflate compression| +|UInt16|Version needed to extract|20 or 45|20 = File is compressed using Deflate compression / 45 if this record contains zip64 information
-|UInt16|General purpose bit flag|2|Maximum compression option was used|+|UInt16|General purpose bit flag|2|Maximum compression option was used, bit 11 (0x800) is set for unicode filename|
 |UInt16|Compression method|8|The file is Deflated| |UInt16|Compression method|8|The file is Deflated|
 |UInt16|Last mod file time|48128|11:32 PM| |UInt16|Last mod file time|48128|11:32 PM|
Line 36: Line 37:
 |UInt32|Uncompressed size| |File Uncompressed Size| |UInt32|Uncompressed size| |File Uncompressed Size|
 |UInt16|Filename length| |Filename length| |UInt16|Filename length| |Filename length|
-|UInt16|Extra field length|0|No extra field information|+|UInt16|Extra field length| |Normally 0, Length of Extra field data if zip64 extra field information is included|
 |Byte[]|Filename (variable size)| |Byte array of filename| |Byte[]|Filename (variable size)| |Byte array of filename|
  
-|**ⓘ NOTE:** \\ The default values show are required to have consistent torrentzipped files. Default time/date of 11:32pm 12/24/1996 is the date of the first ever MAME release.|+<WRAP info>The default values show are required to have consistent torrentzipped files. Default time/date of 11:32pm 12/24/1996 is the date of the first ever MAME release.</WRAP>
  
 ==== File data x: ==== ==== File data x: ====
Line 50: Line 51:
 |UInt32|Central file header signature|(0x02014b50)| | |UInt32|Central file header signature|(0x02014b50)| |
 |UInt16|Version made by|0|MS_DOS and OS/2 (FAT/FAT32 file systems)| |UInt16|Version made by|0|MS_DOS and OS/2 (FAT/FAT32 file systems)|
-|UInt16|Version needed to extract|20|File is compressed using Deflate compression| +|UInt16|Version needed to extract|20 or 45|20 = File is compressed using Deflate compression / 45 if this record contains zip64 information
-|UInt16|General purpose bit flag|2|Maximum compression option was used|+|UInt16|General purpose bit flag|2|Maximum compression option was used, bit 11 (0x800) is set for unicode filename|
 |UInt16|Compression method|8|The file is Deflated| |UInt16|Compression method|8|The file is Deflated|
 |UInt16|Last mod file time|48128|11:32 PM| |UInt16|Last mod file time|48128|11:32 PM|
Line 59: Line 60:
 |UInt32|Uncompressed size| |File Uncompressed Size| |UInt32|Uncompressed size| |File Uncompressed Size|
 |UInt16|File name length| |Filename length| |UInt16|File name length| |Filename length|
-|UInt16|Extra field length|0|No extra field information|+|UInt16|Extra field length| |Normally 0, Length of Extra field data if zip64 extra field information is included|
 |UInt16|File comment length|0|No file comment| |UInt16|File comment length|0|No file comment|
 |UInt16|Disk number start|0|Multi disk storage not used so set to disk 0| |UInt16|Disk number start|0|Multi disk storage not used so set to disk 0|
Line 67: Line 68:
 |Byte[]|File name (variable size)| |Byte array of filename| |Byte[]|File name (variable size)| |Byte array of filename|
  
-==== End Of Central Directory: ====+==== End of Central Directory: ====
 ^Type ^Attribute ^Value ^Description ^  ^Type ^Attribute ^Value ^Description ^ 
 |UInt32|End of central dir signature|(0x06054b50)| | |UInt32|End of central dir signature|(0x06054b50)| |
Line 79: Line 80:
 |Byte[22]|.ZIP file comment|TORRENTZIPPED-XXXXXXXX| | |Byte[22]|.ZIP file comment|TORRENTZIPPED-XXXXXXXX| |
  
-|**ⓘ NOTE:** \\ See above 'General format of a torrentzipped .zip file with n files' for SOCD & EOCD |+<WRAP info>See above 'General format of a torrentzipped .zip file with n files' for SOCD & EOCD</WRAP>
  
  
-===== The TorrentZipped Files Comments ===== +===== The TorrentZipped files comment ===== 
-The .ZIP file comments in the End of Central directory is used to check the validity of the torrentzipped file.+The .ZIP file comment in the End of Central Directory is used to check the validity of the torrentzipped file.
  
-The comment must be formatted as the 22 bytes of ''TORRENTZIPPED-XXXXXXXX''. The ''XXXXXXXX'' is the CRC32 of the central directory records stored as hexadecimal upper case text. (the CRC32 of the bytes in the file between SOCD & EOCD)+The comment must be formatted as the 22 bytes of ''TORRENTZIPPED-XXXXXXXX''. The ''XXXXXXXX'' is the CRC32 of the central directory records stored as hexadecimal upper case text. (The CRC32 of the bytes in the file between SOCD & EOCD)
  
 This comment ensures that if any change is made to the files within the zip this checksum will no longer match the byte data in the central directory, and in this way we can check the validity of a torrentzip file. This comment ensures that if any change is made to the files within the zip this checksum will no longer match the byte data in the central directory, and in this way we can check the validity of a torrentzip file.
 +===== Filename Encoding =====
 +The filenames of the compressed files in a zip file are stored in the local header and the central directory as byte arrays. Zip was original build on early IBM PCs, and as such uses [[https://en.wikipedia.org/wiki/Code_page_437|code page 437]] to convert a string to a byte array to store the filenames. With the arrival of unicode multiple different methods where added to the official zip format to permit unicode filenames to be stored in a zip file. Trrntzip format uses the general purpose bit 11 method. So to store a filename in a trrntzip zip file you must first see if the filename can be stored using code page 437, if not then UTF8 encoding should be used in the byte arrays, this is then indicated by setting bit 11 of the General Purpose Bit Flags both in the local header and central directory.  
  
-===== File Order with a TorrentZip =====+ 
 +===== File order with a TorrentZip =====
 For the creation of consistent torrentzipped files, the file order is also very important. Files must be sorted by filename using a lower case sort. For the creation of consistent torrentzipped files, the file order is also very important. Files must be sorted by filename using a lower case sort.
  
Line 97: Line 101:
 This leads to a possible naming inconsistency. The zip file format states: //"All slashes should be forward slashes ‘/’ as opposed to backwards slashes ‘\’"//. So Torrentzip will change all ''\'' characters to ''/''. This must be done before sorting, to ensure that the sort is performed correctly. This leads to a possible naming inconsistency. The zip file format states: //"All slashes should be forward slashes ‘/’ as opposed to backwards slashes ‘\’"//. So Torrentzip will change all ''\'' characters to ''/''. This must be done before sorting, to ensure that the sort is performed correctly.
  
-===== Directory Entries and Empty Directories =====+===== Directory entries and empty directories =====
 A directory entry is stored in a zip by adding a file entry ending in a directory separator character with a zero size and CRC. So directory ''set1'' would be stored as a zero length, zero CRC file called ''set1/''. Some zip programs when adding the previously mentioned file ''set1/test1.rom'' will also add the directory ''set1/'', this creates an inconsistency problem. In this example the ''set1/'' directory entry is unnecessary, as the filename ''set1/test1.rom'' implies the existents of the ''set1/'' directory. To resolve this inconsistency un-needed directories should be removed from the zip, the only needed directory entries are empty directories that are not implied by any file entries. A directory entry is stored in a zip by adding a file entry ending in a directory separator character with a zero size and CRC. So directory ''set1'' would be stored as a zero length, zero CRC file called ''set1/''. Some zip programs when adding the previously mentioned file ''set1/test1.rom'' will also add the directory ''set1/'', this creates an inconsistency problem. In this example the ''set1/'' directory entry is unnecessary, as the filename ''set1/test1.rom'' implies the existents of the ''set1/'' directory. To resolve this inconsistency un-needed directories should be removed from the zip, the only needed directory entries are empty directories that are not implied by any file entries.
  
Line 109: Line 113:
 kept to create the empty directory, as removing it would completely remove the ''set2'' directory. kept to create the empty directory, as removing it would completely remove the ''set2'' directory.
  
-===== Repeat Files =====+===== Repeat files =====
 Another test that could be performed is checking for repeat file entries inside the zip, most zip programs have a hard time handling this and will just ignore this repeat giving the user no way of knowing there is a repeat filename problem. So it would fix another possible inconsistency if torrentzip scanning at least warned about repeat filename being found inside a zip. Another test that could be performed is checking for repeat file entries inside the zip, most zip programs have a hard time handling this and will just ignore this repeat giving the user no way of knowing there is a repeat filename problem. So it would fix another possible inconsistency if torrentzip scanning at least warned about repeat filename being found inside a zip.
  
Line 115: Line 119:
   * [[http://www.pkware.com/documents/casestudies/APPNOTE.TXT|Official ZIP file specification]]    * [[http://www.pkware.com/documents/casestudies/APPNOTE.TXT|Official ZIP file specification]] 
   * [[http://sourceforge.net/projects/trrntzip/|Original TorrentZip source code]]   * [[http://sourceforge.net/projects/trrntzip/|Original TorrentZip source code]]
-  * [[http://zlib.net/|zlib Compression]]+  * [[http://zlib.net/|zlib compression]]
torrentzip.txt · Last modified: 2022/11/20 11:07 by johnsanc