The Full Wiki

More info on File archiver

File archiver: Wikis


Note: Many of our articles have direct quotes from sources you can cite, within the Wikipedia article! This article doesn't yet, but we're working on it! See more info or our list of citable articles.


From Wikipedia, the free encyclopedia

A file archiver is a computer program that combines a number of files together into one archive file, or a series of archive files, for easier transportation or storage. Many file archivers employ Archive formats that provide lossless data compression to reduce the size of the archive which is often useful for transferring a large number of individual files over a high latency network like the Internet.

The most basic archivers just take a list of files and concatenate their contents sequentially into the archive. In addition the archive must also contain some information about at least the names and lengths of the originals, so that proper reconstruction is possible. Most archivers also store metadata about a file that the operating system provides, such as timestamps, ownership and access control.

The process of making an archive file is called archiving or packing. Reconstructing the original files from the archive is termed unarchiving, unpacking or extracting.




Unix Archiver Tools

Unlike integrated archival and compression tools like PKZIP, Winzip, and WinRAR, the Unix tools ar, tar, cpio (for "archiver", "tape archiver" and "copy in/out" respectively) act as archivers but not compressors. Users of the Unix tools typically add compression by compressing the result of packing (and uncompressing before unpacking), most often using the gzip or bzip2 programs. Modern tar programs can automatically invoke a (de)compression program, giving the appearance that tar itself handles compression and decompression.

This approach has two advantages:

  • It follows the Unix toolbox concept that each program should accomplish a single, well-done task, as opposed to attempting to accomplish everything with one tool. As compression technology progresses, users may use different compression programs without having to modify or abandon their archiver.
  • The archival program can potentially detect redundancy across two or more archived files, and need not store redundant data. An archiver compressing each archived file in isolation usually cannot exploit inter-file redundancies, because compression tends to hide redundancy.

In practice, compression with tar is usually done on the entire archive, and the second advantage listed above is not obtained. This type of compression - compressing the archive itself - has three disadvantages as compared with compressing within the archive:

  • Extracting one file requires decompressing all the files that are before the file in the archive. This may take many minutes for a large archive.
  • Modification is even more inconvenient than extraction - just changing a single character of one of the archived files will typically require that the entire archive be uncompressed, updated, and then recompressed.
  • It's impossible or more difficult to take advantage of inter-file redundancy.

See also

External links


Got something to say? Make a comment.
Your name
Your email address