Skip to content

c3se/tar2zip

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

tar2zip

This is simple CLI utility that converts tar files to zip files. Uncompressed zip files are better for random access than tar archives, especially compared to compressed tar archives. This makes uncompressed zipfiles very useful for distributed filesystems where FileIO is usually a limiting factor, like what is common on HPC systems.

It aims for two main features

  1. Environment agnostic. For a given tar file the created zip file should be identical even when run on different systems.
  2. Streaming. The unpacked archive should not need to be stored in full as intermediate step (either on disk or memory).

There are some additional features from using Python standard libraries:

  • Automatic detection of tar compression
  • Python as only hard dependency

Alternatives

There are a number of alternative solutions, but they typically only fulfill one of the two main features.

Installation

  1. Get the source (git clone or download archive and unpack)
  2. pip install .

Usage

tar2zip --help

Known limitations

  • Only deals with regular files in the tar archive. Empty directories are ignored and symlinks throws an error.
  • Duplicate files in tar archives throws an error.

About

Simple Python-based CLI tool to convert tar archive to uncompressed zip archives.

Resources

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages