emmadickson/unwarcit

UNWARCIT: WARC (and WACZ) Unzipping Library

Background

This library provides a command-line interface to unzip WARC and WACZ files.

It builds on the warcio library to read and validate WARC files and on the py-wacz library to validate WACZ files.

Both libraries are provided by Webrecorder.
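
As background on why WACZ files can be "unzipped" at all: a WACZ file is a ZIP archive that stores its WARC files under an archive/ directory. The sketch below uses only the Python standard library to list those WARC members. It is an illustration of the file layout, not unwarcit's actual code; the function name list_warcs_in_wacz is hypothetical.

```python
import io
import zipfile


def list_warcs_in_wacz(wacz_file):
    """Return the names of WARC members stored under archive/ in a WACZ file.

    `wacz_file` may be a path or a binary file-like object.
    Hypothetical helper for illustration; not part of unwarcit.
    """
    with zipfile.ZipFile(wacz_file) as zf:
        return [
            name
            for name in zf.namelist()
            if name.startswith("archive/")
            and name.endswith((".warc", ".warc.gz"))
        ]


# Build a tiny in-memory stand-in for a WACZ file to exercise the helper.
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as zf:
    zf.writestr("datapackage.json", "{}")
    zf.writestr("archive/data.warc.gz", b"")
buffer.seek(0)

print(list_warcs_in_wacz(buffer))
```

Each WARC found this way could then be handed to a WARC reader such as warcio to extract the individual records.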

Setup

Clone the repo, then install the tool by running: python3 setup.py install

You can now run the tool like so: unwarcit metro_capture2.wacz data.warc --output myfolder

Pass one or more WARC or WACZ files, separated by spaces, after the unwarcit command: unwarcit warcfile1.warc warcfile2.warc waczfile.wacz

Configuration Options

Unwarcit currently accepts the following parameters:
      --help                                Show help                            [str]
      --version                             Show version number                  [int]
      --output                              The folder to output the results to  [str]
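
For readers curious how an interface of this shape can be declared, here is a minimal sketch using Python's standard argparse module. It assumes one or more positional input files plus an --output option, as described above; the function name build_parser, the destination name inputs, and the None default for --output are assumptions for illustration, and unwarcit's real parser may differ. argparse adds --help automatically.

```python
import argparse


def build_parser():
    """Build a parser that mirrors the documented command-line shape.

    Hypothetical sketch; not taken from unwarcit's source.
    """
    parser = argparse.ArgumentParser(
        prog="unwarcit",
        description="Unzip WARC and WACZ files.",
    )
    # One or more input files, WARC or WACZ, separated by spaces.
    parser.add_argument(
        "inputs",
        nargs="+",
        help="WARC or WACZ files to unzip",
    )
    # The default of None is an assumption; the README does not state one.
    parser.add_argument(
        "--output",
        default=None,
        help="The folder to output the results to",
    )
    return parser


# Parse the example invocation from the Setup section.
args = build_parser().parse_args(
    ["metro_capture2.wacz", "data.warc", "--output", "myfolder"]
)
print(args.inputs, args.output)
```

Using nargs="+" makes the parser reject an invocation with no input files, which matches the requirement that at least one file follows the command.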
