WARC (web archive file format) package

Supported features:

  • Parse and read .warc files (basic, low-level API).
  • Write .warc files, with optional per-record compression and offset tracking.
  • Serialize CDXJ records (an index of the .warc file) from the tracked record offsets.
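
The relationship between per-record compression, offset tracking, and CDXJ indexing can be illustrated with a minimal sketch. It is written in Python with the standard library only and does not use this package's API; all function names (build_record, append_record, simple_surt, cdxj_line) and the file name demo.warc.gz are hypothetical, and the SURT key is a simplified approximation of real URL canonicalization.

```python
import gzip
import io
import json
import uuid
from urllib.parse import urlsplit


def build_record(target_uri, payload, warc_date, record_type="resource"):
    """Return one uncompressed WARC/1.0 record as bytes (hypothetical helper)."""
    headers = [
        "WARC/1.0",
        f"WARC-Type: {record_type}",
        f"WARC-Record-ID: <urn:uuid:{uuid.uuid4()}>",
        f"WARC-Date: {warc_date}",
        f"WARC-Target-URI: {target_uri}",
        "Content-Type: application/octet-stream",
        f"Content-Length: {len(payload)}",
    ]
    # Header lines end with CRLF; a blank line separates headers from the block.
    head = "\r\n".join(headers).encode("utf-8") + b"\r\n\r\n"
    # Every record is terminated by two CRLF pairs.
    return head + payload + b"\r\n\r\n"


def append_record(stream, record):
    """Write the record as its own gzip member and return (offset, length)."""
    offset = stream.tell()
    compressed = gzip.compress(record)
    stream.write(compressed)
    return offset, len(compressed)


def simple_surt(url):
    """Simplified SURT-like key: reversed host labels, then the path (assumption)."""
    parts = urlsplit(url)
    host = ",".join(reversed((parts.hostname or "").split(".")))
    return f"{host}){parts.path or '/'}"


def cdxj_line(url, warc_date, offset, length, filename):
    """Format one CDXJ line: key, 14-digit timestamp, JSON block."""
    timestamp = "".join(ch for ch in warc_date if ch.isdigit())[:14]
    fields = {
        "url": url,
        "offset": str(offset),
        "length": str(length),
        "filename": filename,
    }
    return f"{simple_surt(url)} {timestamp} {json.dumps(fields)}"


# Demo: write two records into an in-memory file and index them.
out = io.BytesIO()
date = "2024-01-02T03:04:05Z"
index = []
spans = []
for url, body in [("http://example.com/", b"hello"), ("http://example.com/a", b"world")]:
    off, length = append_record(out, build_record(url, body, date))
    spans.append((off, length))
    index.append(cdxj_line(url, date, off, length, "demo.warc.gz"))
```

Because each record is compressed as a separate gzip member, a reader can seek to the offset stored in a CDXJ line and decompress only that record, without scanning the rest of the file.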

Libraries

warc