warc-dl

This pipeline extracts data from WARC files on a CPU cluster and streams it to a GPU server for processing.

Install from the command line
$ docker pull ghcr.io/webis-de/warc-dl:master

Recent tagged image versions

  • Published about 2 years ago · Digest
    sha256:d8918cf4938fccd0bdbc2c61ec6f4d35f319094b2d6d2c03a466111912d25c17
    153 Version downloads
  • Published over 2 years ago · Digest
    sha256:d2d563ea19693f6a4385889b1bbbd28807f89e32728b807135fc7575b248e351
    167 Version downloads

Last published: 2 years ago
Issues: 1
Total downloads: 434