Skip to content

giuseeFG/TAGMining

Repository files navigation

TAGMining

Third AGIW project.

CLUEWEB USAGE

  • Run warc_parser.py on warc file and set a destination path for new file.
  • install npm libraries
  • Run parser_Clue_Web.js on new warc file generated by warc_parser.py.

COMMON CRAWL USAGE

  • install npm libraries
  • Run parser_Clue_Web.js on new warc file generated by warc_parser.py.

ENJOY!

About

Third AGIW project.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published