Goodreads Browser

A custom crawler and browser interface that provides sort and filter options for the Goodreads database.

Page types supported:

  • Lists
  • Regular search
  • Shelves
  • Tags (global shelves; requires a session cookie ID)

Private profiles cannot be crawled.

Usage

Crawl the site using the following command(s):

npm run start <type> <arg1> <arg2>

npm run start list <id>
npm run start shelf <userId> <shelfName>
npm run start tag <name> <sessionCookieId>
npm run start search <name>
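The argument shapes listed above can be captured in a small helper when scripting several crawls. The sketch below is only an illustration and not part of the project: `ARGS_BY_TYPE` and `buildCrawlCommand` are hypothetical names, and the helper only builds the command as an array of strings without running anything.

```javascript
// Hypothetical helper (not part of this project): maps each supported
// page type to the arguments the README lists for it.
const ARGS_BY_TYPE = {
  list: ["id"],
  shelf: ["userId", "shelfName"],
  tag: ["name", "sessionCookieId"],
  search: ["name"],
};

// Builds the `npm run start <type> <args...>` command as an array of strings.
// Throws if the type is unknown or the argument count does not match.
function buildCrawlCommand(type, ...args) {
  const expected = ARGS_BY_TYPE[type];
  if (!expected) {
    throw new Error(`Unknown page type: ${type}`);
  }
  if (args.length !== expected.length) {
    throw new Error(
      `${type} expects ${expected.length} argument(s): ${expected.join(", ")}`
    );
  }
  return ["npm", "run", "start", type, ...args.map(String)];
}
```

For example, `buildCrawlCommand("shelf", "12345", "to-read")` returns the pieces of `npm run start shelf 12345 to-read`, where `12345` and `to-read` are placeholder values. The resulting array could be handed to Node's `child_process.spawn` as `spawn(cmd[0], cmd.slice(1))`.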

If you have too many database files and would like to merge them, run:

npm run merge <file1> <file2> ...

A new merged database will be written to disk.
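This README does not describe the database file format or how duplicates are handled during a merge. Purely as an illustration of what merging could look like, the sketch below assumes each database is an array of records carrying a hypothetical unique `id` field, and that later files win on conflicts. The actual merge script may behave differently.

```javascript
// Illustrative only: the real file format and merge rules are not documented here.
// Assumes each database is an array of records with a hypothetical unique `id`.
function mergeDatabases(...databases) {
  const byId = new Map();
  for (const records of databases) {
    for (const record of records) {
      // A record from a later database replaces an earlier one with the same id.
      byId.set(record.id, record);
    }
  }
  // Map preserves first-insertion order, so records keep the order
  // in which their ids were first seen.
  return [...byId.values()];
}
```

Keying on a `Map` keeps the merge linear in the total number of records, which matters when many crawl results are combined.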

Browser

Open the browser:

cd gr-search && yarn run dev

This starts the browser on localhost:3000.

Load the generated database files from results/ using the "Database" field (multiple files supported).