Skip to content
@extractus

Extractus

A set of extractor tools for devs

Welcome to Extractus

We develop and share open source tools for collecting media content.

You can use one or combination of them to build news sites, create automated content systems for marketing campaign or gather dataset for NLP projects, etc.

Here is an example based on our news engine.

If you have any idea, or want more features, or face any problem while using them, please create issue.

In the future, we would like to add more dedicated tools for extracting links, tweets, audios, videos, products, crypto/stock prices.

We have not much time. This is self-training and non-profit side project. Contributions and collaborators are always welcomed 🙂


Pinned Loading

  1. article-extractor article-extractor Public

    To extract main article from given URL with Node.js

    JavaScript 1.6k 140

  2. oembed-extractor oembed-extractor Public

    Extract oEmbed data from given webpage

    JavaScript 109 43

  3. feed-extractor feed-extractor Public

    Simplest way to read & normalize RSS/ATOM/JSON feed data

    JavaScript 167 33

Repositories

Showing 5 of 5 repositories
  • article-extractor Public

    To extract main article from given URL with Node.js

    extractus/article-extractor’s past year of commit activity
    JavaScript 1,611 MIT 140 6 1 Updated Nov 9, 2024
  • extractus Public
    extractus/extractus’s past year of commit activity
    HTML 11 MIT 0 4 (1 issue needs help) 0 Updated Jul 25, 2024
  • oembed-extractor Public

    Extract oEmbed data from given webpage

    extractus/oembed-extractor’s past year of commit activity
    JavaScript 109 MIT 43 1 0 Updated Jun 25, 2024
  • feed-extractor Public

    Simplest way to read & normalize RSS/ATOM/JSON feed data

    extractus/feed-extractor’s past year of commit activity
    JavaScript 167 MIT 33 6 0 Updated May 7, 2024
  • .github Public

    Organization meta data

    extractus/.github’s past year of commit activity
    0 MIT 0 0 0 Updated Dec 3, 2022

Top languages

JavaScript HTML

Most used topics

Loading…