Skip to content

plugin to transform from HTML (rehype) to prose (retext)

License

Notifications You must be signed in to change notification settings

rehypejs/rehype-retext

Repository files navigation

rehype-retext

Build Coverage Downloads Size Sponsors Backers Chat

rehype plugin to support retext.

Contents

What is this?

This package is a unified (rehype) plugin to support retext.

unified is a project that transforms content with abstract syntax trees (ASTs). rehype adds support for HTML to unified. retext adds support for natural language to unified. hast is the HTML AST that rehype uses. nlcst is the natural language AST that retext uses. This is a rehype plugin that transforms hast into nlcst to support retext.

When should I use this?

This project is useful if you want to check natural language in HTML. The retext ecosystem has many useful plugins to check prose, such as retext-indefinite-article which checks that a and an are used correctly, or retext-readability which checks that sentences are not too complex. This plugins lets you use them on HTML documents.

This plugin is not able to apply changes by retext plugins (such as done by retext-smartypants) to the HTML content.

This plugin is built on hast-util-to-nlcst, which does the work on syntax trees. rehype focusses on making it easier to transform content by abstracting such internals away.

Install

This package is ESM only. In Node.js (version 16+), install with npm:

npm install rehype-retext

In Deno with esm.sh:

import rehypeRetext from 'https://esm.sh/rehype-retext@5'

In browsers with esm.sh:

<script type="module">
  import rehypeRetext from 'https://esm.sh/rehype-retext@5?bundle'
</script>

Use

Say we have the following file example.html:

<!doctype html>
<meta charset=utf8>
<title>Hello!</title>
<article>
  A implicit sentence.
  <h1>This and and that.</h1>
</article>

…and our module example.js looks as follows:

import rehypeParse from 'rehype-parse'
import rehypePresetMinify from 'rehype-preset-minify'
import rehypeRetext from 'rehype-retext'
import rehypeStringify from 'rehype-stringify'
import retextEnglish from 'retext-english'
import retextIndefiniteArticle from 'retext-indefinite-article'
import retextRepeatedWords from 'retext-repeated-words'
import {read} from 'to-vfile'
import {unified} from 'unified'
import {reporter} from 'vfile-reporter'

const file = await unified()
  .use(rehypeParse)
  .use(
    rehypeRetext,
    unified()
      .use(retextEnglish)
      .use(retextIndefiniteArticle)
      .use(retextRepeatedWords)
  )
  .use(rehypePresetMinify)
  .use(rehypeStringify)
  .process(await read('example.html'))

console.error(reporter([file]))
console.log(String(file))

…then running node example.js yields:

example.html
5:3-5:4   warning Unexpected article `A` before `implicit`, expected `An` retext-indefinite-article retext-indefinite-article
6:12-6:19 warning Unexpected repeated `and`, remove one occurrence        and                       retext-repeated-words

⚠ 2 warnings
<!doctypehtml><meta charset=utf8><title>Hello!</title><article>A implicit sentence.<h1>This and and that.</h1></article>

API

This package exports no identifiers. The default export is rehypeRetext.

unified().use(rehypeRetext, options)

Bridge or mutate to retext.

Parameters
Returns

Transform (Transformer).

Notes
  • if a processor is given, uses its parser to create a new nlcst tree, then runs the plugins attached to with that (bridge mode); you can add a parser to processor for example with retext-english; other plugins used on the processor should be retext plugins
  • if a parser is given, uses it to create a new nlcst tree, and returns it (mutate mode); you can get a parser by importing Parser from retext-english for example; other plugins used after rehypeRetext should be retext plugins

Types

This package is fully typed with TypeScript. It exports no additional types.

Compatibility

Projects maintained by the unified collective are compatible with maintained versions of Node.js.

When we cut a new major release, we drop support for unmaintained versions of Node. This means we try to keep the current release line, rehype-retext@5, compatible with Node.js 16.

This plugin works with unified version 6+, rehype version 4+, and retext version 7+.

Security

rehype-retext does not change the syntax tree so there are no openings for cross-site scripting (XSS) attacks.

Related

Contribute

See contributing.md in rehypejs/.github for ways to get started. See support.md for ways to get help.

This project has a code of conduct. By interacting with this repository, organization, or community you agree to abide by its terms.

License

MIT © Titus Wormer