@kuhumcst

Centre for Language Technology, University of Copenhagen

Popular repositories

  1. cstlemma Public

    Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefix, infix, suffix, circumfix). Rules are obtained by supervis…

    C++ 36 6

  2. stucco Public archive

    An experimental adaptive UI toolkit.

    Clojure 31 1

  3. DanNet Public

    The Danish WordNet as an RDF graph.

    Clojure 21

  4. xml-hiccup Public

    Convert XML into Hiccup in Clojure and ClojureScript.

    Clojure 21 1

  5. taggerXML Public

    Modernized version of Eric Brill's part-of-speech tagger.

    C++ 17 6

  6. tf-idf Public

    A reasonably performant TF-IDF implementation.

    Clojure 12 1

Repositories

Showing 10 of 64 repositories
  • gml Public

    Create training sets for tagger and lemmatiser for Middle Low German.

    0 GPL-3.0 0 0 0 Updated Mar 21, 2025
  • texton Public

    Text Tonsorium - a toolbox that automatically arranges NLP tools into workflows and enacts them on the user's input.

    PHP 4 0 1 0 Updated Mar 19, 2025
  • texton-Java Public

    Web-based workflow management system that computes candidate tool workflows from the input file(s) and the user's requirements for the output, then runs the workflow that the user selects from the list of candidates. Implemented in Bracmat (~75%) and Java (~25%).

    Java 2 GPL-3.0 2 0 0 Updated Mar 19, 2025
  • clarin-tei Public Forked from kuhumcst/glossematics
    Clojure 0 1 3 0 Updated Feb 25, 2025
  • cstlemma Public

    Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and dozens of other languages. It uses affix rules (affix: prefix, infix, suffix, circumfix) obtained by supervised learning from a list of full form and lemma pairs.

    C++ 36 GPL-2.0 6 2 0 Updated Feb 25, 2025
  • hiccup-tools Public

    Navigate and manipulate Hiccup documents.

    Clojure 1 0 1 0 Updated Jan 29, 2025
  • 1 0 0 0 Updated Jan 10, 2025
  • DanNet Public

    The Danish WordNet as an RDF graph.

    Clojure 21 MIT 0 34 0 Updated Jan 7, 2025
  • HTML 0 0 0 0 Updated Dec 18, 2024
  • texton-bin Public

    Binary executable files used by services in the Text Tonsorium.

    0 0 0 0 Updated Dec 9, 2024