This release adds several languages to both the Aggressive and Pragmatic tokenizers, including Serbian, Ukranian, Bulgarian, and more. The sentiment data file is now bundled with the library as well.
This release adds several languages to both the Aggressive and Pragmatic tokenizers, including Serbian, Ukranian, Bulgarian, and more. The sentiment data file is now bundled with the library as well.