ucto

ucto

Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuatio

C++61gpl-3.0

3 months ago

computational-linguisticsfolialanguage

python-ucto

python-ucto

This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first

Cython29

7 months ago

computational-linguisticsfolianlp