scossu bf2cfab416 Use Unicode classes for word boundary markers. 1 vecka sedan
..
data bf2cfab416 Use Unicode classes for word boundary markers. 1 vecka sedan
unit_tests e4a21ee4d3 WIP Tibetan and test update. 2 veckor sedan
__init__.py 03c0fcd820 Separate test folder in unit and integration. 10 månader sedan
integration.py aa924fb9b7 Add tests for precomposed character normalization. 6 månader sedan