Python Unicode Collation Algorithm: Difference between revisions
Jump to navigation
Jump to search
SarahDualeh (talk | contribs) (Add references) |
SarahDualeh (talk | contribs) mNo edit summary |
||
| Line 15: | Line 15: | ||
</blockquote> | </blockquote> | ||
[[category:tools]] | [[category:tools]] | ||
[[category:Unicode]] | [[category:Unicode]] | ||
Latest revision as of 14:04, 13 August 2021
Available
Author
- James Tauber
Description
From the Github page (accessed 2020-10-06):
Python Unicode Collation Algorithm (PyUCA) is a Python implementation of the Unicode Collation Algorithm (UCA). It is used for sorting non-English strings properly. The core of the algorithm involves multi-level comparison. For example, café comes before caff because at the primary level, the accent is ignored and the first word is treated as if it were cafe. The secondary level (which considers accents) only applies then to words that are equivalent at the primary level.