Python Unicode Collation Algorithm

From The Digital Classicist Wiki
</blockquote>
</blockquote>


==Presentations==
 
* [https://github.com/SunoikisisDC/SunoikisisDC-2019-2020/wiki/DC-Session-4-Python Spring 2020: Introduction to Python] (Paula Granados García, Matteo Romanello)
[[category:tools]]
[[category:Unicode]]

Latest revision as of 14:04, 13 August 2021

Available

Author

  • James Tauber

Description

From the GitHub page (accessed 2020-10-06):

Python Unicode Collation Algorithm (PyUCA) is a Python implementation of the Unicode Collation Algorithm (UCA). It is used for sorting non-English strings properly. The core of the algorithm involves multi-level comparison. For example, café comes before caff because at the primary level, the accent is ignored and the first word is treated as if it were cafe. The secondary level (which considers accents) only applies then to words that are equivalent at the primary level.
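
The multi-level comparison described above can be illustrated with a small standard-library sketch. This is not PyUCA's implementation and not the full UCA (which takes its weights from the Unicode collation element table); the function name two_level_key is hypothetical, and stripping accents via NFD decomposition is an assumed simplification. It only shows why café sorts before caff once accents are deferred to a secondary level.

```python
import unicodedata


def two_level_key(word):
    """Return a (primary, secondary) sort key for a word.

    Hypothetical helper for illustration only: real UCA keys are built
    from collation weights, not from NFD decomposition as done here.
    """
    decomposed = unicodedata.normalize("NFD", word)
    primary = []    # base letters, accents ignored
    secondary = []  # accents attached to each base letter
    for ch in decomposed:
        if unicodedata.combining(ch):
            # Attach the accent to the preceding base letter, if any.
            if secondary:
                secondary[-1] += (ord(ch),)
        else:
            # Simplification: case is folded away entirely, whereas the
            # UCA treats case as a later (tertiary) difference.
            primary.append(ch.casefold())
            secondary.append(())
    return ("".join(primary), tuple(secondary))


# "caf\u00e9" is "café" written with an escape so the example does not
# depend on how the source file encodes the accented letter.
words = ["caff", "caf\u00e9", "cafe"]

# Plain code point order puts the accented word last.
print(sorted(words))

# Two-level order: the secondary level only breaks ties between words
# that are equal at the primary level ("cafe" and "café").
print(sorted(words, key=two_level_key))
```

For real sorting, the PyUCA README documents the equivalent usage as creating a `Collator` from the `pyuca` package and passing its `sort_key` method as the `key` argument to `sorted`.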