Skip to content

Properly handle final sigma lowercase#56

Open
Jules-Bertholet wants to merge 1 commit intowithoutboats:masterfrom
Jules-Bertholet:final-sigma
Open

Properly handle final sigma lowercase#56
Jules-Bertholet wants to merge 1 commit intowithoutboats:masterfrom
Jules-Bertholet:final-sigma

Conversation

@Jules-Bertholet
Copy link

The rules for determining how a Greek sigma should be lowercased are more complex than just "is it the last character in the word." The full rule is defined in the Unicode Standard under "Final_Sigma": https://www.unicode.org/versions/Unicode15.1.0/ch03.pdf#G54277

Implementing the rules in heck would require shipping static data for the Case_Ignorable Unicode property. So this commit instead uses the standard library implementation, which unfortunately costs an extra heap allocation.

The rules for determining how a Greek sigma should be lowercased
are more complex than just "is it the last character in the word."
The full rule is defined in the Unicode Standard under "Final_Sigma":
<https://www.unicode.org/versions/Unicode15.1.0/ch03.pdf#G54277>

Implementing the rules ourselves would require shipping
static data for the `Case_Ignorable` Unicode property.
So this commit instead uses the standard library implementation,
which unfortunately costs a heap allocation.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant