In this document [[RFC2119]] keywords in uppercase italics have their usual meaning. We also use these stylistic conventions:
Definitions appear with a different background color and decoration like this.
-
Best practices appear with a different background color and decoration like this.
+
+
Best practices appear with a different background color and decoration like this.
+
Issues, gaps, and recommendations for future work appear with a different background color and decoration like this.
@@ -208,10 +210,18 @@
Matching variation due to language
Case Folding
-
+
+
+
By default, string searching SHOULD be case-insensitive using Unicode's case-folding algorithms.
+
+
+
+
User agents MAY offer authors and end-users an option to configure the case-sensitivity of searches.
+
+
A user might expect a term entered in lowercase to match uppercase equivalents (and perhaps vice-versa). Sub-string matching features, such as the browser "find" command, often offer a user-selectable option for matching (or not) the case of the input to that of the text.
-
For a survey of case folding, see the discussion here in [[CHARMOD-NORM]].
+
For a survey of case folding, see the discussion in [[CHARMOD-NORM]] and the section titled Case Mappings in [[Unicode]] Chapter 5.
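
As an illustration of the case-folding recommendation above, here is a minimal sketch of a sub-string search that is case-insensitive by default and has an opt-in case-sensitive mode. The function name `caseless_contains` and the `case_sensitive` parameter are hypothetical and do not come from this specification. The sketch relies on Python's `str.casefold()`, which applies Unicode full case folding. It does not perform Unicode normalization, which a complete caseless match might also require.

```python
def caseless_contains(text: str, term: str, case_sensitive: bool = False) -> bool:
    """Return True if `term` occurs in `text`.

    Hypothetical helper: case-insensitive by default, with an option
    that lets the caller request an exact-case match instead.
    """
    if case_sensitive:
        # Exact comparison: "world" does not match "World".
        return term in text
    # Fold both sides so that, for example, "ß" and "SS" compare equal.
    return term.casefold() in text.casefold()


if __name__ == "__main__":
    print(caseless_contains("Die STRASSE ist lang", "straße"))
    print(caseless_contains("Hello World", "world", case_sensitive=True))
```

Folding both the text and the search term, rather than lowercasing them, is what allows "straße" to match "STRASSE", because full case folding can change the length of a string.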
@@ -295,7 +305,11 @@
Script Equivalence
East Asian Width
-
+
+
+
String searching SHOULD match between full-width and half-width character forms.
+
+
Some compatibility characters were encoded into Unicode to account for single- or multibyte representation in legacy character encodings or for compatibility with certain layout behaviors in East Asian languages.
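
One possible way to match full-width and half-width forms is sketched below, assuming that the `<wide>` and `<narrow>` compatibility decompositions in the Unicode Character Database identify the characters in question. The helper names `fold_width` and `width_insensitive_contains` are hypothetical. The final NFC step is an added assumption: it recombines a half-width katakana letter and its separate voicing mark into the precomposed full-width letter, and it also canonically normalizes the rest of the string.

```python
import unicodedata


def fold_width(text: str) -> str:
    """Replace full-width and half-width forms with their base characters.

    Hypothetical helper: only <wide> and <narrow> compatibility
    decompositions are applied; other compatibility characters are kept.
    """
    out = []
    for ch in text:
        decomp = unicodedata.decomposition(ch)
        if decomp.startswith(("<wide>", "<narrow>")):
            # The value looks like "<wide> 0041": a tag, then hex code points.
            out.append("".join(chr(int(cp, 16)) for cp in decomp.split()[1:]))
        else:
            out.append(ch)
    # Recompose sequences such as KA + combining voiced sound mark into GA.
    return unicodedata.normalize("NFC", "".join(out))


def width_insensitive_contains(text: str, term: str) -> bool:
    """Return True if `term` occurs in `text`, ignoring East Asian width."""
    return fold_width(term) in fold_width(text)


if __name__ == "__main__":
    print(fold_width("\uff21\uff22\uff23\uff11\uff12\uff13"))
    print(width_insensitive_contains("\uff76\uff9e", "\u30ac"))
```

Restricting the mapping to `<wide>` and `<narrow>` keeps the fold narrower than a full NFKC normalization, which would also alter unrelated compatibility characters such as ligatures and superscripts.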