Fix matching user input to datalist values #4814 #7003
Changes the description of datalist matching to use the term 'search' instead of 'substring match' and adds examples of some of the textual variation issues that user agents might consider. Adds a reference to CHARMOD-NORM (String Matching).
Text looks great, just some nits on formatting.
source
Outdated
@@ -125282,6 +125286,10 @@ INSERT INTERFACES HERE
<dt id="refsCHARMOD">[CHARMOD]</dt>
<dd>(Non-normative) <cite><a href="https://www.w3.org/TR/charmod/">Character Model for the World Wide Web 1.0: Fundamentals</a></cite>, M. Dürst, F. Yergeau, R. Ishida, M. Wolf, T. Texin. W3C.</dd>

<dt id="refsCHARMODNORM">[CHARMODNORM]</dt> <dd>(Non-normative) <cite><a
Wrapping is confused here. (The great rewrapper does not work well with dt/dds.)
Done.
@@ -125282,6 +125286,10 @@ INSERT INTERFACES HERE
<dt id="refsCHARMOD">[CHARMOD]</dt>
<dd>(Non-normative) <cite><a href="https://www.w3.org/TR/charmod/">Character Model for the World Wide Web 1.0: Fundamentals</a></cite>, M. Dürst, F. Yergeau, R. Ishida, M. Wolf, T. Texin. W3C.</dd>

<dt id="refsCHARMODNORM">[CHARMODNORM]</dt> <dd>(Non-normative) <cite><a
href="https://www.w3.org/TR/charmod-norm/">Character Model for the World Wide Web: String
Use https://w3c.github.io/charmod-norm/ instead (we always cite EDs in WHATWG specs, never TR).
Will fix.
(I don't think there is an ED for charmod just above, which may be an exception to the rule... which I faithfully copied).
@@ -125282,6 +125286,10 @@ INSERT INTERFACES HERE
<dt id="refsCHARMOD">[CHARMOD]</dt>
<dd>(Non-normative) <cite><a href="https://www.w3.org/TR/charmod/">Character Model for the World Wide Web 1.0: Fundamentals</a></cite>, M. Dürst, F. Yergeau, R. Ishida, M. Wolf, T. Texin. W3C.</dd>

<dt id="refsCHARMODNORM">[CHARMODNORM]</dt> <dd>(Non-normative) <cite><a
href="https://www.w3.org/TR/charmod-norm/">Character Model for the World Wide Web: String
Matching</a></cite>, A.Phillips. W3C.</dd>
Space after "A."
Done.
Um. Sorry, but this appears to have lost sight of the main point of the issue raised, which was that (while additional forms of matching would be nice and are certainly encouraged) Unicode normalisation and case folding really SHOULD be an expected baseline. The proposed text put no emphasis or urgency on those two things. @aphillips in case you didn't see the email i sent to you, here was my suggestion for the edit (it probably needs some improvement, but it tries to emphasise the importance of normalisation and case-folding):
|
@r12a Okay. I tried to follow our original bug report. What do you think of:
Note a few wording insertions, particularly "appropriate" with "case folding". Should we say "language-appropriate"? Suggest edits.
@aphillips I'm not clear why you'd want to qualify case-folding with '(language-)appropriate'. This is case folding, not case conversion. With case-folding there's no need to know the language of the text.
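To make the distinction concrete, here is a minimal sketch (not spec text, and not how any user agent is known to implement it) of a datalist-style search that applies Unicode normalisation and case folding before comparing. The helper names `fold_key` and `search_options` are hypothetical, and using NFC on both sides of the fold is an assumption made for this illustration. Note that Python's `str.casefold()` takes no locale or language argument, which is the point being made about case folding.

```python
import unicodedata


def fold_key(text: str) -> str:
    """Build a comparison key: normalise, case-fold, then normalise again.

    Hypothetical helper for illustration. Case folding can produce
    output that is no longer normalised, hence the second pass.
    """
    normalised = unicodedata.normalize("NFC", text)
    # casefold() is language-independent: it takes no locale argument.
    folded = normalised.casefold()
    return unicodedata.normalize("NFC", folded)


def search_options(user_input: str, options: list[str]) -> list[str]:
    """Return the options whose folded form contains the folded input."""
    needle = fold_key(user_input)
    return [option for option in options if needle in fold_key(option)]


if __name__ == "__main__":
    options = ["Caf\u00e9 Noir", "Tea"]
    # The input uses "e" + U+0301 (combining acute) instead of U+00E9.
    print(search_options("cafe\u0301", options))
```

With this key, input typed as a decomposed sequence still finds the precomposed option, and upper- or lower-case input makes no difference, without the code ever needing to know the language of the text.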
I think that that text is redundant. You say the same thing as the first sentence in the rest of the paragraph. Wrt the second sentence, this is searching, rather than string-matching, so the only examples that are relevant are those related to Unicode normalisation and case-folding, so if we keep the link, i'd put it after the "Substring matching should be done..." sentence.
This includes the discussion of the I18N WG in the teleconference of 2021-09-02.
I have updated the text with the results of our discussion in the I18N teleconference of 2021-09-02. @r12a please check the results. I notice that what we're describing here is basically a subset of #3539 (
|
LGTM with nit, but I guess we should also get @r12a's signoff?
Thanks so much for working on this.
code point sequences caused by different keyboard- or input-specific mechanisms do not interfere
with the matching process. Case variations should be ignored, which may require language-specific
case mapping. For examples of these, see <cite>Character Model for the World Wide Web: String
Matching</cite> <ref spec=CHARMODNORM>. User agents may also provide other matching features: for
Nit: move the <ref spec=CHARMODNORM> to the end of the paragraph, after the final period.
Done.
Nit: i'd put commas in, to make it easier to read:
code point sequences, caused by different keyboard- or input-specific mechanisms, do not
- Added commas for readability.
- Moved CHARMODNORM ref tag to end of paragraph.
(See WHATWG Working Mode: Changes for more details.)
/input.html ( diff )
/references.html ( diff )