Speak typed words based on TextInfo if possible #8110

LeonarddeR · 2018-03-22T08:15:25Z

Link to issue number:

Closes #8065
Fixes #7812
Fixes #6215

Summary of the issue:

When speaking typed words, NVDA always relies on a buffer of typed characters. This causes the following problems:

- When selecting text and then typing a word, NVDA doesn't speak the first letter of the typed word. This issue was reported by a @BabbageCom customer.
- When editing a word, NVDA does not speak the word as edited, but only announces the characters that have been added to it.
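To make the second problem concrete, here is a minimal toy model of a buffer-based word echo. It is not NVDA's code: `TypedWordBuffer` and `edit_and_echo` are hypothetical names, and the sketch assumes that any non-alphanumeric character ends a word.

```python
class TypedWordBuffer:
    """Toy buffer-based word echo (hypothetical, not NVDA's implementation)."""

    def __init__(self):
        self._chars = []

    def type_char(self, ch):
        """Record a typed character; return the buffered word on a separator."""
        if ch.isalnum():
            self._chars.append(ch)
            return None
        word = "".join(self._chars)
        self._chars.clear()
        return word or None


def edit_and_echo(document, insert_at, typed):
    """Insert typed text at insert_at; return (new_document, echoed_word)."""
    buffer = TypedWordBuffer()
    echoed = None
    for ch in typed:
        document = document[:insert_at] + ch + document[insert_at:]
        insert_at += 1
        echoed = buffer.type_char(ch) or echoed
    return document, echoed


# The document already holds "word"; the user appends "s" and presses space.
document, echoed = edit_and_echo("word", 4, "s ")
print(repr(document), repr(echoed))  # prints 'words ' 's'
```

The buffer only ever saw the "s", so that is all it can echo, even though the document now contains "words".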

Description of how this pull request fixes the issue:

NVDA now tries to use the TextInfo information to announce the last entered word. This works as follows:

- speech.speakTypedCharacters no longer speaks typed words, but delegates this to a new function, speech.speakPreviousWord, which takes the word separator as an argument.
- speech.speakPreviousWord creates a TextInfo for the caret and calls textInfo.findWordBeforeCaret. This new function manipulates the textInfo and:
  - Returns True if the TextInfo can be used to announce the previous word.
  - Returns False if the caret is currently part of a word, unless the last word separator was a space. No word echo will be performed in this case. This is the logic that provides the fix for NVDA Breaks Words At Apostrophes with Speak Typed Words #6215.
  - Raises a LookupError if no suitable word can be found to speak, in which case speech.speakPreviousWord falls back to the buffer based word echo.
- speech.speakPreviousWord also deals with reporting of spelling errors as you type.
- NVDAObject._reportErrorInPreviousWord has been removed; its logic has been spread out over the two functions explained above.
- This pull request disables uniscribe based word offset calculations by default and enables uniscribe specifically for edit text controls. This is because uniscribe's idea of which characters separate words is totally different from that of most other applications (i.e. dot and comma are not considered word separators). Note that this will result in slightly different move-by-word behaviour in virtual buffers.
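The control flow described above can be sketched roughly as follows. This is a simplified stand-in that works on a plain string and a caret offset instead of a TextInfo. The functions `find_word_before_caret` and `speak_previous_word` and the `WORD_CHARS_EXTRA` constant are hypothetical. Unlike the real findWordBeforeCaret, which returns True or False and manipulates the textInfo, this toy returns the word itself, or None where the real function would return False.

```python
# Assumption: the application treats apostrophes as part of a word.
WORD_CHARS_EXTRA = "'"


def is_word_char(ch):
    return ch.isalnum() or ch in WORD_CHARS_EXTRA


def find_word_before_caret(text, caret, separator):
    """Return the word before the just-typed separator.

    Returns None when the caret is still inside a word (no echo wanted) and
    raises LookupError when no suitable word can be found.
    """
    if caret == 0 or caret > len(text) or text[caret - 1] != separator:
        raise LookupError("typed separator not found before the caret")
    if separator != " " and is_word_char(separator):
        # The application still considers the caret to be part of a word.
        return None
    end = caret - 1
    start = end
    while start > 0 and is_word_char(text[start - 1]):
        start -= 1
    if start == end:
        raise LookupError("no word before the caret")
    return text[start:end]


def speak_previous_word(text, caret, separator, buffered_word):
    """Return what would be echoed; fall back to the buffer on LookupError."""
    try:
        return find_word_before_caret(text, caret, separator)
    except LookupError:
        return buffered_word


print(speak_previous_word("words ", 6, " ", "s"))   # prints words
print(speak_previous_word("won'", 4, "'", "won"))   # prints None
print(speak_previous_word("", 0, "\n", "print"))    # prints print
```

The last call models a case where the typed text is no longer available when the lookup happens, so the buffered word is used instead.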

Open to discussion

- In this pr, I disabled word echo for non-editable cases (i.e. where there is no caret). If desired, that can be reverted, but I'd say speaking typed words doesn't make sense if there's nowhere to type. An exception could be typing in lists in order to quickly navigate to list items (e.g. in Explorer), but I'd also say that word echo in these cases only causes delay.
- I added a caret snapshot variable for personal debugging purposes, but also thought it'd be handy to keep it.

Testing performed:

Tested several applications, including:

- Notepad
- WordPad
- Microsoft Word, both with and without UIA
- Mozilla Firefox
- Mozilla Thunderbird
- Google Chrome
- LibreOffice

Known issues with pull request:

- It turns out that Firefox's word offset calculation (i.e. based on IAccessibleTextObject.TextAtOffset) is broken. This is a known bug that also affected spelling-error-as-you-type announcements, though in that case the problem was much less prevalent. Thunderbird and Chrome appear not to have this issue, or to have it to a lesser degree, but I didn't test these extensively. With the current implementation of this pr, we shouldn't be affected by this, though it needs testing.
- From a UX perspective, word announcement behavior will differ from what people are accustomed to in Notepad and Firefox, as some word separators are considered part of a word. This is most noticeable when pressing dot or comma, which won't trigger a typed word announcement.
- In LibreOffice, IAccessibleTextObject.TextAtOffset seems to use uniscribe for word boundaries whereas LibreOffice itself does not. Therefore, this pr overrides _getWordOffsets for SymphonyTextInfo to use word offset calculation based on OffsetsTextInfo. This is much more reliable, except for words containing commas (e.g. 1,3 or 4,5).
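As an illustration of why the choice of word separators matters here, the following sketch computes word offsets with a configurable set of extra word characters. It is neither uniscribe nor the actual OffsetsTextInfo algorithm. `word_offsets` and its `extra_word_chars` parameter are hypothetical, and the sketch assumes a word is simply a run of alphanumeric characters plus any extras.

```python
def word_offsets(text, offset, extra_word_chars=""):
    """Return (start, end) of the run of word characters containing offset."""

    def is_word_char(ch):
        return ch.isalnum() or ch in extra_word_chars

    if not 0 <= offset < len(text) or not is_word_char(text[offset]):
        raise LookupError("no word at offset")
    start = offset
    while start > 0 and is_word_char(text[start - 1]):
        start -= 1
    end = offset + 1
    while end < len(text) and is_word_char(text[end]):
        end += 1
    return start, end


text = "costs 1,3 euro"
# Comma treated as a separator: "1,3" is split and only "1" is found.
print(word_offsets(text, 6))        # prints (6, 7)
# Dot and comma treated as part of a word: "1,3" stays together.
print(word_offsets(text, 6, ".,"))  # prints (6, 9)
```

With the first notion of a word, typing a comma triggers a word echo but breaks up numbers such as 1,3. With the second, numbers stay intact but pressing dot or comma no longer ends a word.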

Change log entry:

Changes
- NVDA's announcement of typed words is now based on what the current application considers a word. (Improve word echo to read the last added/edited word #8065)
  - Note that for some applications (e.g. Notepad, Firefox) this results in slightly different behavior when pressing dot or comma while typing.
Bug fixes
- When speaking of typed words is enabled and typing overrides a selection, NVDA no longer omits the first character when announcing the first typed word. (nvda reads cut words when typing in a cell in excel #7812)
- When speaking of typed words is enabled, NVDA will now treat words containing apostrophes (such as won't and aren't) as one word if the application in use treats them as such. (NVDA Breaks Words At Apostrophes with Speak Typed Words #6215)

ehollig · 2018-03-22T23:13:38Z

How would this work with Chinese and Japanese or with languages that do not have spaces?

LeonarddeR · 2018-03-23T05:36:58Z

Good point, that ought to be tested, as I don't speak these languages. @josephsl, would you be able to test-drive this for Korean?

jiangtiandao · 2018-03-25T08:04:53Z

I cloned the code and tried.
It seems that reading on Chinese input method works as usual.

dnz3d4c · 2018-03-26T00:50:04Z

Could you provide a test build for Korean users?

LeonarddeR · 2018-03-26T06:29:12Z

@dnz3d4c commented on 26 Mar 2018, 02:50 CEST:

> Could you provide a test build for Korean users?

Sure. Here is a try build.

dnz3d4c · 2018-03-26T07:25:56Z

Thanks @LeonarddeR

michaelDCurran · 2018-04-15T21:57:32Z

It is likely that this may regress #456. Although it was not totally clear from the reporter, moving to uniscribe for virtualBuffers fixed Thai word segmentation issues.
What would be the impact to this pr if we kept using uniscribe for virtualBuffers? Was this change necessary to implement speak typed words properly, or was it just a nice improvement that fitted well with it?

michaelDCurran · 2018-04-15T22:22:53Z

In what real-world cases will this code fall back to the old character buffer implementation?

LeonarddeR · 2018-04-16T11:56:24Z

@michaelDCurran commented on 15 Apr 2018, 23:57 CEST:

> What would be the impact to this pr if we kept using uniscribe for virtualBuffers?

That would have no impact at all.

> Was this change necessary to implement speak typed words properly, or was it just a nice improvement that fitted well with it?

The latter. I think we should just keep using uniscribe for now.

LeonarddeR · 2018-04-16T12:27:35Z

@michaelDCurran commented on 16 Apr 2018, 00:22 CEST:

> In what real-world cases will this code fall back to the old character buffer implementation?

I can think of several such cases, which I tried to document in the code:

- When trying to look up a word before the caret while there is no word before the caret. I believe this applies, for example, when you press enter in the Python console, in which case the last word you entered is no longer part of the TextInfo by the time NVDA tries to get the last typed word. This also applies to Miranda NG.
- Sometimes, the IA2Text implementation for Firefox seems to lag behind, in which case TextInfo based word echo would be unreliable. I need to recheck the symptoms that occur when we use TextInfo based word echo in these cases.
- When using an editor with auto indentation, the editor adds several spaces or tabs at the start of a new line when pressing enter. That makes TextInfo based echo unreliable.

michaelDCurran · 2018-04-16T21:39:13Z

Re Firefox updating fast enough, I wonder if we can delay our code a bit. Either re-queue it one more time, or perhaps even do something similar to hasCaretMoved, but perhaps for only 50ms or so. For auto indenting, we could specifically handle this in some kind of script for kb:enter perhaps? Then we'd know to check the line above if the current line started with whitespace. Just some out of the box ideas.
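A rough sketch of the auto-indent idea, working on plain lines of text rather than on NVDA objects: if only whitespace precedes the caret on the current line, take the last word from the line above. The function name `word_to_echo_on_enter` and its parameters are hypothetical, and words are assumed to be separated by whitespace.

```python
def word_to_echo_on_enter(lines, caret_line, caret_column):
    """Pick the word to echo after enter was pressed.

    lines is the document split into lines; caret_line and caret_column give
    the caret position after the line break (and any auto indentation).
    """
    before_caret = lines[caret_line][:caret_column]
    if before_caret.strip():
        # There is real text before the caret on the current line.
        return before_caret.split()[-1]
    if caret_line == 0:
        raise LookupError("no line above the caret")
    words = lines[caret_line - 1].split()
    if not words:
        raise LookupError("the line above contains no word")
    return words[-1]


# The editor auto-indented the new line with four spaces.
print(word_to_echo_on_enter(["    return value", "    "], 1, 4))  # prints value
```

Raising LookupError when nothing is found mirrors the convention from the pull request description, so a caller could still fall back to the buffer based echo.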

LeonarddeR · 2018-04-30T14:27:38Z

@michaelDCurran commented on 16 Apr 2018, 23:39 CEST:

> Re Firefox updating fast enough, I wonder if we can delay our code a
> bit. Either re-queue it one more time, or perhaps even do something
> similar to hasCaretMoved, but perhaps for only 50ms or so.

I'm not 100% sure whether it is because Firefox lags behind. It really feels like it, though. However, using a short delay might make this somewhat more clear.

> For auto indenting, we could specifically handle this in some kind of
> script for kb:enter perhaps? Then we'd know to check the line above if the
> current line started with whitespace.

Interesting idea, I'm going to try this.

LeonarddeR · 2018-04-30T15:04:03Z

@michaelDCurran commented on 16 Apr 2018, 23:39 CEST:

> For auto indenting, we could specifically handle this in some kind of
> script for kb:enter perhaps?

This is now covered. I reverted the Firefox hack while at it, since it didn't play nice with reporting the typed character before sending enter to the system. I will have to look into the Firefox issue again.

LeonarddeR · 2018-05-03T14:16:06Z

@michaelDCurran commented on 16 Apr 2018, 23:39 CEST:

> Re Firefox updating fast enough, I wonder if we can delay our code a
> bit. Either re-queue it one more time, or perhaps even do something
> similar to hasCaretMoved, but perhaps for only 50ms or so.

Queuing one additional time doesn't seem to be enough, so there are now three attempts with 5 ms in between.
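The retry behaviour can be sketched as below. This is only an illustration of "three attempts with 5 ms in between": `retry_lookup` is a hypothetical helper, and it blocks with `time.sleep`, whereas the pull request re-queues the work inside NVDA, which is not shown here.

```python
import time


def retry_lookup(lookup, attempts=3, delay=0.005):
    """Call lookup until it succeeds, at most `attempts` times.

    A LookupError from the final attempt is passed on to the caller, which
    can then fall back to the buffer based word echo.
    """
    for attempt in range(attempts):
        try:
            return lookup()
        except LookupError:
            if attempt == attempts - 1:
                raise
            time.sleep(delay)


# Simulate a text implementation that only catches up on the third call.
calls = []

def lagging_lookup():
    calls.append(1)
    if len(calls) < 3:
        raise LookupError("text not updated yet")
    return "word"

print(retry_lookup(lagging_lookup), len(calls))  # prints word 3
```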

michaelDCurran · 2018-05-06T23:18:19Z

Currently a unit test fails: FAIL: test_onlySpaces (tests.unit.test_textInfos.TestFindWordBeforeCaret_exceptions)

LeonarddeR · 2018-05-07T03:58:52Z

Ah, I missed that, but really need to do some additional local testing anyway. This feature is really hard to get right. I'll look into it again today, thanks a lot for doing an additional review.

LeonarddeR · 2018-05-07T13:50:12Z

The more I test, the more I'm tempted to disable this for IA2Web objects altogether. I'm experiencing several issues, including the following:

- Create a new issue on GitHub for NVDA.
- Start typing above one of the headings (i.e. create a blank line above a ## line).

In quite a few cases, when pressing space after typing a word, NVDA announces the number sign that starts the new line as part of the typed word.

LeonarddeR · 2019-08-06T11:05:39Z

I realised that a disadvantage of the new approach is that it is more difficult to test, as it now relies on actual caret movement to cache the position before the caret moves. Having said that, the current approach is much less error prone.

I will have to look into this.

LeonarddeR · 2019-11-21T10:01:07Z

The current implementation is a bit sluggish in UIA consoles, probably because of the _caretMovementTimeoutMultiplier.

AppVeyorBot · 2019-11-21T10:28:23Z

PR introduces Flake8 errors 😲

See test results for Failed build of commit 365e8a1202

dpy013 · 2019-11-26T12:51:08Z

hi @LeonarddeR
Will this pr be merged into 2019.3?
thanks

LeonarddeR · 2019-11-26T14:07:49Z

I wrote this pull request on behalf of @BabbageCom. As I'm leaving @BabbageCom after the 29th of November, I can no longer afford to maintain this pr other than applying very basic review actions. If this pull request requires major changes, they will have to be applied by someone else, e.g. @sjfbol or whoever else is willing to take it.

> hi @LeonarddeR
> Will this pr be merged into 2019.3?
> thanks

Nope.

michaelDCurran · 2020-04-08T06:09:39Z

Although this approach has definite advantages, there are several limitations and bugs noted in the pr description. Also, @LeonarddeR has indicated he will have no time to work on it in future. Therefore we are closing this. However, anyone who wants to pick up this work is welcome to do so, or at the very least to learn from how some of this code was done.

Adriani90 · 2023-09-06T18:37:17Z

Labeled this as abandoned so that it can be found more easily by filtering on the label.

Adriani90 · 2024-05-17T20:08:23Z

cc: @mltony, @cary-rowen this might be interesting in light of word navigation and word pronouncing; maybe #16219 has opened up some new possibilities.
Note that this seems to be abandoned code as per the comment above, so someone else could take it up.

LeonarddeR changed the title ~~I8065~~ Speak typed words based on TextInfo if possible Mar 22, 2018

feerrenrut requested a review from michaelDCurran April 9, 2018 08:34

michaelDCurran previously approved these changes May 6, 2018

LeonarddeR added the component/text-info TextInfo objects and text review label May 18, 2018

Leonard de Ruijter added 9 commits July 18, 2018 08:42

Speak last word based on textInfo

295e983

Always respect TextInfos for word boundaries and fix SOffice

a635137

Python console: rely on TextInfo word echo

53637f8

Huge cleanup

128247e

Additional check for line separator when using notepad

7bff750

Revert remove old pre Windows Vista code

5ca05cd

This reverts commit 22d148ba086ea8b9122049697c5cf178cf82a76b.

Provide unit tests

ac21a33

Work around lagging TextInfo for Firefox

35ae023

Small cleanup of unrequired changes

20f8f98

Leonard de Ruijter added 5 commits July 26, 2019 06:53

Merge remote-tracking branch 'origin/master' into i8065

4dfc68d

Revert trailing spaces accidentally removed

2db5064

Merge remote-tracking branch 'origin/master' into i8065

02d2919

Linting actions

250ac8a

Remove now obsolete tests

6e25ce3

Merge remote-tracking branch 'origin/master' into i8065

ff98104

LeonarddeR mentioned this pull request Oct 3, 2019

"Speak typed words" behaves like "speak typed characters" when entering Asian text in Notepad and other edit fields and documents #2762

Open

LeonarddeR added the BabbageWork Pull requests filed on behalf of Babbage B.V. label Oct 8, 2019

Leonard de Ruijter added 4 commits November 7, 2019 09:36

Merge remote-tracking branch 'origin/master' into i8065

30382f1

Merge remote-tracking branch 'origin/master' into i8065

d8fed5d

Fix potential issue in editableText

744fc3a

Play spelling error wave using speech command

7c3cc67

Leonard de Ruijter added 3 commits November 21, 2019 11:03

Set max timeout to 0.015, any timeouts higher than that make typing t…

38769c6

…o sluggish

Don't use text info to speak typed words in UIA consoles

141f76e

Don't speak words using textINfo if the retrieved word is blank

d8af08e

Linting

a8aec6b

surfer0627 mentioned this pull request Feb 21, 2020

Speak typed word not correctly reading edited word in 2019.3.1 #10808

Closed

michaelDCurran closed this Apr 8, 2020

Adriani90 added the Abandoned requested reports or updates are missing since more than 1 year, author or users are not available. label Sep 6, 2023

CyrilleB79 mentioned this pull request Oct 26, 2024

Reads the entire word when it is modified, #17326

Closed
