[WIP] Unify case sensitive topic names #733

preetmishra · 2020-07-23T12:21:30Z

This unifies case sensitive topic names using lowercase topic names as an invariant.

Crux

We need an invariant that we can rely on for lookups and comparisons, given that topics can change their casing at any point. Consequently, I propose using lowercase topic names as keys wherever we store or index data.
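As a rough illustration of the proposal (not the code in this PR), the sketch below indexes message ids under a lowercase topic key. The function names `canonical_topic` and `index_topics`, and the simplified message dicts, are assumptions made for the example.

```python
from collections import defaultdict
from typing import Dict, List, Set


def canonical_topic(topic: str) -> str:
    """Hypothetical helper: the lowercase form used as the lookup key."""
    return topic.lower()


def index_topics(messages: List[dict]) -> Dict[str, Set[int]]:
    """Group stream message ids by canonical (lowercase) topic name."""
    index: Dict[str, Set[int]] = defaultdict(set)
    for msg in messages:
        if msg["type"] == "stream":
            index[canonical_topic(msg["subject"])].add(msg["id"])
    return index


# Simplified message dicts (assumed shape, only the fields used here).
messages = [
    {"id": 1, "type": "stream", "subject": "case"},
    {"id": 2, "type": "stream", "subject": "CASE"},
    {"id": 3, "type": "private", "subject": ""},
]
topic_index = index_topics(messages)
```

With this key, messages sent to "case" and "CASE" land in the same bucket, so a later change of casing does not split the index.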

Commits

The current commit structure is temporary. I have fixed one thing per commit to represent how I went about the changes but we would definitely want to squash them (except the first) before merging.

I would greatly appreciate feedback about the proposal and what else should be fixed.

neiljp

@preetmishra This seems like a good first step (and the refactoring exposes a potential edge case bug?), but there are various points that we need to address, as demonstrated by manual testing. This is not a complete list, but for example:

- If a user sends to case and CASE then two separate unread counts appear in the topic list, and only one appears in the message list if you're in that topic narrow
- I have an existing instance which triggered this where there are unreads in topics with two 'different' names (by case), both with unreads on czo, but your code is not combining them.
- Editing a message topic which matches by case causes it to disappear, but not appear in a new/different topic

Essentially, while fetching messages goes through index_messages, there are other situations we need to consider - anywhere where we compare topics, which is potentially a lot of different places.
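To make the unread-count case concrete, here is a minimal sketch of combining counts for topics that differ only by case. The `(stream_id, topic)` key shape and the name `merge_unread_counts` are assumptions for illustration, not the project's actual data structures.

```python
from typing import Dict, Tuple

TopicKey = Tuple[int, str]  # (stream_id, lowercase topic name), assumed shape


def merge_unread_counts(raw_counts: Dict[Tuple[int, str], int]) -> Dict[TopicKey, int]:
    """Sum unread counts whose topic names differ only by case."""
    merged: Dict[TopicKey, int] = {}
    for (stream_id, topic), count in raw_counts.items():
        key = (stream_id, topic.lower())
        merged[key] = merged.get(key, 0) + count
    return merged


# Two 'different' topics by case in stream 10, plus one in stream 11.
raw = {(10, "case"): 2, (10, "CASE"): 3, (11, "Case"): 1}
merged = merge_unread_counts(raw)
```

The same normalization would need to be applied at every other comparison site, which is the broader concern raised here.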

Perhaps fundamentally, if we get an update to a topic (e.g. edited latest message, or new message), should we change the topic of every message we have stored? Or just update the rendering?

neiljp · 2020-07-23T15:34:29Z

zulipterminal/helper.py

-                topics_in_stream[msg['subject']] = set()
-            topics_in_stream[msg['subject']].add(msg['id'])
+        if msg['type'] == 'stream' and len(narrow) == 2:
+            narrow_topic = narrow[1][1]


narrow_topic vs narrow_topics?

The narrow can only have one topic, right?
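For reference, a small sketch of the single-topic assumption in the diff above. It assumes a narrow is a list of `[operator, operand]` pairs, which matches the `narrow[1][1]` access; the function names are hypothetical.

```python
from typing import List, Optional


def narrowed_topic(narrow: List[List[str]]) -> Optional[str]:
    """Return the topic of a stream+topic narrow, or None otherwise."""
    if len(narrow) == 2 and narrow[1][0] == "topic":
        return narrow[1][1]
    return None


def in_narrowed_topic(msg: dict, narrow: List[List[str]]) -> bool:
    """Case-insensitively check whether a stream message is in the narrowed topic."""
    topic = narrowed_topic(narrow)
    return (
        msg["type"] == "stream"
        and topic is not None
        and msg["subject"].lower() == topic.lower()
    )


topic_narrow = [["stream", "general"], ["topic", "case"]]
stream_narrow = [["stream", "general"]]
msg = {"type": "stream", "subject": "CASE"}
```

Since at most one topic can be returned, a singular name such as `narrow_topic` describes the value more accurately.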

zulipterminal/helper.py

preetmishra · 2020-07-24T19:31:27Z

@neiljp Thanks for the review and the pointers! 👍

I have reworked the fundamental approach that I had to now store lowercase topic names for lookups and comparisons (see #733 (comment)). I have also addressed the three issues that you reported.

zulipterminal/model.py

preetmishra · 2020-07-27T19:06:21Z

Updated with improved commits, more comments and test amendments (except those related to muted topics).

preetmishra · 2020-07-31T15:04:28Z

Updated to resolve conflicts.

The intent is to use lowercase topic names, as an invariant, in the data structures that we use locally to keep track of topics and their metadata (e.g. unread count). canonicalize_topic() and compare_lowercase() are added as helpers. Tests amended.

neiljp · 2020-08-30T01:39:33Z

@preetmishra This looks to cover a good number of comparison cases; this was blocked on another PR?

To clarify, this uses:

lower case in internal structures (that seems clearest)
"latest" topic names in display? (last-but-one commit references that?)
the 'real' case in what remaining places? (for display ^)

I think this will be clearer with the now-merged #675 and when the topic list updates with something like #785, as we should be able to test internally more easily.

This looks good, though I've not dug into all the cases so far. Pending further review, this seems reasonable. My concern is whether we might consider handling topics locally with ids, which may simplify this issue: each topic_id in a stream, and so each (stream_id, topic_id), would be unique, so we can have a 'latest name' (for display) for each id, and the comparisons would all occur at the point where topic names are converted to ids.
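A minimal sketch of the id-based idea, under the assumption that ids are assigned locally on first sight of a topic. `TopicRegistry` and its methods are hypothetical names; nothing like this exists in the PR.

```python
from typing import Dict, Tuple


class TopicRegistry:
    """Hypothetical local mapping from topic names to stable ids."""

    def __init__(self) -> None:
        # (stream_id, lowercase topic name) -> locally assigned topic id
        self._ids: Dict[Tuple[int, str], int] = {}
        # (stream_id, topic_id) -> most recently seen casing, for display
        self._latest_name: Dict[Tuple[int, int], str] = {}

    def topic_id(self, stream_id: int, topic: str) -> int:
        """Convert a topic name to its id; the only place case is compared."""
        key = (stream_id, topic.lower())
        if key not in self._ids:
            self._ids[key] = len(self._ids)
        topic_id = self._ids[key]
        self._latest_name[(stream_id, topic_id)] = topic
        return topic_id

    def display_name(self, stream_id: int, topic_id: int) -> str:
        """Return the latest casing seen for this topic."""
        return self._latest_name[(stream_id, topic_id)]


registry = TopicRegistry()
first = registry.topic_id(10, "case")
second = registry.topic_id(10, "CASE")
other_stream = registry.topic_id(11, "case")
```

Everything downstream of `topic_id()` can then compare plain integers, and a casing change only touches the stored display name rather than every stored message.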

The first commit seems like a separate cleanup, is that correct?

zulipbot · 2021-01-30T20:34:28Z

Heads up @preetmishra, we just merged some commits that conflict with the changes you made in this pull request! You can review this repository's recent commits to see where the conflicts occur. Please rebase your feature branch against the upstream/main branch and resolve your pull request's merge conflicts accordingly.

preetmishra added the feedback wanted label Jul 23, 2020

neiljp reviewed Jul 23, 2020

neiljp removed the feedback wanted label Jul 23, 2020

neiljp requested a review from sumanthvrao July 23, 2020 16:25

preetmishra force-pushed the feat-unify-similar-topics branch from 013e101 to f050695 Compare July 24, 2020 19:02

zulipbot added the size: L [Automatic label added by zulipbot] label Jul 24, 2020

preetmishra added the feedback wanted label Jul 24, 2020

preetmishra changed the title ~~Index case insensitive topic names by narrowed topic~~ Unify case sensitive topic names Jul 25, 2020

sumanthvrao reviewed Jul 26, 2020

zulipterminal/model.py (outdated)

preetmishra removed the feedback wanted label Jul 27, 2020

preetmishra changed the title ~~Unify case sensitive topic names~~ [WIP] Unify case sensitive topic names Jul 27, 2020

neiljp added this to the Release after upcoming milestone Jul 27, 2020

preetmishra force-pushed the feat-unify-similar-topics branch from f050695 to 80d55d2 Compare July 27, 2020 18:43

zulipbot added size: XL [Automatic label added by zulipbot] and removed size: L [Automatic label added by zulipbot] labels Jul 27, 2020

preetmishra force-pushed the feat-unify-similar-topics branch from 80d55d2 to 42448ec Compare July 27, 2020 19:05

preetmishra changed the title ~~[WIP] Unify case sensitive topic names~~ Unify case sensitive topic names Jul 27, 2020

preetmishra added the PR needs review PR requires feedback to proceed label Jul 27, 2020

preetmishra force-pushed the feat-unify-similar-topics branch from 42448ec to 6a2b55a Compare July 31, 2020 15:04

preetmishra changed the title ~~Unify case sensitive topic names~~ [AWAITING] Unify case sensitive topic names Aug 6, 2020

preetmishra removed the PR needs review PR requires feedback to proceed label Aug 6, 2020

preetmishra changed the title ~~[AWAITING] Unify case sensitive topic names~~ [WIP] Unify case sensitive topic names Aug 14, 2020

preetmishra added 5 commits August 21, 2020 23:09

refactor: helper: Simplify the topic index block in index_messages().

eae5cd9

This extracts msg_topic and narrow_topics as variables and amends the conditional accordingly.

helper/model: Update repr lookup for _have_last_message and pointer.

602004e

Also added narrow_with_canonical_topic().

model: Use canonical_topic() for muted topics.

1c54d9d

conftest/helper/views: Use canonical_topic() for unread_counts.

9e87f9e

Tests amended.

preetmishra added 3 commits August 21, 2020 23:10

model: Use compare_lowercase() for topics while updating messages.

3af0aac

model/views: Acknowledge case sensitive topics in topic list update.

110166e

Also, amended related _update_topic_index().

boxes: Unify case sensitive topic names for stream narrows.

0338c11

preetmishra force-pushed the feat-unify-similar-topics branch from 6a2b55a to 0338c11 Compare August 21, 2020 18:01

preetmishra changed the title ~~[WIP] Unify case sensitive topic names~~ Unify case sensitive topic names Aug 21, 2020

preetmishra added the PR needs review PR requires feedback to proceed label Aug 21, 2020

neiljp removed the PR needs review PR requires feedback to proceed label Aug 30, 2020

preetmishra changed the title ~~Unify case sensitive topic names~~ [WIP] Unify case sensitive topic names Aug 30, 2020

neiljp modified the milestones: 0.6.0, Release after next Jan 28, 2021

Base automatically changed from master to main January 30, 2021 20:30

zulipbot added the has conflicts label Jan 30, 2021

neiljp force-pushed the main branch from f9f483a to 793e73d Compare December 15, 2021 21:30

neiljp modified the milestones: Next Release, Release after next Mar 22, 2022
