Use response selector keys for confusion matrix labelling #7423

hotzenklotz · 2020-12-01T10:00:40Z

Proposed changes:

At the moment, when running an NLU evaluation for response selectors the plotted confusion matrix with use the training data utterances as plot labels. This small PR uses the response selector keys and predicted (sub)-intents as labels instead. This make the confusion matrix a lot more readable and useful.
Similarly, the response_selection_report.json uses the same labels leading to better comprehension.

Examples
Before

After

Status (please check what you already did):

added some tests for the functionality
updated the documentation
updated the changelog (please check changelog for instructions)
reformat files using black (please check Readme for instructions)

…ation

sara-tagger · 2020-12-01T13:00:14Z

Thanks for submitting a pull request 🚀 @tttthomasssss will take a look at it as soon as possible ✨

dakshvar22

Thanks for fixing this on 1.10.x branch 👍 I suggested a few changes and based on them there are a few more places where you will have to make changes. For example, line 412-414(return actual and predicted labels instead of actual and predicted texts), line 1488(use actual and predicted labels instead of actual and predicted texts to compute metrics), line 241(not filter response examples based on empty texts).
All of these changes are actually implemented on master already if you would like to view them for reference.

dakshvar22 · 2020-12-02T22:05:29Z

rasa/nlu/test.py

+            if isinstance(response_prediction_full_intent, str):
+                response_prediction_full_intent = response_prediction_full_intent.split(
+                    "/"
+                )[1]


I would suggest not splitting the predicted retrieval_intent/sub_intent because you could have same sub_intent under multiple retrieval intents. So, let's compare for e.g. faq/ask_name to faq/ask_weather and not ask_name to ask_weather.

dakshvar22 · 2020-12-02T22:06:40Z

rasa/nlu/test.py

            response_target = example.get("response", "")
+            response_key = example.get(RESPONSE_KEY_ATTRIBUTE, "")


Following from the above comment, this would then change to example.get_combined_intent_response_key()

dakshvar22 · 2020-12-02T22:22:26Z

rasa/nlu/test.py

@@ -62,7 +63,7 @@

 ResponseSelectionEvaluationResult = namedtuple(
    "ResponseSelectionEvaluationResult",
-    "intent_target " "response_target " "response_prediction " "message " "confidence",
+    "intent_target response_key response_target response_prediction_full_intent response_prediction message confidence",


It looks like with this change we can also get rid of intent_target, response_target and response_prediction. Can you scrub those off as well? There will be some more places where you will have to make changes.

hotzenklotz · 2020-12-09T16:39:24Z

@dakshvar22 Thanks for the feedback. I applied your suggestions. Please see commit 021bcb1

hotzenklotz · 2020-12-15T10:42:33Z

@dakshvar22 Is anything else required for this PR?

dakshvar22

Thanks for fixing this and addressing all my comments. ✨

hotzenklotz · 2020-12-16T16:02:17Z

@dakshvar22 To my dismay I discovered that my refactoring reintroduced a bug in my original PR. Unfortunately, it was captured by any of the unit the tests. I came across it when running a new response selector evaluation today. Please see this commit for a fix: hotzenklotz@fd037be

In short, the default value for a response selector key was wrong. It needs to default to None for all "regular" intent examples or otherwise those won't be filtered out further down the pipeline and crash the sklearn reports.

cc @tmbo

Amendmend to PR #7423

hotzenklotz added 3 commits December 1, 2020 10:10

use response selector keys for plotting confusion matrix during evalu…

91ff6af

…ation

applied formatting

c81ffcb

added changelog

35fdb52

hotzenklotz changed the base branch from master to 1.10.x December 1, 2020 10:06

tmbo requested a review from dakshvar22 December 1, 2020 10:08

fix tests

c198393

sara-tagger requested a review from tttthomasssss December 1, 2020 13:00

fix test formatting

0f3642e

dakshvar22 requested changes Dec 2, 2020

View reviewed changes

tmbo removed the request for review from tttthomasssss December 7, 2020 09:49

applied PR feedback

021bcb1

dakshvar22 self-requested a review December 9, 2020 19:38

dakshvar22 approved these changes Dec 15, 2020

View reviewed changes

dakshvar22 merged commit a9fd74f into RasaHQ:1.10.x Dec 15, 2020

hotzenklotz added a commit to hotzenklotz/rasa that referenced this pull request Dec 16, 2020

amendmend to PR RasaHQ#7423

fd037be

hotzenklotz mentioned this pull request Dec 17, 2020

Amendmend to PR #7423 #7575

Merged

4 tasks

dakshvar22 added a commit that referenced this pull request Dec 18, 2020

Merge pull request #7575 from hotzenklotz/conf_mat_response_selector

e59b818

Amendmend to PR #7423

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use response selector keys for confusion matrix labelling #7423

Use response selector keys for confusion matrix labelling #7423

hotzenklotz commented Dec 1, 2020 •

edited

Loading

sara-tagger commented Dec 1, 2020

dakshvar22 left a comment •

edited

Loading

dakshvar22 Dec 2, 2020

dakshvar22 Dec 2, 2020

dakshvar22 Dec 2, 2020

hotzenklotz commented Dec 9, 2020

hotzenklotz commented Dec 15, 2020

dakshvar22 left a comment

hotzenklotz commented Dec 16, 2020

		response_target = example.get("response", "")
		response_key = example.get(RESPONSE_KEY_ATTRIBUTE, "")

Use response selector keys for confusion matrix labelling #7423

Use response selector keys for confusion matrix labelling #7423

Conversation

hotzenklotz commented Dec 1, 2020 • edited Loading

sara-tagger commented Dec 1, 2020

dakshvar22 left a comment • edited Loading

Choose a reason for hiding this comment

dakshvar22 Dec 2, 2020

Choose a reason for hiding this comment

dakshvar22 Dec 2, 2020

Choose a reason for hiding this comment

dakshvar22 Dec 2, 2020

Choose a reason for hiding this comment

hotzenklotz commented Dec 9, 2020

hotzenklotz commented Dec 15, 2020

dakshvar22 left a comment

Choose a reason for hiding this comment

hotzenklotz commented Dec 16, 2020

hotzenklotz commented Dec 1, 2020 •

edited

Loading

dakshvar22 left a comment •

edited

Loading