Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create 2022-07-26-phone-number-capture-dataset.md #108

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

Anirudh257
Copy link
Contributor

No description provided.

@swarajdalmia swarajdalmia requested a review from janaab11 August 1, 2022 09:52
Copy link
Contributor

@janaab11 janaab11 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall structure is there - similar to this. But:

  1. the content is missing is some places, and
  2. needs more emphasis on the metrics and how to read them


# Introduction

A bottleneck of a dialogue system is its ability to extract information from the utterances. The information is extracted in the form of **frames**, that represents all the different types of intentions that the system can extract from user utterances and **slots**, that are the different type of possible values.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is frames the same as intents ? lets use that to be consistent


## Sentence Error Rate

Sentence Error Rate is a robust extension to WER(Word Error Metric), used to evaluate the working of an ASR system.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Need more on how this is defined

1. **System performance** - this is captured through entity and slot metrics. in the case of alphanumeric here, we focus on SER (Sentence Error Rate)
2. **User experience** - this is captured through a subjective UX score, assigned through analysis (by the CUX function)

We expect, system performance (≈SER) to be better for entities captured across two turns - since we are parsing smaller sub-entities independently and (naively) expect SER to be a function of CER (Character Error Rate) and length.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If we keep this note, then it needs more explanation. What is SER and why is it a function of these two ? seems like some assumptions are being skipped


Sentence Error Rate is a robust extension to WER(Word Error Metric), used to evaluate the working of an ASR system.

We averaged the calculations across different callers, for each variation across different ASR systems.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

better sentence construction

We averaged the calculations across different callers, for each variation across different ASR systems.
![image](https://user-images.githubusercontent.com/16001446/180979426-e4ddc17f-a5e6-4af6-9659-e3f4e5166fac.png)

## UX Score
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This section needs a lot more! Can start with a summary of the UX report

> Single Turn is a more natural way to collect a phone number than Two turns.

# Future Work
## Validate on a larger dataset
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lets skip this, and add confidence intervals instead

For example, for the above conversation to book a flight, the frame will be of the type:

![image](https://user-images.githubusercontent.com/16001446/180951715-d0169425-8f17-443e-a823-23dff1e5af46.png)
# Motivation
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can this be merged with the previous section ? Introduction isnt saying much really

categories: [Machine Learning]
image: assets/images/demo1.jpg
layout: post
authors: [anirudhthatipelli]
Copy link
Contributor

@janaab11 janaab11 Aug 29, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

adding adithyanarayan as a co-author since he worked on all the evaluations here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants