Skip to content

Create 2022-07-26-phone-number-capture-dataset.md#108

Open
Anirudh257 wants to merge 1 commit intoskit-ai:masterfrom
Anirudh257:master
Open

Create 2022-07-26-phone-number-capture-dataset.md#108
Anirudh257 wants to merge 1 commit intoskit-ai:masterfrom
Anirudh257:master

Conversation

@Anirudh257
Copy link
Contributor

No description provided.

@swarajdalmia swarajdalmia requested a review from janaab11 August 1, 2022 09:52
Copy link
Contributor

@janaab11 janaab11 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall structure is there - similar to this. But:

  1. the content is missing is some places, and
  2. needs more emphasis on the metrics and how to read them


# Introduction

A bottleneck of a dialogue system is its ability to extract information from the utterances. The information is extracted in the form of **frames**, that represents all the different types of intentions that the system can extract from user utterances and **slots**, that are the different type of possible values.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is frames the same as intents ? lets use that to be consistent


## Sentence Error Rate

Sentence Error Rate is a robust extension to WER(Word Error Metric), used to evaluate the working of an ASR system.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Need more on how this is defined

1. **System performance** - this is captured through entity and slot metrics. in the case of alphanumeric here, we focus on SER (Sentence Error Rate)
2. **User experience** - this is captured through a subjective UX score, assigned through analysis (by the CUX function)

We expect, system performance (≈SER) to be better for entities captured across two turns - since we are parsing smaller sub-entities independently and (naively) expect SER to be a function of CER (Character Error Rate) and length.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If we keep this note, then it needs more explanation. What is SER and why is it a function of these two ? seems like some assumptions are being skipped


Sentence Error Rate is a robust extension to WER(Word Error Metric), used to evaluate the working of an ASR system.

We averaged the calculations across different callers, for each variation across different ASR systems.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

better sentence construction

We averaged the calculations across different callers, for each variation across different ASR systems.
![image](https://user-images.githubusercontent.com/16001446/180979426-e4ddc17f-a5e6-4af6-9659-e3f4e5166fac.png)

## UX Score
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This section needs a lot more! Can start with a summary of the UX report

> Single Turn is a more natural way to collect a phone number than Two turns.

# Future Work
## Validate on a larger dataset
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lets skip this, and add confidence intervals instead

For example, for the above conversation to book a flight, the frame will be of the type:

![image](https://user-images.githubusercontent.com/16001446/180951715-d0169425-8f17-443e-a823-23dff1e5af46.png)
# Motivation
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can this be merged with the previous section ? Introduction isnt saying much really

categories: [Machine Learning]
image: assets/images/demo1.jpg
layout: post
authors: [anirudhthatipelli]
Copy link
Contributor

@janaab11 janaab11 Aug 29, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

adding adithyanarayan as a co-author since he worked on all the evaluations here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants