Quizzes
Currently, our tokenizer cannot handle acronyms and 's when they are followed by another punctuation mark (e.g., "U.S." or Jinho's,). Your task is to improve the tokenizer so that it handles such cases.
- Download tokenize_punctuation.py or tokenize_regex.py.
- Modify tokenize_*.py so it can handle the above situation.
- Put tokenize_*.py under the directory cs329/quiz1.
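
As a rough illustration of the two cases above, here is a minimal regex sketch. It is not the course's tokenize_regex.py; the pattern, the name TOKEN, and the function tokenize are assumptions made for this example, and the real file may be organized quite differently.

```python
import re

# Hypothetical pattern, not taken from the course files.
# Alternatives are tried left to right, so acronyms are matched
# before the trailing period can be split off as punctuation.
TOKEN = re.compile(r"""
    (?:[A-Za-z]\.){2,}   # acronyms such as U.S. or U.S.A.
  | 's\b                 # 's kept as a separate token
  | \w+                  # a run of word characters
  | [^\w\s]              # any other single punctuation mark
""", re.VERBOSE)


def tokenize(text):
    """Return the list of tokens found in text, in order."""
    return TOKEN.findall(text)


print(tokenize('"U.S."'))
print(tokenize("Jinho's,"))
```

The ordering of the alternatives is the main design choice here: putting the acronym pattern first keeps the final period attached to U.S. even when a quotation mark or comma follows.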
Write phrase structure trees for the following sentence in the Penn Treebank format.
Emory University is a private research university in metropolitan Atlanta, located in the Druid Hills.
- Examples of phrase structure trees in the Penn Treebank can be found here.
- The Penn Treebank guidelines can be found here.
- Create a text file quiz2.txt containing two trees in the parenthetical notation and put it under the directory cs329/quiz2.
- Stanford parser: demo.
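
Before submitting, it may help to check that the brackets in quiz2.txt are balanced. The sketch below reads trees in parenthetical notation into nested lists; the function name read_trees and the sample sentence are assumptions for illustration, not part of the course materials.

```python
def read_trees(text):
    """Parse bracketed trees into nested lists.

    For example, "(NP (DT a))" becomes [['NP', ['DT', 'a']]].
    Raises ValueError when the brackets do not match.
    """
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    trees, stack = [], []
    for tok in tokens:
        if tok == "(":
            stack.append([])            # open a new constituent
        elif tok == ")":
            if not stack:
                raise ValueError("unmatched ')'")
            node = stack.pop()          # close the current constituent
            if stack:
                stack[-1].append(node)  # attach it to its parent
            else:
                trees.append(node)      # a complete top-level tree
        else:
            if not stack:
                raise ValueError("token outside any bracket: " + tok)
            stack[-1].append(tok)       # a label or a word
    if stack:
        raise ValueError("unmatched '('")
    return trees


# Illustrative sentence only; it is not the quiz sentence.
print(read_trees("(S (NP (PRP I)) (VP (VBP sleep)))"))
```

Calling len(read_trees(open("quiz2.txt").read())) would then report how many complete trees the file contains, assuming the file holds nothing but bracketed trees.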
- Create a virtual machine at Azure: guidelines.
- Log in to your Azure account by typing the following command on a terminal. If you don't have a terminal, try PuTTY. Replace yourID with your Emory ID and 104.209.xxx.xx with your public IP address.
- Install Java 8 by running the following commands:
      sudo apt-add-repository ppa:webupd8team/java
      sudo apt-get update
      sudo apt-get install oracle-java8-installer
      sudo apt-get install oracle-java8-set-default
- Enter the following commands:
      cat /etc/*-release > info.txt
      java -version 2>> info.txt
- Log in to your mathcs account:
- Create a directory cs329/quiz3 by entering the following command:
      mkdir -p cs329/quiz3
- Copy info.txt from your Azure account to your mathcs account:
      scp yourID@104.209.xxx.xx:~yourID/info.txt cs329/quiz3/
- Check that info.txt has been copied to your mathcs account by running the following command:
      cat cs329/quiz3/info.txt
- What is the difference between syntax and semantics? What is the difference between dependency structures and predicate argument structures? Create a file quiz4.txt under cs329/quiz4 containing your answer.
- Go to the WordNet online demo.
- Given the paragraph below, answer the following questions using WordNet. Create your own entailments if needed. Create a file quiz5.txt under cs329/quiz5 containing your answer.
- How many siblings do I have?
- Do I get along with my step-sister?
- Was my brother sleeping around 3am last night?
- Is my sister usually depressed?
- Was my sister depressed yesterday?
- Did I ever know my sister’s birthday?
- Did my sister get a gift from me?
I have a big brother, a little sister, and a step-sister.
I bond with my step-sister.
My brother snored at 3am last night.
My sister is usually cheerful but she was low yesterday because I forgot her birthday so I bought a gift for her.
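
One way to think about answering such questions is to follow WordNet relations, such as hypernyms and verb entailments, from the words in the paragraph to the words in the question. The sketch below uses a tiny hand-written table instead of querying WordNet; the entries, the names HYPERNYM and ENTAILS, and the function is_a are assumptions for illustration, so check each relation against the online demo.

```python
# Hand-written toy relations for illustration only; they are not
# read from WordNet and should be verified in the online demo.
HYPERNYM = {
    "brother": "male sibling",
    "male sibling": "sibling",
    "sister": "female sibling",
    "female sibling": "sibling",
}
ENTAILS = {"snore": "sleep"}


def is_a(word, ancestor):
    """Follow hypernym links upward until ancestor is found or the chain ends."""
    while word in HYPERNYM:
        word = HYPERNYM[word]
        if word == ancestor:
            return True
    return False


print(is_a("brother", "sibling"))
print(ENTAILS.get("snore"))
```

Relations that WordNet does not provide, such as world knowledge about forgetting a birthday, would have to be added as your own entailments.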
Write an abstract (up to 200 words) about your approach to homework 2. Try to highlight what your work improves over the baseline approach. We will select the top 12 submissions for presentations. Create a file quiz6.txt under cs329/quiz6 containing your answer.
Describe a mechanism that would change the learning rate during training in a meaningful way. Create a file quiz7.txt under cs329/quiz7 containing your answer.
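
To make the idea of a changing learning rate concrete, here is a minimal sketch of one common option, inverse time decay, where the rate shrinks as training proceeds. It is only one of several possible mechanisms; the function name decayed_rate and its parameters are assumptions for illustration, not part of the course code.

```python
def decayed_rate(initial_rate, decay, epoch):
    """Inverse time decay: initial_rate / (1 + decay * epoch).

    The rate equals initial_rate at epoch 0 and decreases
    monotonically for any positive decay.
    """
    return initial_rate / (1.0 + decay * epoch)


# Print the learning rate for the first few epochs.
for epoch in range(4):
    print(epoch, decayed_rate(0.1, 0.5, epoch))
```

A schedule like this depends only on the epoch number; a mechanism that reacts to the training loss or to per-feature gradient history would need additional state.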
Laplace smoothing gives a way of avoiding zero probabilities; however, it still assigns a very small probability that does not necessarily reflect the true probability of the unseen word. Describe a way of improving Laplace smoothing. Create a file quiz8.txt under cs329/quiz8 containing your answer.
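
For reference, additive smoothing over unigram counts can be sketched as follows. With alpha set to 1 this is Laplace smoothing; the function name smoothed_prob, the alpha parameter, and the toy counts are assumptions for illustration.

```python
from collections import Counter


def smoothed_prob(word, counts, vocab_size, alpha=1.0):
    """Additive smoothing: (count + alpha) / (total + alpha * vocab_size).

    alpha = 1.0 corresponds to Laplace (add-one) smoothing.
    """
    total = sum(counts.values())
    return (counts.get(word, 0) + alpha) / (total + alpha * vocab_size)


# Toy corpus with a three-word vocabulary {a, b, c}; "c" is unseen.
counts = Counter(["a", "a", "b"])
print(smoothed_prob("c", counts, vocab_size=3))
print(smoothed_prob("a", counts, vocab_size=3))
```

Every unseen word receives the same probability here regardless of context, which is the limitation the question asks you to address.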
Copyright © 2016 Emory University - All Rights Reserved.
