CSE-567-Computational_Linguistic

Natural Language Processing programs and projects (implemented in Prolog)

1) Project - Bigram Model Sentence Evaluator

Problem: Prolog project to evaluate the correctness of English sentences using a bigram model.

Approach: The project builds a bigram language model in Prolog from the small DA_Corpus.text corpus.

The DA_Corpus.text corpus is normalized using Unix commands.
Prolog-readable unigram.pl and bigram.pl databases are generated from the normalized corpus.
Finally, bigram_model.pl computes the probability of a word sequence of any length via the predicate calc_prob/2. The predicate works in log space and applies Laplace smoothing on the fly to compute the probability of the given sentence.
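The scoring step can be illustrated with a minimal SWI-Prolog sketch. It is not the code in bigram_model.pl: the fact formats `unigram/2` and `bigram/3`, the toy counts, and the predicate names `bigram_log_prob/3` and `sentence_log_prob/2` are all assumptions made for illustration, and the real database files may be laid out differently. The sketch applies add-one (Laplace) smoothing at lookup time and sums log probabilities over adjacent word pairs, ignoring any sentence-start token.

```prolog
:- use_module(library(aggregate)).

% Hypothetical toy counts; the real unigram.pl and bigram.pl may differ.
unigram(the, 3).
unigram(book, 2).
unigram(fell, 1).

bigram(the, book, 2).
bigram(book, fell, 1).

% vocab_size(-V): number of distinct word types in the toy unigram table.
vocab_size(V) :-
    aggregate_all(count, unigram(_, _), V).

% Count lookups that default to 0 for unseen words or word pairs.
unigram_count(W, C) :- ( unigram(W, C0) -> C = C0 ; C = 0 ).
bigram_count(W1, W2, C) :- ( bigram(W1, W2, C0) -> C = C0 ; C = 0 ).

% bigram_log_prob(+W1, +W2, -LogP): log of the add-one smoothed P(W2 | W1),
% computed as (count(W1,W2) + 1) / (count(W1) + V).
bigram_log_prob(W1, W2, LogP) :-
    bigram_count(W1, W2, C12),
    unigram_count(W1, C1),
    vocab_size(V),
    LogP is log((C12 + 1) / (C1 + V)).

% sentence_log_prob(+Words, -LogP): sum of log probabilities of adjacent pairs.
sentence_log_prob([], 0.0).
sentence_log_prob([_], 0.0).
sentence_log_prob([W1, W2 | Rest], LogP) :-
    bigram_log_prob(W1, W2, LP),
    sentence_log_prob([W2 | Rest], LPRest),
    LogP is LP + LPRest.
```

With these toy counts, the query `sentence_log_prob([the, book, fell], A), sentence_log_prob([book, the, fell], B), A > B.` succeeds, because the scrambled order relies only on unseen bigrams. Working in log space turns the product of probabilities into a sum, which avoids numeric underflow on long sentences.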

Sample outputs: A sentence like "the book fell" receives a better score than "i fell on the book".

Similarly, "the book that he wanted fell on my feet" receives a better score than the scrambled "book the that he wanted fell on my feet".

2) Program - Roman Decimal Convertor

Problem: Prolog program to convert Roman numerals to decimal numbers and vice versa, for numbers up to 20.

Program: RomanDecimalConversion.pl
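One way such a conversion could be written is sketched below for SWI-Prolog. This is not the content of RomanDecimalConversion.pl; the predicate names `numeral/2`, `decimal_roman/2`, and `roman_decimal/2` are hypothetical. The sketch converts a decimal number greedily by always taking the largest numeral that still fits, and converts in the other direction by searching the range 1 to 20 for a number whose numeral matches.

```prolog
% numeral(?Value, ?Symbol): symbols needed for 1..20, largest value first.
numeral(10, 'X').
numeral(9,  'IX').
numeral(5,  'V').
numeral(4,  'IV').
numeral(1,  'I').

% decimal_roman(+N, ?Roman): Roman numeral (an atom) for an integer 1..20.
decimal_roman(N, Roman) :-
    integer(N),
    between(1, 20, N),
    to_roman(N, Parts),
    atomic_list_concat(Parts, Roman).

% to_roman(+N, -Parts): greedy decomposition into numeral symbols.
to_roman(0, []) :- !.
to_roman(N, [Sym | Rest]) :-
    numeral(Value, Sym),
    Value =< N, !,              % commit to the largest symbol that fits
    N1 is N - Value,
    to_roman(N1, Rest).

% roman_decimal(+Roman, -N): search 1..20 for the number with this numeral.
% Fails for malformed numerals such as 'IIII' or for values above 20.
roman_decimal(Roman, N) :-
    between(1, 20, N),
    decimal_roman(N, Roman), !.
```

For example, `decimal_roman(14, R)` gives `R = 'XIV'` and `roman_decimal('XIX', N)` gives `N = 19`. Reusing the forward conversion for the reverse direction is only reasonable because the range is tiny; a wider range would call for a direct parser.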

Sample outputs:

3) Project - Sentence Tagger

Problem: Identify all possible tag sequences for a given sentence, together with their probabilities.

Approach: The project uses the Viterbi algorithm to compute the possible tag lists, with their probabilities, for a given sentence. The implementation is in tagger.pl.
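As an illustration of the Viterbi step, here is a minimal SWI-Prolog sketch that returns only the single most probable tag sequence. It is not the code in tagger.pl: the toy model facts `start_p/2`, `trans_p/3`, and `emit_p/3`, their probabilities, and the predicate names are all invented for the example. Each state is a pair `LogP-RevPath`, where the head of `RevPath` is the current tag.

```prolog
:- use_module(library(lists)).
:- use_module(library(apply)).

% Hypothetical toy HMM: start, transition, and emission probabilities.
start_p(det, 0.6).  start_p(noun, 0.3).  start_p(verb, 0.1).

trans_p(det, noun, 0.9).  trans_p(det, verb, 0.05).  trans_p(det, det, 0.05).
trans_p(noun, verb, 0.7). trans_p(noun, noun, 0.2).  trans_p(noun, det, 0.1).
trans_p(verb, det, 0.5).  trans_p(verb, noun, 0.4).  trans_p(verb, verb, 0.1).

emit_p(det, the, 0.9).
emit_p(noun, book, 0.6).  emit_p(noun, fell, 0.05).
emit_p(verb, book, 0.1).  emit_p(verb, fell, 0.6).

% viterbi(+Words, -Tags, -LogP): best tag sequence and its log probability.
% Fails if some word cannot be emitted by any reachable tag.
viterbi([W | Ws], Tags, LogP) :-
    findall(LP-[T],
            ( start_p(T, PS), emit_p(T, W, PE), LP is log(PS) + log(PE) ),
            States0),
    foldl(viterbi_step, Ws, States0, States),
    max_member(LogP-RevTags, States),
    reverse(RevTags, Tags).

% viterbi_step(+Word, +States0, -States): for every tag that can emit Word,
% keep only the best-scoring path that ends in that tag.
viterbi_step(W, States0, States) :-
    findall(LP-[T | Path],
            ( emit_p(T, W, PE),
              best_predecessor(T, States0, LP0, Path),
              LP is LP0 + log(PE) ),
            States).

% best_predecessor(+Tag, +States0, -LogP, -Path): best previous path
% extended by the transition into Tag (standard order compares LogP first).
best_predecessor(T, States0, BestLP, BestPath) :-
    findall(LP-[Prev | Path],
            ( member(LP0-[Prev | Path], States0),
              trans_p(Prev, T, PT),
              LP is LP0 + log(PT) ),
            Candidates),
    max_member(BestLP-BestPath, Candidates).
```

With this toy model, `viterbi([the, book, fell], Tags, LogP)` yields `Tags = [det, noun, verb]`. Keeping one best path per tag at each word is what keeps the search linear in sentence length rather than exponential.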

Sample outputs:

4) Project - Word Similarity

Problem: Prolog project for computing the cosine similarity between two given words and for finding the words most similar to a given word.


Approach: The project uses cosine similarity to score how similar two words are. This is then extended to identify and rank all the words similar to a given word.
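The cosine computation and the ranking step can be sketched in SWI-Prolog as follows. The `word_vector/2` facts, their toy values, and the predicate names are assumptions for illustration; the project's actual vector representation is not described here. Vectors are dense lists of equal length, and the similarity is the dot product divided by the product of the vector norms.

```prolog
% Hypothetical toy co-occurrence vectors (dense, same dimension order).
word_vector(cat, [2, 1, 0]).
word_vector(dog, [1, 1, 0]).
word_vector(car, [0, 0, 3]).

% dot(+Xs, +Ys, -D): dot product of two equal-length lists.
dot([], [], 0).
dot([X | Xs], [Y | Ys], D) :-
    dot(Xs, Ys, D0),
    D is D0 + X * Y.

% norm(+V, -N): Euclidean length of a vector.
norm(V, N) :-
    dot(V, V, D),
    N is sqrt(D).

% cosine(+W1, +W2, -Sim): cosine similarity of two known words.
% Fails for unknown words or zero vectors.
cosine(W1, W2, Sim) :-
    word_vector(W1, V1),
    word_vector(W2, V2),
    dot(V1, V2, D),
    norm(V1, N1),
    norm(V2, N2),
    N1 > 0, N2 > 0,
    Sim is D / (N1 * N2).

% most_similar(+Word, -Ranked): all other words as Sim-Word pairs,
% highest similarity first.
most_similar(W, Ranked) :-
    findall(Sim-Other,
            ( word_vector(Other, _), Other \== W, cosine(W, Other, Sim) ),
            Pairs),
    sort(0, @>=, Pairs, Ranked).
```

With these toy vectors, `most_similar(cat, Ranked)` lists `dog` before `car`, since `cat` and `car` share no non-zero dimension.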

5) Project - Smart Refrigerator - A Natural language interface for fridge

Problem: Develop a natural language interface for a fridge. The interface should be capable of parsing English sentences, evaluating them against a data model (a mini database), and responding to the user's query appropriately.


Approach: The project is divided into three submodules: Parsing, ModelChecker, and Response.

The Parsing module builds a first-order logic formula for the tokenized input sentence by applying a lexicon and rules of English grammar. It uses an augmented version of a shift-reduce (SR) parser to parse the sentence.
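The general idea of a shift-reduce parser that builds a logical form can be shown with a heavily simplified SWI-Prolog sketch. It is not the project's parser: the lexicon `lex/2`, the reduction rules `rule/2`, and the predicate `sr_parse/2` are hypothetical, and noun phrases are treated as plain constants rather than quantified expressions, so the result is only an atomic formula. The stack holds the most recently shifted item first.

```prolog
% Hypothetical toy lexicon: lex(Word, Category).
lex(the, det).
lex(milk, n(milk)).
lex(fridge, n(fridge)).
lex(contains, tv(contain)).

% rule(+Stack0, -Stack): one reduction applied to the top of the stack.
% The top of the stack is the head of the list, so right-hand sides
% appear in reverse order.
rule([n(N), det | S],           [np(N) | S]).       % NP -> Det N
rule([np(O), tv(V) | S],        [vp(V, O) | S]).    % VP -> TV NP
rule([vp(V, O), np(Subj) | S],  [s(F) | S]) :-      % S  -> NP VP
    F =.. [V, Subj, O].

% sr_parse(+Words, -Formula): succeeds when the input reduces to one sentence.
sr_parse(Words, Formula) :-
    sr(Words, [], Formula).

sr([], [s(F)], F).
sr(Words, Stack, F) :-                  % reduce: rewrite the top of the stack
    rule(Stack, Stack1),
    sr(Words, Stack1, F).
sr([W | Ws], Stack, F) :-               % shift: push the next word's category
    lex(W, Cat),
    sr(Ws, [Cat | Stack], F).
```

For example, `sr_parse([the, fridge, contains, the, milk], F)` yields `F = contain(fridge, milk)`. Trying reductions before shifts, with Prolog backtracking as the fallback, keeps the control strategy short at the cost of exploring dead ends on ambiguous input.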

The ModelChecker module evaluates the output of the parser against the model data (a Prolog database for the fridge). It identifies whether the sentence is declarative, interrogative, or a content question.

The Response module prints the result of the ModelChecker according to the response type.


deep-mishra/CSE-567-Computational_Linguistic
