This repository involves writing two main programs:
- A program to parse the corpus and index it with Elasticsearch.
- A query processor, which runs queries from an input file using a selected retrieval model
Retrieval Models Implemented as a part of Task 2 :
- Okapi TF
- TF-IDF
- Okapi BM25
- Unigram LM with Laplace smoothing
- Unigram LM with Jelinek-Mercer smoothing