GitHub - Ansh2709/NLP-project: Project predict whether the two questions have similar meaning or not and thus can be very useful for question answering sites like Quora

Used Exploratory Data Analysis(EDA) techniques,used pipeline based on Data Ingestion->Text Preprocessing(used word tokenization,stop words removal,lower casing,tags removal,etc)->Feature Engineering(used BOW,but other WorldtoVec deep learning technique we can use here as well,tf-idf,n-grams,one-hot encoding also can be use here as well)->Data Modelling->Data Evaluation Used different supervised learning algorihms under machine learning used for classification i.e. random forest classifier,decision tree etc Used my self created features,that helps to achieve better accuracy Evaluated it using confusion matrix,accuracy,and f1-score One can use stemming and lemmatization under text preprocessing also can use other ML algo and can use more data for analysis for better outcomes

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
bow-with-basic-features.ipynb		bow-with-basic-features.ipynb
bow-with-preprocessing-and-advanced-features.ipynb		bow-with-preprocessing-and-advanced-features.ipynb
initial_EDA.ipynb		initial_EDA.ipynb
only-bow.ipynb		only-bow.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

About

Uh oh!

Releases

Packages

Languages

Ansh2709/NLP-project

Folders and files

Latest commit

History

Repository files navigation

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages