This project explores a dataset of 830 books related to data science from Amazon.
The analysis includes:
- Data cleaning & preprocessing
- Exploratory Data Analysis (EDA)
- Clustering to group similar books
The dataset contains 830 entries and the following features:
titleauthorpriceprice (including used books)pagesavg_reviewsn_reviewsstar5star4star3star2star1dimensionsweightlanguagepublisherISBN-13linkcomplete_link
- Python
- Pandas, NumPy
- Matplotlib, Seaborn
- Scikit-learn
Books were grouped into clusters