Skip to content

Exploratory data analysis and clustering of Amazon data science books to identify trends in pricing, ratings, and audience focus

Notifications You must be signed in to change notification settings

RuddyKay/Amazon-Book-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

6 Commits
Β 
Β 
Β 
Β 

Repository files navigation

Amazon-Book-Analysis πŸ“š

This project explores a dataset of 830 books related to data science from Amazon.

Overview πŸ“Š

The analysis includes:

  • Data cleaning & preprocessing
  • Exploratory Data Analysis (EDA)
  • Clustering to group similar books

Dataset πŸ“

The dataset contains 830 entries and the following features:

  • title
  • author
  • price
  • price (including used books)
  • pages
  • avg_reviews
  • n_reviews
  • star5
  • star4
  • star3
  • star2
  • star1
  • dimensions
  • weight
  • language
  • publisher
  • ISBN-13
  • link
  • complete_link

Tools & Libraries πŸ› οΈ

  • Python
  • Pandas, NumPy
  • Matplotlib, Seaborn
  • Scikit-learn

Results 🧠

Books were grouped into clusters

About

Exploratory data analysis and clustering of Amazon data science books to identify trends in pricing, ratings, and audience focus

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published