ML

🧠 Amazon product Review using Clustering Technique(unsupervised learning) and Preprocessed by NLTK

Objective

This project aimed to uncover hidden patterns in Amazon product reviews using unsupervised machine learning. By converting review texts into numerical vectors using TF-IDF, and applying KMeans clustering, the project grouped similar reviews and extracted top keywords per cluster. This enables understanding of major themes or topics customers frequently talk about.


Tasks

  1. Preprocessed text using NLTK: tokenization, stopword removal, etc.
  2. Converted review text into TF-IDF vectors.
  3. Applied KMeans Clustering to group similar reviews.
  4. Extracted and displayed top keywords for each cluster.
  5. Visualized cluster insights to interpret major review themes.

Skills Learned

  1. Text pre-processing (tokenization, cleaning)
  2. Vectorizing text with TF-IDF
  3. Applying KMeans for unsupervised learning
  4. Evaluating clusters using inertia and the Elbow method
  5. Extracting meaningful keywords from clusters
  6. Interpreting clustering output for real-world product insights

Tools Used

📈 Output

image

image

image