Saved searches

Use saved searches to filter your results more quickly

Cancel Create saved search Sign up Reseting focus

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Notifications You must be signed in to change notification settings

Kal-Lemma/Clothes-Recommendation-System

This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.

Go to file

Folders and files

Last commit message Last commit date

Latest commit

History

View all files

Repository files navigation

Clothes Recommendation System

DataSet

Exploratory Data Analysis

Screen Shot 2019-09-05 at 1.02.00 PM

  1. Dealing with highly sparse data originally, but primarily looking at at the main three factors for the recommendation system; usernames, item ids, and ratings.
  2. Removed 68 NaNs from our ratings column.
  3. Number of customers: 32,399 Number of products: 1,376 Number of transactions: 82,790

Recommendation System Plan

  1. The best choice was a Model-Based Collaborative Filtering method using Surprise.
  2. Memory-Based; Can Use multiple different similarity metrics to find out out which performs best: Pearson, Cosine, Jaccard.
  3. Model-Based; Using Singular Value Decomposition (SVD) to decrease the dimensions of our utility matrix and extract latent factors. SVD essentially turning our Recommendation problem into an Optimization one.
  4. Root Mean Square Error (RMSE) is our metric for performance.
  5. Using Model-Based (Matrix factorization) rather than Memory-Based collaborative filtering to make faster predictions with less data than the original.

Results

Screen Shot 2019-09-05 at 1.02.20 PM

Screen Shot 2019-09-05 at 1.02.34 PM

Future Steps