Advait Ramesh Iyer

I find unbiased and generalisable patterns, and effectively communicate insights to non-technical audiences.

Unsupervised Learning and Generalization Performance


Unsupervised Learning

Performed k-means clustering on a Spotify dataset that contains 13 features for each song: acousticness, danceability, duration_ms, energy, instrumentalness, key, liveness, loudness, mode, speechiness, tempo, time_signature, and valence.

Step 1: Computed the first 2 Principal Components (PCs).

A pipeline was fitted which first standardizes each metric to its Z-score, and then performs PCA. The factor loadings of each metric on the first 2 PCs are as follows:
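A minimal sketch of such a pipeline with scikit-learn is given here for illustration. The original data is not reproduced in this post, so `songs` is a random placeholder table with the 13 column names listed above. The sketch also assumes that the rows of `components_` are what is reported as loadings; some authors additionally scale them by the square root of the explained variance.

```python
import numpy as np
import pandas as pd
from sklearn.decomposition import PCA
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

FEATURES = [
    "acousticness", "danceability", "duration_ms", "energy",
    "instrumentalness", "key", "liveness", "loudness", "mode",
    "speechiness", "tempo", "time_signature", "valence",
]

# Placeholder data (assumption): random values standing in for the Spotify table.
rng = np.random.default_rng(0)
songs = pd.DataFrame(rng.normal(size=(200, len(FEATURES))), columns=FEATURES)

# Standardize every feature to zero mean and unit variance, then keep 2 PCs.
pipeline = make_pipeline(StandardScaler(), PCA(n_components=2))
scores = pipeline.fit_transform(songs)  # shape: (n_songs, 2)

# components_ has one row per PC; transpose so that each row is a feature.
loadings = pd.DataFrame(
    pipeline.named_steps["pca"].components_.T,
    index=FEATURES,
    columns=["PC1", "PC2"],
)
print(loadings.round(3))
```

Standardizing first matters here because features such as duration_ms and tempo are on much larger scales than acousticness or valence, and would otherwise dominate the components.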

Step 2: Visualized the songs w.r.t. the first 2 PCs.

Step 3: Created pandas dataframes of the top-3 metrics w.r.t. each PC.

For PC 1:

For PC 2:
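One way to build these per-PC tables is sketched below. It assumes that "top-3" means the three features with the largest absolute loading on a PC, which the post does not state. The data is again a random placeholder, and `top_metrics` is a hypothetical helper name.

```python
import numpy as np
import pandas as pd
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

FEATURES = [
    "acousticness", "danceability", "duration_ms", "energy",
    "instrumentalness", "key", "liveness", "loudness", "mode",
    "speechiness", "tempo", "time_signature", "valence",
]

# Placeholder data (assumption): random values, not the real Spotify dataset.
rng = np.random.default_rng(0)
X = StandardScaler().fit_transform(rng.normal(size=(200, len(FEATURES))))
pca = PCA(n_components=2).fit(X)
loadings = pd.DataFrame(pca.components_.T, index=FEATURES, columns=["PC1", "PC2"])


def top_metrics(loadings: pd.DataFrame, pc: str, n: int = 3) -> pd.DataFrame:
    """Return the n features with the largest absolute loading on one PC."""
    order = loadings[pc].abs().sort_values(ascending=False).index[:n]
    return loadings.loc[order, [pc]]  # signed loadings, ordered by magnitude


top_pc1 = top_metrics(loadings, "PC1")
top_pc2 = top_metrics(loadings, "PC2")
print(top_pc1)
print(top_pc2)
```

Ranking by absolute value keeps strongly negative loadings in view, since a feature can define a component just as much by pulling against it.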

Step 4: Ran k-means on the first 10 PCs with 5 clusters.
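This step could look roughly like the sketch below, which chains standardization, a 10-component PCA, and k-means with 5 clusters in a single scikit-learn pipeline. The data is a random placeholder, and `n_init=10` and `random_state=0` are assumptions chosen for reproducibility, not settings taken from the original analysis.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

FEATURES = [
    "acousticness", "danceability", "duration_ms", "energy",
    "instrumentalness", "key", "liveness", "loudness", "mode",
    "speechiness", "tempo", "time_signature", "valence",
]

# Placeholder data (assumption): random values standing in for the Spotify table.
rng = np.random.default_rng(0)
songs = pd.DataFrame(rng.normal(size=(200, len(FEATURES))), columns=FEATURES)

# Standardize, project onto the first 10 PCs, then cluster in that 10-D space.
pipeline = make_pipeline(
    StandardScaler(),
    PCA(n_components=10),
    KMeans(n_clusters=5, n_init=10, random_state=0),
)
labels = pipeline.fit_predict(songs)  # one cluster id (0..4) per song

# Cluster sizes, plus the centroids expressed in PC coordinates.
print(pd.Series(labels).value_counts().sort_index())
centroids = pipeline.named_steps["kmeans"].cluster_centers_
```

Keeping all three stages in one pipeline means new songs can be assigned to a cluster with `pipeline.predict`, using the scaling and projection learned during fitting.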

Generalization Performance
