Clustering models for machine learning

Clustering is a machine learning task that finds objects that resemble one another and groups them into clusters. What sets clustering apart from other machine learning approaches is that the grouping happens automatically, without labeled examples to learn from. In that sense, it's fair to say it's the opposite of supervised learning.

Regional topic: clustering models for a Nigerian audience's musical taste 🎧

Nigeria's diverse audience has diverse musical tastes. Using data scraped from Spotify (inspired by this article), let's look at some music popular in Nigeria. This dataset includes each song's 'danceability' score, 'acousticness', loudness, 'speechiness', popularity and energy. It will be interesting to discover patterns in this data!

A turntable

Photo by Marcela Laskoski on Unsplash

In this series of lessons, you will discover new ways to analyze data using clustering techniques. Clustering is particularly useful when your dataset lacks labels. If it does have labels, then classification techniques such as those you learned in previous lessons might be more useful. But in cases where you are looking to group unlabelled data, clustering is a great way to discover patterns.
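
To make the idea of grouping unlabelled data concrete, here is a minimal sketch of the K-Means idea in plain Python. It is an illustration only, not the code used in the lessons: the `songs` values are invented (danceability, energy) pairs rather than rows from the Nigerian Songs dataset, and the `k_means` helper and its naive initialization are assumptions made for this example.

```python
import math

# Hypothetical (danceability, energy) pairs, invented for illustration only.
# Note that there are no labels: the algorithm only sees the numbers.
songs = [
    (0.85, 0.80), (0.90, 0.75), (0.80, 0.90),
    (0.20, 0.30), (0.25, 0.20), (0.15, 0.25),
]


def k_means(points, k, iterations=10):
    """Group points into k clusters; returns (assignments, centroids)."""
    # Naive initialization (an assumption of this sketch): take k points
    # spread evenly through the list as the starting centroids.
    centroids = [points[i * len(points) // k] for i in range(k)]
    assignments = []
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        assignments = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:
                centroids[c] = tuple(
                    sum(dim) / len(members) for dim in zip(*members)
                )
    return assignments, centroids


labels, centers = k_means(songs, k=2)
print(labels)
```

With this toy data, the first three songs land in one cluster and the last three in another, even though no labels were ever provided. Real datasets are rarely this cleanly separated, and the choice of `k` and of the starting centroids can change the result.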

There are useful low-code tools that can help you learn about working with clustering models. Try Azure ML for this task.

Lessons

  1. Introduction to clustering
  2. K-Means clustering

Credits

These lessons were written with 🎶 by Jen Looper with helpful reviews by Rishit Dagli and Muhammad Sakib Khan Inan.

The Nigerian Songs dataset was sourced from Kaggle as scraped from Spotify.

Useful K-Means examples that aided in creating this lesson include this iris exploration, this introductory notebook, and this hypothetical NGO example.