Cluster Analysis

The ‘Cluster Analysis and Unsupervised Machine Learning in Python’ course will teach you data science techniques for pattern recognition, data mining, K-means clustering, hierarchical clustering, and KDE. The course will also teach how to apply Scipy’s Hierarchical Clustering library to data.

The course will help you to write a GMM in Python code and explains algorithmically how Hierarchical Agglomerative Clustering works. The course is usually available for INR 1,299 on Udemy but students can click on the link and get the ‘Cluster Analysis and Unsupervised Machine Learning in Python’ for INR 449.

Who all can opt for this course?

  • Data science and machine learning enthusiasts, both students and professionals
  • Who wish to learn more about cluster analysis and unsupervised machine learning
  • Those who are interested in writing their own clustering code
  • Professionals with an interest in automatically identifying patterns in large data sets

Course Highlights

Key HighlightsDetails
Registration LinkApply Now!
PriceINR 449 (INR 1,299) 72% off
Duration08 Hours
Rating4.6/5
Student Enrollment24,960 students
InstructorLazy Programmer Team https://www.linkedin.com/in/lazyprogrammerteam
Topics CoveredK-Means Clustering, Hierarchical Clustering, Gaussian Mixture Models
Course LevelIntermediate
Total Student Reviews4,700

Learning Outcomes

  • Recognize the standard K-Means algorithm
  • Recognize and list the drawbacks of K-Means clustering
  • The soft or fuzzy K-Means Clustering algorithm should be understood
  • Create code that uses soft k-means clustering
  • Be familiar with hierarchical clustering
  • Describe the algorithmic operation of hierarchical agglomerative clustering
  • Use the Hierarchical Clustering package in Scipy to analyse data
  • Know how to interpret a dendrogram
  • Recognize the various distance metrics that are applied to clustering
  • Know the distinctions between UPGMA, Ward linkage, full linkage, and single linkage
  • Knowing how to estimate density using the Gaussian mixture model
  • Codify a GMM in Python
  • When does GMM replace K-Means Clustering, the algorithm for expectation-maximization, in detail
  • Learn how GMM circumvents several K-Means limitations
  • Recognize the Singular Covariance issue and how to resolve it

Course Content

S.No.Module (Duration)Topics
1.Introduction to Unsupervised Learning (37 minutes)Introduction
Course Outline
What is unsupervised learning used for?
Why Use Clustering?
Where to get the code
How to Succeed in this Course
2.K-Means Clustering (02 hours 29 minutes)An Easy Introduction to K-Means Clustering
Hard K-Means: Exercise Prompt 1
Hard K-Means: Exercise 1 Solution
Hard K-Means: Exercise Prompt 2
Hard K-Means: Exercise 2 Solution
Hard K-Means: Exercise Prompt 3
Hard K-Means: Exercise 3 Solution
Hard K-Means Objective: Theory
Hard K-Means Objective: Code
Soft K-Means
The Soft K-Means Objective Function
Soft K-Means in Python Code
How to Pace Yourself
Visualizing Each Step of K-Means
Examples of where K-Means can fail
Disadvantages of K-Means Clustering
How to Evaluate a Clustering (Purity, Davies-Bouldin Index)
Using K-Means on Real Data: MNIST
One Way to Choose K
K-Means Application: Finding Clusters of Related Words
Clustering for NLP and Computer Vision: Real-World Applications
Suggestion Box
3.Hierarchical Clustering (43 minutes)Visual Walkthrough of Agglomerative Hierarchical Clustering
Agglomerative Clustering Options
Using Hierarchical Clustering in Python and Interpreting the Dendrogram
Application: Evolution
Application: Donald Trump vs. Hillary Clinton Tweets
4.Gaussian Mixture Models (GMMs) (01 hour 38 minutes)Gaussian Mixture Model (GMM) Algorithm
Write a Gaussian Mixture Model in Python Code
Practical Issues with GMM / Singular Covariance
Comparison between GMM and K-Means
Kernel Density Estimation
GMM vs Bayes Classifier (pt 1)
GMM vs Bayes Classifier (pt 2)
Expectation-Maximization (pt 1)
Expectation-Maximization (pt 2)
Expectation-Maximization (pt 3)
Future Unsupervised Learning Algorithms You Will Learn
5.Setting Up Your Environment (FAQ by Student Request) (37 minutes)Anaconda Environment Setup
How to install Numpy, Scipy, Matplotlib, Pandas, IPython, Theano, and TensorFlow
6.Extra Help With Python Coding for Beginners (FAQ by Student Request) (42 minutes)How to Code by Yourself (part 1)
How to Code by Yourself (part 2)
Proof that using Jupyter Notebook is the same as not using it
Python 2 vs Python 3
7.Effective Learning Strategies for Machine Learning (FAQ by Student Request) (59 minutes)How to Succeed in this Course (Long Version)
Is this for Beginners or Experts? Academic or Practical? Fast or slow-paced?
Machine Learning and AI Prerequisite Roadmap (pt 1)
Machine Learning and AI Prerequisite Roadmap (pt 2)
8.Appendix / FAQ Finale (08 minutes)What is the Appendix?
BONUS

Resources Required

  • Python and Numpy coding skills are required
  • Install Numpy and Scipy
  • Probability, matrix maths

Featured Review

Arnav Arya (5/5) : Well structured and pragmatic course for best practices in clustering analysis

Pros

  • Mathieu Fortier (5/5) : Focus is placed on pragmatism, and the coding examples are excellent in providing a deeper understanding of the concepts in an applied setting.
  • Quang Hoang (5/5) : I will try my best to advance my understanding by using what I’ve learned.
  • Haresh Patel (5/5) : This was an excellent introductory course to clustering data using python.
  • Thomas Chiang (5/5) : A very good course for students with a programming mindset, who want to understand the algos and program them from scratch.

Cons

  • Octavio Vite S. (2.5/5) : the part of how the show you the theory and maybe how to write the code is OK, but you not have almost nothing to practice, he does not take data from the real word to show you how to do it.
  • Carlos C. (2.5/5) : The course is quite fast
  • Vishal B. (2.5/5) : Other resources online have looked much better . Not much explanation behind each step
  • Sergey M. (2.5/5) : Other courses present the same concepts in clearer way

About the Author

The instructor of this course is Lazy Programmer Team who is a Artificial Intelligence and Machine Learning Engineer. With 4.7 Instructor Rating and 51,803 Reviews on Udemy, he/she offers 17 Courses and has taught 188,652 Students so far.

  • Although Instructor have also been recognised as a data scientist, big data engineer, and full stack software engineer, Instructor currently spend the majority of his/her time as an artificial intelligence and machine learning engineer with an emphasis on deep learning
  • Instructor earned the first master’s degree in computer engineering with a focus on machine learning and pattern recognition more than ten years ago
  • Instructor’s second master’s degree in statistics with a focus on financial engineering was awarded to the Instructor
  • Data scientist and big data engineer with experience in online advertising and digital media (optimising click and conversion rates) (building data processing pipelines)
  • Instructor routinely use big data technologies like Hadoop, Pig, Hive, MapReduce, and Spark
  • Instructor developed deep learning models for text modelling, image and signal processing, user behaviour prediction, and click-through rate estimation
  • In Instructor’s work with recommendation systems, I’ve used collaborative filtering and reinforcement learning, and we validated the findings using A/B testing
  • Instructor have instructed students at universities like Columbia University, NYU, Hunter College, and The New School in data science, statistics, machine learning, algorithms, calculus, computer graphics, and physics
  • Instructor web programming skills have helped numerous businesses
  • Instructor handle all of the server-side backend work, frontend HTML/JS/CSS work, and operations/deployment work

Comparison Table

ParametersCluster Analysis and Unsupervised Machine Learning in PythonData Science: Supervised Machine Learning in PythonUnsupervised Deep Learning in Python
OffersINR 449 (INR 1,299) 72% offINR 455 (INR 3,499) 87% offINR 455 (INR 3,499) 87% off
Duration8 hours6.5 hours10 hours
Rating4.7 /54.6 /54.8 /5
Student Enrollments24,95120,36520,183
InstructorsLazy Programmer TeamLazy Programmer TeamLazy Programmer Team
Register HereApply Now!Apply Now!Apply Now!

Leave feedback about this

  • Rating