Machine Learning
-
Assumptions of K-Means Clustering – Part 1
K-Means Clustering is a popular unsupervised machine learning algorithm that groups data into a predefined number of clusters based on similarity. However, like every algorithm, K-Means makes certain assumptions about the data and the structure of the problem. Understanding these assumptions is crucial to applying…
-
Beyond K-Means: Other Notions of Distance
K-Means clustering is a great starting point for understanding unsupervised machine learning. However, it comes with important limitations, especially when data is complex or doesn’t conform to its assumptions. In this article, we explore what lies beyond K-Means — alternative methods and concepts that better…
-
Making Sense of Unstructured Data: How to Evaluate Clustering?
Clustering is one of the most popular techniques for exploring and organizing unstructured data—like text, images, or customer behavior. But once you apply a clustering algorithm, how do you know if it actually worked well? In this post, we’ll break down the basics of clustering…
-
Making Sense of Unstructured Data: Graph Theory
Unstructured data is often described as any data that does not have a predefined structure or is not easily organized into rows and columns like structured data. Examples include text, images, audio, and social media posts. This kind of data is abundant in today’s digital…
-
Making Sense of Unstructured Data: Kullback-Leibler (KL) Divergence
In today’s data-driven world, most of the information we encounter is unstructured. From social media posts and emails to images and audio recordings, unstructured data surrounds us. Understanding and making sense of this data is crucial for businesses and researchers alike. One fundamental concept that…
-
Making Sense of Unstructured Data – Covariance
In the world of data science, much of the information we encounter is messy, unpredictable, and without a clear structure. This is known as unstructured data, and it includes things like text, images, videos, and audio. Making sense of unstructured data is crucial for extracting…
-
Making Sense Of Unstructure Data – Understanding Unsupervised Learning
In the world of data science and machine learning, algorithms are usually grouped into three main categories: In this post, we will focus on unsupervised learning. What is Unsupervised Learning? Unsupervised learning is a type of machine learning where computers learn from data that has…