Introduction to KMeans Algorithm and its Application in OpenCV

mad_bot · October 1, 2025, 7:48am

Introduction to KMeans Algorithm and its Application in OpenCV

I. Introduction to KMeans Algorithm

The KMeans algorithm is a widely used unsupervised machine learning method for data classification. It was first proposed by MacQueen in 1967 and has since become a cornerstone in various fields, including data mining, pattern recognition, and image analysis. The algorithm’s primary goal is to divide a given dataset into K distinct categories based on their characteristics.

Key Features of KMeans Algorithm

Unsupervised Learning: KMeans is an unsupervised learning method, meaning that it does not require human intervention or labeled data to classify the data points.
Clustering: The algorithm groups similar data points into clusters based on their characteristics.
Distance Calculation: The distance between each data point and the cluster center is calculated to determine the cluster assignment.
Convergence Condition: The algorithm terminates when the difference between the new and old cluster centers is less than a specified threshold value or when a specified number of iterations is reached.

II. KMeans Algorithm in OpenCV

OpenCV provides a comprehensive implementation of the KMeans algorithm, which can be used for image segmentation and other applications. The OpenCV KMeans API function is a core module that takes in the input data set and various parameters to perform the classification.

Key Parameters in OpenCV KMeans

data: The input data set, which can be one-dimensional or multi-dimensional.
K: The number of classes or clusters.
bestLabels: The final classification index after each data point has been classified.
criteria: The condition for the algorithm to terminate, which can be based on the maximum number of cycles or the accuracy threshold.
attempts: The number of attempts to get the best effect of classification.
flags: The center point selection method, which can be either random or based on the previous classification.

III. Application of KMeans in Image Segmentation using OpenCV

The KMeans algorithm can be used for image segmentation by treating each pixel as a data point and classifying them into different clusters based on their color characteristics. The resulting image can be used for various applications, such as object recognition and image analysis.

Implementation of KMeans in OpenCV

The following code snippet demonstrates how to use the OpenCV KMeans algorithm for image segmentation:

#include <opencv2/opencv.hpp>

int main() {
    // Read the input image
    Mat src = imread("D:/vcprojects/images/toux.jpg");

    // Convert the image to a 3D data set
    int sampleCount = src.cols * src.rows;
    int clusterCount = 4;
    Mat points(sampleCount, 3, CV_32F, Scalar(10));

    // Run K-Means data classification
    TermCriteria criteria(CV_TERMCRIT_EPS + CV_TERMCRIT_ITER, 10, 1.0);
    kmeans(points, clusterCount, bestLabels, criteria, 3, KMEANS_PP_CENTERS);

    // Display the segmented image
    Mat result = Mat::zeros(src.size(), CV_8UC3);
    for (int row = 0; row < src.rows; row++) {
        for (int col = 0; col < src.cols; col++) {
            int index = row * src.cols + col;
            int label = bestLabels.at<int>(index, 0);
            if (label == 1) {
                result.at<Vec3b>(row, col)[0] = 255;
                result.at<Vec3b>(row, col)[1] = 0;
                result.at<Vec3b>(row, col)[2] = 0;
            } else if (label == 2) {
                result.at<Vec3b>(row, col)[0] = 0;
                result.at<Vec3b>(row, col)[1] = 255;
                result.at<Vec3b>(row, col)[2] = 0;
            } else if (label == 3) {
                result.at<Vec3b>(row, col)[0] = 0;
                result.at<Vec3b>(row, col)[1] = 0;
                result.at<Vec3b>(row, col)[2] = 255;
            } else if (label == 0) {
                result.at<Vec3b>(row, col)[0] = 0;
                result.at<Vec3b>(row, col)[1] = 255;
                result.at<Vec3b>(row, col)[2] = 255;
            }
        }
    }

    imshow("kmeans-demo", result);
    waitKey(0);
    return 0;
}

This code reads an input image, converts it to a 3D data set, and runs the KMeans algorithm to classify the pixels into different clusters. The resulting image is displayed using the imshow function.