Excuse me. Some Terminology: Classification vs Clustering vs Regression

This is a short post to describe some terms used in data mining.

Classification assigns data to predefined classes/categories, using a model learned from a labeled dataset.
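
To make that concrete, here is a minimal sketch of classification using a 1-nearest-neighbour rule, which is just one of many possible classifiers. The data values, the class names "small" and "large", and the function name `classify` are all made up for illustration.

```python
# Labeled training data: (height_cm, weight_kg) -> class label.
# All values here are invented for the example.
training = [
    ((150.0, 50.0), "small"),
    ((155.0, 55.0), "small"),
    ((180.0, 85.0), "large"),
    ((185.0, 90.0), "large"),
]

def classify(point):
    """Return the label of the closest training example (1-nearest neighbour)."""
    def squared_distance(example):
        features, _label = example
        return sum((a - b) ** 2 for a, b in zip(features, point))

    _features, label = min(training, key=squared_distance)
    return label

print(classify((152.0, 52.0)))  # prints "small"
print(classify((182.0, 88.0)))  # prints "large"
```

The key point is that the labels are given up front; the classifier only decides which existing label a new point should get.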

Clustering separates an unlabeled dataset into clusters/groups of similar objects.
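
As a rough sketch of clustering, the snippet below runs a very basic k-means on one-dimensional numbers. k-means is only one clustering method, and the function name `kmeans_1d`, the starting centers, and the data are assumptions chosen for the example. Notice that no labels appear anywhere.

```python
def kmeans_1d(values, centers, iterations=10):
    """Group 1-D values around the given starting centers (basic k-means)."""
    for _ in range(iterations):
        # Assign each value to its nearest center.
        clusters = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # Move each center to the mean of its cluster (keep it if the cluster is empty).
        centers = [sum(c) / len(c) if c else centers[i] for i, c in enumerate(clusters)]
    return clusters

# Unlabeled data, invented for the example.
values = [1.0, 1.5, 2.0, 9.0, 9.5, 10.0]
clusters = kmeans_1d(values, centers=[0.0, 5.0])
print(clusters)  # prints [[1.0, 1.5, 2.0], [9.0, 9.5, 10.0]]
```

The groups come out of the data itself; nobody told the algorithm which numbers belong together.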

Regression develops a model to predict continuous numerical values. 
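
For regression, a minimal sketch is fitting a straight line with ordinary least squares and using it to predict a new value. Linear regression is just the simplest case; the function name `fit_line` and the data (which follow y = 2x + 1 exactly) are made up for illustration.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (
        sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
        / sum((x - mean_x) ** 2 for x in xs)
    )
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Training data with numerical outputs, invented for the example (y = 2x + 1).
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]

slope, intercept = fit_line(xs, ys)
prediction = slope * 5.0 + intercept
print(slope, intercept)  # prints 2.0 1.0
print(prediction)        # prints 11.0
```

Unlike classification, the output here is a number on a continuous scale, not one of a fixed set of categories.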

Classification is a supervised learning task, while clustering is an unsupervised one. Regression is also considered supervised learning because the model is trained using both the input features and the known outputs, which are numerical values rather than categories.

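Supervised means we rely on labeled training data, where each example comes with a known output. Unsupervised means the training data is unlabeled, so the algorithm has to find structure on its own.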

That's all for now from DC-DEN!
