fbpx
How to Choose the Right Estimator for Your Machine Learning Task How to Choose the Right Estimator for Your Machine Learning Task
Model training is an important component of a data science model development pipeline. Choosing the right machine learning algorithm can be... How to Choose the Right Estimator for Your Machine Learning Task

https://odsc.com/boston

Getting Started:

Machine learning tasks can be broadly classified into 3 broad categories: supervised, unsupervised, and reinforcement learning, depending on the presence of target class or business constraints.

(Source), Scikit-learn cheat-sheet to choose the right estimator

1. Classification:

The training data should at least have a few hundred of instances to train a robust model. For the classification tasks, the target class should be discrete.

machine learning algorithm

(Source), Scikit-learn cheat-sheet to choose the right estimator for classification tasks

2. Regression:

Scikit-learn package offers the implementation of various regression estimators for a dataset with continuous target class labels.

(Source), Scikit-learn cheat-sheet to choose the right estimator for regression tasks

3. Clustering:

For unsupervised tasks, the scikit-learn package offers various clustering packages.

machine learning algorithm

(Source), Scikit-learn cheat-sheet to choose the right estimator for clustering tasks

4. Dimension Reduction:

Dimension reduction techniques are used to reduce the dimensionality of the data, by projecting the existing data into a new space or dimension. Scikit-learn package offers various dimension reduction techniques discussed below.

machine learning algorithm

(Source), Scikit-learn cheat-sheet to choose the right estimator for dimension reduction tasks

Conclusion:

In this article, we have discussed a cheat sheet or set of rules that can be followed to choose the right estimator for your dataset. The above-discussed rules or machine learning algorithms are restricted to scikit-learn packages.

References:

[1] Scikit-learn documentation: https://scikit-learn.org/stable/tutorial/machine_learning_map/index.html

Article by Satyam Kumar

ODSC Community

The Open Data Science community is passionate and diverse, and we always welcome contributions from data science professionals! All of the articles under this profile are from our community, with individual authors mentioned in the text itself.

1