Skip to content

Categories

On this page

Machine Learning Zoo

Machine Learning Zoo

Function-based Categorization

Supervised Classification

  • Support Vector Machines (SVM)
  • Discriminant Analysis
  • Naive Bayes
  • Nearest Neighbors
  • Neural Network

Supervised Regression

Linear Models
  • Linear Regression
  • Logistic Regression
  • Ridge Regression
  • Lasso Regression
Tree Models
  • Decision Trees
  • Random Forests
  • Gradient Boosting Regression
  • Hierarchical Regression
  • XGBoost
Ensemble Models
  • Ensembles
Neural Networks Models
  • Neural Networks

Unsupervised Clustering

  • k-Means Clustering
  • DBSCAN
  • HDBSCAN
  • Agglomerative Hierarchical Clustering
  • OPTICS
  • Gaussian Mixture Models
  • Hidden Markov
  • Neural Networks

Unsupervised Association Rules

  • Apriori
  • FP-growth
  • FP-Max
  • Eclat
  • Hypergeometric Network

Unsupervised Dimensionality Reduction

  • PCA
  • t-distributed Stochastic Neighbor Embedding (t-SNE)
  • Uniform Manifold Approximation and Projection (UMAP)
  • ICA
  • Pairwise Controlled Manifold Approximation (PaCMAP)

Semi-supervised Classification

Semi-supervised Regression

Similarity-based Categorization

Dimensionality Reduction Algorithms

Seek and exploit the inherent structure in the data in an unsupervised manner in order to summarize or describe data using less information.

Tasks: dimensionality reduction

  • Principal Component Analysis (PCA)
  • Principal Component Regression (PCR)
  • Partial Least Squares Regression (PLSR)
  • Sammon Mapping
  • Multidimensional Scaling (MDS)
  • Projection Pursuit
  • Linear Discriminant Analysis (LDA)
  • Mixture Discriminant Analysis (MDA)
  • Quadratic Discriminant Analysis (QDA)
  • Flexible Discriminant Analysis (FDA)

Regression Algorithms

Modeling the relationship between variables that is iteratively refined using a measure of error in the predictions made by the model.

Tasks: regression, classification

  • Ordinary Least Squares Regression (OLSR)
  • Linear Regression
  • Logistic Regression
  • Stepwise Regression
  • Multivariate Adaptive Regression Splines (MARS)
  • Locally Estimated Scatterplot Smoothing (LOESS)

Regularization Algorithms

An extension made to another method (typically regression methods) that penalizes models based on their complexity, favoring simpler models that are also better at generalizing.

Tasks: regression

  • Ridge Regression
  • Least Absolute Shrinkage and Selection Operator (LASSO)
  • Elastic Net
  • Least-Angle Regression (LARS)

Decision Tree Algorithms

Decision tree methods construct a model of decisions made based on actual values of attributes in the data. Decisions fork in tree structures until a prediction decision is made for a given record.

Tasks: classification, regression

  • Classification and Regression Tree (CART)
  • Iterative Dichotomiser 3 (ID3)
  • C4.5 and C5.0 (different versions of a powerful approach)
  • Chi-squared Automatic Interaction Detection (CHAID)
  • Decision Stump
  • M5
  • Conditional Decision Trees

Bayesian Algorithms

Bayesian methods are those that explicitly apply Bayes’ Theorem, an approach to calculate conditional probability based on prior knowledge, and the naive assumption that each feature is independent to each other.

Tasks: regression, classification

  • Naive Bayes
  • Gaussian Naive Bayes
  • Multinomial Naive Bayes
  • Averaged One-Dependence Estimators (AODE)
  • Bayesian Belief Network (BBN)
  • Bayesian Network (BN)

Clustering Algorithms

Concerning with using the inherent structures in the data to best organize the data into groups of maximum commonality.

Tasks: clustering

  • k-Means
  • k-Medians
  • Expectation Maximisation (EM)
  • Hierarchical Clustering

Instance Based Algorithms

Decision problem with instances or examples of training data that are deemed important or required to the model by building up a database of example data and compare new data to the database using a similarity measure in order to find the best match and make a prediction.

Tasks: clustering, classification

  • k-Nearest Neighbor (kNN)
  • Learning Vector Quantization (LVQ)
  • Self-Organizing Map (SOM)
  • Locally Weighted Learning (LWL)
  • Support Vector Machines (SVM)

Association Rule Learning Algorithms

Association rule learning methods extract rules that best explain observed relationships between variables in data.

Tasks: association

  • Apriori algorithm
  • Eclat algorithm

Artificial Neural Network Algorithms

Models that are inspired by the structure and/or function of biological neural networks.

Tasks: classification, regression

  • Perceptron
  • Multilayer Perceptrons (MLP)
  • Back-Propagation
  • Stochastic Gradient Descent
  • Hopfield Network
  • Radial Basis Function Network (RBFN)

Deep Learning Algorithms

Modern update to Artificial Neural Networks by building much larger and more complex neural networks.

Tasks: classification, regression

  • Convolutional Neural Network (CNN)
  • Recurrent Neural Networks (RNNs)
  • Long Short-Term Memory Networks (LSTMs)
  • Stacked Auto-Encoders
  • Deep Boltzmann Machine (DBM)
  • Deep Belief Networks (DBN)

Ensemble Algorithms

Models composed of multiple weaker models that are independently trained and whose predictions are combined in some way to make the overall prediction.

Tasks: classification, regression

  • Boosting. To fit a sequence that is made out of weak learners to the weighted versions of the data, more weight is given to the examples that were misclassified in the earlier rounds.
  • Bootstrapped Aggregation (Bagging). Reduce the variance of an estimate is by the average, to average together the multiple estimates.
  • Stacking
  • AdaBoost
  • Weighted Average (Blending)
  • Stacked Generalization (Stacking)
  • Gradient Boosting Machines (GBM)
  • Gradient Boosted Regression Trees (GBRT)
  • Random Forest

Feature selection algorithms

Algorithm accuracy evaluation

Performance measures

Optimization algorithms

References

Images

MS-cheatsheet

selection
ml-cheatsheet

Edit this page
Last updated on 3/7/2023