Get Started with Statistics and Machine Learning Toolbox
Statistics and Machine Learning Toolbox™ provides functions and apps to describe, analyze, and model data. You can use descriptive statistics, visualizations, and clustering for exploratory data analysis, fit probability distributions to data, generate random numbers for Monte Carlo simulations, and perform hypothesis tests. Regression and classification algorithms let you draw inferences from data and build predictive models either interactively, using the Classification and Regression Learner apps, or programmatically, using AutoML.
For multidimensional data analysis and feature extraction, the toolbox provides principal component analysis (PCA), regularization, dimensionality reduction, and feature selection methods that let you identify variables with the best predictive power.
The toolbox provides supervised, semi-supervised and unsupervised machine learning algorithms, including support vector machines (SVMs), boosted decision trees, k-means, and other clustering methods. You can apply interpretability techniques such as partial dependence plots and LIME, and automatically generate C/C++ code for embedded deployment. Many toolbox algorithms can be used on data sets that are too big to be stored in memory.
Tutorials
- Exploratory Analysis of Data
Explore the distribution of data using visualizations and descriptive statistics.
- Hypothesis Testing with Two Samples
Use hypothesis testing to analyze gas prices measured across the state of Massachusetts during two separate months.
- Evaluate Optimal Number of Clusters
Identify the optimal number of clusters in a data set by using the
evalclusters
function. - Assess Regression Neural Network Performance
Use
fitrnet
to create a feedforward regression neural network model with fully connected layers, and assess the performance of the model on test data. - Train Decision Trees Using Classification Learner App
Create and compare classification trees, and export trained models to make predictions for new data.
About Machine Learning
- Machine Learning in MATLAB
Discover machine learning capabilities in MATLAB® for classification, regression, clustering, and deep learning, including apps for automated model training and code generation.
Interactive Learning
Statistics Onramp
Free one-hour online statistics course
Machine Learning Onramp
Free two-hour online machine learning course