fitrqnet

Train regression quantile neural network

Since R2024b

collapse all in page

Syntax

Mdl = fitrqnet(Tbl,ResponseVarName)

Mdl = fitrqnet(Tbl,formula)

Mdl = fitrqnet(Tbl,Y)

Mdl = fitrqnet(X,Y)

Mdl = fitrqnet(___,Name=Value)

[Mdl,AggregateOptimizationResults] = fitrqnet(___)

Description

Mdl = fitrqnet(Tbl,ResponseVarName) returns a trained regression quantile neural network model Mdl. The function trains the model using the predictors in the table Tbl and the response values in the ResponseVarName table variable.

By default, the function uses the median (0.5 quantile).

Mdl = fitrqnet(Tbl,formula) returns a quantile neural network model trained using the sample data in the table Tbl. The input argument formula is an explanatory model of the response and a subset of the predictor variables in Tbl used to fit Mdl.

Mdl = fitrqnet(Tbl,Y) returns a quantile neural network model trained using the predictor variables in the table Tbl and the response values in the vector Y.

Mdl = fitrqnet(X,Y) returns a quantile neural network model trained using the predictors in the matrix X and the response values in the vector Y.

Mdl = fitrqnet(___,Name=Value) specifies options using one or more name-value arguments in addition to any of the input argument combinations in previous syntaxes. For example, you can specify the quantiles by using the Quantiles name-value argument.

example

[Mdl,AggregateOptimizationResults] = fitrqnet(___) also returns AggregateOptimizationResults, which contains hyperparameter optimization results when you specify the OptimizeHyperparameters and HyperparameterOptimizationOptions name-value arguments. You must also specify the ConstraintType and ConstraintBounds options of HyperparameterOptimizationOptions. You can use this syntax to optimize on the compact model size instead of the cross-validation loss, and to solve a set of multiple optimization problems that have the same options but different constraint bounds. (since R2025a)

Examples

collapse all

Fit Quantile Neural Network Regression Model

Open Live Script

Fit a quantile neural network regression model using the 0.25, 0.50, and 0.75 quantiles.

Load the carbig data set, which contains measurements of cars made in the 1970s and early 1980s. Create a matrix X containing the predictor variables Acceleration, Displacement, Horsepower, and Weight. Store the response variable MPG in the variable Y.

load carbig
X = [Acceleration,Displacement,Horsepower,Weight];
Y = MPG;

Delete rows of X and Y where either array has missing values.

R = rmmissing([X Y]);
X = R(:,1:end-1);
Y = R(:,end);

Partition the data into training data (XTrain and YTrain) and test data (XTest and YTest). Reserve approximately 20% of the observations for testing, and use the rest of the observations for training.

rng(0,"twister") % For reproducibility of the partition
c = cvpartition(length(Y),"Holdout",0.20);
 
trainingIdx = training(c);
XTrain = X(trainingIdx,:);
YTrain = Y(trainingIdx);
 
testIdx = test(c);
XTest = X(testIdx,:);
YTest = Y(testIdx);

Train a quantile neural network regression model. Specify to use the 0.25, 0.50, and 0.75 quantiles (that is, the lower quartile, median, and upper quartile). To improve the model fit, standardize the numeric predictors. Use a ridge (L2) regularization term of 0.05. Adding a regularization term can help prevent quantile crossing.

Mdl = fitrqnet(XTrain,YTrain,Quantiles=[0.25,0.50,0.75], ...
    Standardize=true,Lambda=0.05)

Mdl = 
  RegressionQuantileNeuralNetwork
             ResponseName: 'Y'
    CategoricalPredictors: []
               LayerSizes: 10
              Activations: 'relu'
    OutputLayerActivation: 'none'
                Quantiles: [0.2500 0.5000 0.7500]


  Properties, Methods

Mdl is a RegressionQuantileNeuralNetwork model object. You can use dot notation to access the properties of Mdl. For example, Mdl.LayerWeights and Mdl.LayerBiases contain the weights and biases, respectively, for the fully connected layers of the trained model.

In this example, you can use the layer weights, layer biases, predictor means, and predictor standard deviations directly to predict the test set responses for each of the three quantiles in Mdl.Quantiles. In general, you can use the predict object function to make quantile predictions.

firstFCStep = (Mdl.LayerWeights{1})*((XTest-Mdl.Mu)./Mdl.Sigma)' ...
    + Mdl.LayerBiases{1};
reluStep = max(firstFCStep,0);
finalFCStep = (Mdl.LayerWeights{end})*reluStep + Mdl.LayerBiases{end};
predictedY = finalFCStep'

predictedY = 78×3

   13.9602   15.1340   16.6884
   11.2792   12.2332   13.4849
   19.5525   21.7303   23.9473
   22.6950   25.5260   28.1201
   10.4533   11.3377   12.4984
   17.6935   19.5194   21.5152
   12.4312   13.4797   14.8614
   11.7998   12.7963   14.1071
   16.6860   18.3305   20.2070
   24.1142   27.0301   29.7811
   22.2832   25.1327   27.6841
   12.8749   13.9594   15.3917
   12.2328   13.2643   14.6245
   24.0164   26.9150   29.6545
   13.4641   14.5970   16.0957
      ⋮

isequal(predictedY,predict(Mdl,XTest))

ans = logical
   1

Each column of predictedY corresponds to a separate quantile (0.25, 0.5, or 0.75).

Visualize the predictions of the quantile neural network regression model. First, create a grid of predictor values.

minX = floor(min(X))

minX = 1×4

           8          68          46        1613

maxX = ceil(max(X))

maxX = 1×4

          25         455         230        5140

gridX = zeros(100,size(X,2));
for p = 1:size(X,2)
    gridp = linspace(minX(p),maxX(p))';
    gridX(:,p) = gridp;
end

Next, use the trained model Mdl to predict the response values for the grid of predictor values.

gridY = predict(Mdl,gridX)

gridY = 100×3

   31.2419   35.0661   38.6357
   30.8637   34.6317   38.1573
   30.4854   34.1972   37.6789
   30.1072   33.7627   37.2005
   29.7290   33.3283   36.7221
   29.3507   32.8938   36.2436
   28.9725   32.4593   35.7652
   28.5943   32.0249   35.2868
   28.2160   31.5904   34.8084
   27.8378   31.1560   34.3300
   27.4596   30.7215   33.8516
   27.0814   30.2870   33.3732
   26.7031   29.8526   32.8948
   26.3249   29.4181   32.4164
   25.9467   28.9837   31.9380
      ⋮

For each observation in gridX, the predict object function returns predictions for the quantiles in Mdl.Quantiles.

View the gridY predictions for the second predictor (Displacement). Compare the quantile predictions to the true test data values.

predictorIdx = 2;
plot(XTest(:,predictorIdx),YTest,".")
hold on
plot(gridX(:,predictorIdx),gridY(:,1))
plot(gridX(:,predictorIdx),gridY(:,2))
plot(gridX(:,predictorIdx),gridY(:,3))
hold off
xlabel("Predictor (Displacement)")
ylabel("Response (MPG)")
legend(["True values","0.25 predicted values", ...
    "0.50 predicted values","0.75 predicted values"])
title("Test Data")

Figure contains an axes object. The axes object with title Test Data, xlabel Predictor (Displacement), ylabel Response (MPG) contains 4 objects of type line. One or more of the lines displays its values using only markers These objects represent True values, 0.25 predicted values, 0.50 predicted values, 0.75 predicted values.

The red curve shows the predictions for the 0.25 quantile, the yellow curve shows the predictions for the 0.50 quantile, and the purple curve shows the predictions for the 0.75 quantile. The blue points indicate the true test data values.

Notice that the quantile prediction curves do not cross each other.

Prevent Quantile Crossing Using Regularization

Open Live Script

When training a quantile neural network regression model, you can use a ridge (L2) regularization term to prevent quantile crossing.

Load the carbig data set, which contains measurements of cars made in the 1970s and early 1980s. Create a table containing the predictor variables Acceleration, Cylinders, Displacement, and so on, as well as the response variable MPG.

load carbig
cars = table(Acceleration,Cylinders,Displacement, ...
    Horsepower,Model_Year,Origin,Weight,MPG);

Remove rows of cars where the table has missing values.

cars = rmmissing(cars);

Categorize the cars based on whether they were made in the USA.

cars.Origin = categorical(cellstr(cars.Origin));
cars.Origin = mergecats(cars.Origin,["France","Japan",...
    "Germany","Sweden","Italy","England"],"NotUSA");

Partition the data into training and test sets using cvpartition. Use approximately 80% of the observations as training data, and 20% of the observations as test data.

rng(0,"twister") % For reproducibility of the data partition
c = cvpartition(height(cars),"Holdout",0.20);

trainingIdx = training(c);
carsTrain = cars(trainingIdx,:);

testIdx = test(c);
carsTest = cars(testIdx,:);

Train a quantile neural network regression model. Use the 0.25, 0.50, and 0.75 quantiles (that is, the lower quartile, median, and upper quartile). To improve the model fit, standardize the numeric predictors before training.

Mdl = fitrqnet(carsTrain,"MPG",Quantiles=[0.25 0.5 0.75], ...
    Standardize=true);

Mdl is a RegressionNeuralNetwork model object.

Determine if the test data predictions for the quantiles in Mdl.Quantiles cross each other by using the predict object function of Mdl. The crossingIndicator output argument contains a value of 1 (true) for any observation with quantile predictions that cross.

[~,crossingIndicator] = predict(Mdl,carsTest);
sum(crossingIndicator)

ans = 
2

In this example, two of the observations in carsTest have quantile predictions that cross each other.

To prevent quantile crossing, specify the Lambda name-value argument in the call to fitrqnet. Use a 0.05 ridge (L2) penalty term.

newMdl = fitrqnet(carsTrain,"MPG",Quantiles=[0.25 0.5 0.75], ...
    Standardize=true,Lambda=0.05);
[predictedY,newCrossingIndicator] = predict(newMdl,carsTest);
sum(newCrossingIndicator)

ans = 
0

With regularization, the predictions for the test data set do not cross for any observations.

Visualize the predictions returned by newMdl by using a scatter plot with a reference line. Plot the predicted values along the vertical axis and the true response values along the horizontal axis. Points on the reference line indicate correct predictions.

plot(carsTest.MPG,predictedY(:,1),".")
hold on
plot(carsTest.MPG,predictedY(:,2),".")
plot(carsTest.MPG,predictedY(:,3),".")
plot(carsTest.MPG,carsTest.MPG)
hold off
xlabel("True MPG")
ylabel("Predicted MPG")
legend(["0.25 quantile values","0.50 quantile values", ...
    "0.75 quantile values","Reference line"], ...
    Location="southeast")
title("Test Data")

Figure contains an axes object. The axes object with title Test Data, xlabel True MPG, ylabel Predicted MPG contains 4 objects of type line. One or more of the lines displays its values using only markers These objects represent 0.25 quantile values, 0.50 quantile values, 0.75 quantile values, Reference line.

Blue points correspond to the 0.25 quantile, red points correspond to the 0.50 quantile, and yellow points correspond to the 0.75 quantile.

For a more in-depth example, see Regularize Quantile Regression Model to Prevent Quantile Crossing.

Optimize Hyperparameters of Quantile Regression Model

Open Live Script

Optimize the hyperparameters of a quantile neural network regression model. Compare the test set losses of the model before and after hyperparameter optimization.

load carbig
cars = table(Acceleration,Cylinders,Displacement, ...
    Horsepower,Model_Year,Origin,Weight,MPG);
head(cars)

    Acceleration    Cylinders    Displacement    Horsepower    Model_Year    Origin     Weight    MPG
    ____________    _________    ____________    __________    __________    _______    ______    ___

          12            8            307            130            70        USA         3504     18 
        11.5            8            350            165            70        USA         3693     15 
          11            8            318            150            70        USA         3436     18 
          12            8            304            150            70        USA         3433     16 
        10.5            8            302            140            70        USA         3449     17 
          10            8            429            198            70        USA         4341     15 
           9            8            454            220            70        USA         4354     14 
         8.5            8            440            215            70        USA         4312     14

Remove rows of cars where the table has missing values.

cars = rmmissing(cars);

Categorize the cars based on whether they were made in the USA.

cars.Origin = categorical(cellstr(cars.Origin));
cars.Origin = mergecats(cars.Origin,["France","Japan",...
    "Germany","Sweden","Italy","England"],"NotUSA");

Partition the data into training and test sets using cvpartition. Use approximately 80% of the observations as training data, and 20% of the observations as test data.

rng(0,"twister") % For reproducibility of the data partition
c = cvpartition(height(cars),Holdout=0.20);

trainingIdx = training(c);
carsTrain = cars(trainingIdx,:);

testIdx = test(c);
carsTest = cars(testIdx,:);

Train a quantile neural network regression model using the carsTrain training data. Specify MPG as the response variable, and use the 0.05 and 0.95 quantiles. Then, compute the quantile losses using the carsTest test data.

Mdl = fitrqnet(carsTrain,"MPG",Quantiles=[0.05 0.95]);
L = loss(Mdl,carsTest)

L = 1×2

    0.3445    0.4586

L(1) is the quantile loss for the 0.05 quantile, and L(2) is the quantile loss for the 0.95 quantile.

Optimize the hyperparameters of the quantile regression model using Bayesian optimization. Set OptimizeHyperparameters to "auto", which is equivalent to ["Activations","Lambda","LayerSizes","Standardize"]. By default, fitrqnet searches for the hyperparameter values that optimize the 5-fold cross-validation quantile loss, averaged across the quantiles.

optMdl = fitrqnet(carsTrain,"MPG",Quantiles=[0.05 0.95], ...
    OptimizeHyperparameters="auto");

|============================================================================================================================================|
| Iter | Eval   | Objective:  | Objective   | BestSoFar   | BestSoFar   |  Activations |  Standardize |       Lambda |            LayerSizes |
|      | result | log(1+loss) | runtime     | (observed)  | (estim.)    |              |              |              |                       |
|============================================================================================================================================|
|    1 | Best   |      0.4089 |       21.61 |      0.4089 |      0.4089 |         tanh |         true |    0.0037773 | [233  34   3]         |
|    2 | Accept |      2.5474 |     0.31115 |      0.4089 |     0.55147 |         relu |         true |       297.54 |  5                    |
|    3 | Accept |      2.2706 |     0.12982 |      0.4089 |     0.48221 |         tanh |        false |      0.14648 |  1                    |
|    4 | Accept |     0.53488 |      0.4803 |      0.4089 |     0.41003 |         tanh |        false |   1.3334e-05 |  250                  |
|    5 | Accept |      2.1998 |     0.95553 |      0.4089 |     0.40903 |         tanh |        false |       9.2858 |  153                  |
|    6 | Accept |     0.53488 |    0.094073 |      0.4089 |     0.40903 |         tanh |        false |   1.5158e-07 |  1                    |
|    7 | Accept |     0.53488 |    0.094714 |      0.4089 |     0.40902 |         tanh |        false |   1.7771e-06 |  1                    |
|    8 | Best   |     0.28982 |      1.4108 |     0.28982 |     0.28993 |         relu |         true |   3.2836e-08 |  4                    |
|    9 | Accept |     0.34727 |      7.0663 |     0.28982 |     0.28987 |         relu |         true |   1.3732e-05 |  122                  |
|   10 | Accept |     0.53488 |     0.13855 |     0.28982 |     0.28986 |         tanh |        false |   5.2712e-08 | [  1   1]             |
|   11 | Accept |     0.53488 |     0.19481 |     0.28982 |     0.28986 |         tanh |        false |   3.1968e-08 | [  1   2]             |
|   12 | Accept |     0.53488 |     0.20717 |     0.28982 |     0.28985 |         tanh |        false |   3.4236e-07 | [  1   1   2]         |
|   13 | Accept |     0.49989 |     0.19906 |     0.28982 |     0.28984 |         tanh |         true |   2.5481e-07 |  1                    |
|   14 | Accept |      2.5473 |    0.049643 |     0.28982 |     0.28986 |         tanh |         true |       152.93 |  1                    |
|   15 | Accept |     0.36331 |     0.45542 |     0.28982 |     0.28985 |         tanh |         true |   8.2018e-05 |  1                    |
|   16 | Accept |     0.50296 |     0.14815 |     0.28982 |     0.28985 |         relu |        false |   1.8652e-07 |  1                    |
|   17 | Accept |     0.48884 |     0.25371 |     0.28982 |     0.28985 |         relu |        false |   0.00039991 |  1                    |
|   18 | Accept |     0.81581 |     0.12767 |     0.28982 |     0.28985 |         relu |        false |       8.7563 |  1                    |
|   19 | Best   |     0.27665 |     0.39017 |     0.27665 |     0.27672 |      sigmoid |         true |   3.8589e-08 |  1                    |
|   20 | Accept |     0.28825 |     0.31519 |     0.27665 |     0.27669 |      sigmoid |         true |   7.5396e-05 |  1                    |
|============================================================================================================================================|
| Iter | Eval   | Objective:  | Objective   | BestSoFar   | BestSoFar   |  Activations |  Standardize |       Lambda |            LayerSizes |
|      | result | log(1+loss) | runtime     | (observed)  | (estim.)    |              |              |              |                       |
|============================================================================================================================================|
|   21 | Accept |        2.52 |    0.051048 |     0.27665 |     0.27665 |      sigmoid |         true |      0.82956 |  1                    |
|   22 | Accept |     0.32961 |      7.1915 |     0.27665 |     0.27685 |      sigmoid |         true |   2.3332e-06 | [  6  10  19]         |
|   23 | Accept |     0.53488 |     0.08158 |     0.27665 |     0.27683 |      sigmoid |        false |   4.8229e-08 |  1                    |
|   24 | Accept |     0.53867 |    0.094137 |     0.27665 |     0.27683 |      sigmoid |        false |   5.6871e-05 |  1                    |
|   25 | Accept |      2.4073 |     0.06866 |     0.27665 |     0.27684 |      sigmoid |        false |      0.27094 |  1                    |
|   26 | Accept |     0.53488 |    0.081567 |     0.27665 |     0.27683 |      sigmoid |        false |   1.9914e-06 |  1                    |
|   27 | Accept |     0.38154 |     0.21309 |     0.27665 |     0.27682 |         none |        false |   4.1041e-08 |  1                    |
|   28 | Accept |     0.49266 |     0.30354 |     0.27665 |     0.27682 |         none |        false |   6.4929e-05 |  1                    |
|   29 | Accept |     0.84101 |     0.47534 |     0.27665 |     0.27682 |         none |        false |       8.9658 | [  2  48  44]         |
|   30 | Accept |     0.31596 |     0.31732 |     0.27665 |     0.27681 |         none |         true |   4.4715e-08 |  1                    |

__________________________________________________________
Optimization completed.
MaxObjectiveEvaluations of 30 reached.
Total function evaluations: 30
Total elapsed time: 57.465 seconds
Total objective function evaluation time: 43.5104

Best observed feasible point:
    Activations    Standardize      Lambda      LayerSizes
    ___________    ___________    __________    __________

      sigmoid         true        3.8589e-08        1     

Observed objective function value = 0.27665
Estimated objective function value = 0.27681
Function evaluation time = 0.39017

Best estimated feasible point (according to models):
    Activations    Standardize      Lambda      LayerSizes
    ___________    ___________    __________    __________

      sigmoid         true        3.8589e-08        1     

Estimated objective function value = 0.27681
Estimated function evaluation time = 0.39025

Figure contains an axes object. The axes object with title Min objective vs. Number of function evaluations, xlabel Function evaluations, ylabel Min objective contains 2 objects of type line. These objects represent Min observed objective, Estimated min objective.

optMdl is a QuantileRegressionNeuralNetwork model whose hyperparameter values match those of the best estimated feasible point. You can find the optimization results in the HyperparameterOptimizationResults property of the model.

Compare the model parameters in optMdl to the hyperparameters returned by the bestPoint function, and verify that the values match.

optMdlParams = table(string(optMdl.ModelParameters.Activations), ...
    optMdl.ModelParameters.StandardizeData, ...
    optMdl.ModelParameters.Lambda, ...
    optMdl.ModelParameters.LayerSizes, ...
    VariableNames=["Activations","Standardize","Lambda","LayerSizes"])

optMdlParams=1×4 table
    Activations    Standardize      Lambda      LayerSizes
    ___________    ___________    __________    __________

     "sigmoid"        true        3.8589e-08        1

bestPointParams = bestPoint(optMdl.HyperparameterOptimizationResults)

bestPointParams=1×7 table
    NumLayers    Activations    Standardize      Lambda      Layer_1_Size    Layer_2_Size    Layer_3_Size
    _________    ___________    ___________    __________    ____________    ____________    ____________

        1          sigmoid         true        3.8589e-08         1              NaN             NaN

Compute the quantile losses for the optimized model using the carsTest test data. Compare the results with the Mdl quantile losses computed earlier.

optL = loss(optMdl,carsTest)

optL = 1×2

    0.2694    0.3104

L = 1×2

    0.3445    0.4586

The quantile losses for the optimized model are smaller than the quantile losses for the original model, indicating a better fit.

Input Arguments

collapse all

`Tbl` — Sample data
table

Sample data used to train the model, specified as a table. Each row of Tbl corresponds to one observation, and each column corresponds to one predictor variable. Optionally, Tbl can contain one additional column for the response variable. Multicolumn variables and cell arrays other than cell arrays of character vectors are not allowed.

If Tbl contains the response variable, and you want to use all remaining variables in Tbl as predictors, then specify the response variable by using ResponseVarName.
If Tbl contains the response variable, and you want to use only a subset of the remaining variables in Tbl as predictors, then specify a formula by using formula.
If Tbl does not contain the response variable, then specify a response variable by using Y. The length of the response variable and the number of rows in Tbl must be equal.

`ResponseVarName` — Response variable name
name of variable in `Tbl`

Response variable name, specified as the name of a variable in Tbl. The response variable must be a numeric vector.

You must specify ResponseVarName as a character vector or string scalar. For example, if Tbl stores the response variable Y as Tbl.Y, then specify it as "Y". Otherwise, the software treats all columns of Tbl, including Y, as predictors when training the model.

Data Types: char | string

`formula` — Explanatory model of response variable and subset of predictor variables
character vector | string scalar

Explanatory model of the response variable and a subset of the predictor variables, specified as a character vector or string scalar in the form "Y~x1+x2+x3". In this form, Y represents the response variable, and x1, x2, and x3 represent the predictor variables.

To specify a subset of variables in Tbl as predictors for training the model, use a formula. If you specify a formula, then the software does not use any variables in Tbl that do not appear in formula.

The variable names in the formula must be both variable names in Tbl (Tbl.Properties.VariableNames) and valid MATLAB^® identifiers. You can verify the variable names in Tbl by using the isvarname function. If the variable names are not valid, then you can convert them by using the matlab.lang.makeValidName function.

Data Types: char | string

`Y` — Response data
numeric vector

Response data, specified as a numeric vector. The length of Y must be equal to the number of observations in X or Tbl.

Data Types: single | double

`X` — Predictor data
numeric matrix

Predictor data used to train the model, specified as a numeric matrix.

By default, the software treats each row of X as one observation, and each column as one predictor.

The length of Y and the number of observations in X must be equal.

To specify the names of the predictors in the order of their appearance in X, use the PredictorNames name-value argument.

Note

If you orient your predictor matrix so that observations correspond to columns and specify ObservationsIn="columns", then you might experience a significant reduction in computation time.

Data Types: single | double

Note

The software treats NaN, empty character vector (''), empty string (""), <missing>, and <undefined> elements as missing values, and removes observations with any of these characteristics:

Missing value in the response (for example, Y or ValidationData{2})
At least one missing value in a predictor observation (for example, a row in X or ValidationData{1})
NaN value or 0 weight (for example, a value in Weights or ValidationData{3})

Name-Value Arguments

expand all

Specify optional pairs of arguments as Name1=Value1,...,NameN=ValueN, where Name is the argument name and Value is the corresponding value. Name-value arguments must appear after other arguments, but the order of the pairs does not matter.

Example: fitrqnet(Tbl,"MPG",Quantiles=[0.25 0.5 0.75],Standardize=true) specifies to use the 0.25, 0.5, and 0.75 quantiles and to standardize the data before training.

Neural Network Options

expand all

`Quantiles` — Quantiles
`0.5` (default) | vector of values in the range [0,1]

Quantiles to use for training Mdl, specified as a vector of values in the range [0,1]. The function trains a model that separates the bottom 100*q percent of training responses from the top 100*(1 – q) percent of training responses for each quantile q.

Example: Quantiles=[0.25 0.5 0.75]

Data Types: single | double

`LayerSizes` — Sizes of fully connected layers
`10` (default) | positive integer vector

Sizes of the fully connected layers in the quantile neural network regression model, specified as a positive integer vector. Element i of LayerSizes is the number of outputs in the fully connected layer i of the neural network model.

LayerSizes does not include the size of the final fully connected layer. For more information, see Quantile Neural Network Structure.

Example: LayerSizes=[100 25 10]

Data Types: single | double

`Activations` — Activation functions for fully connected layers
`"relu"` (default) | `"tanh"` | `"sigmoid"` | `"none"` | string array | cell array of character vectors

Activation functions for the fully connected layers of the quantile neural network regression model, specified as a character vector, string scalar, string array, or cell array of character vectors with values from this table.

Value	Description
`"relu"`	Rectified linear unit (ReLU) function — Performs a threshold operation on each element of the input, where any value less than zero is set to zero, that is, $f (x) = {\begin{matrix} x, & x \geq 0 \\ 0, & x < 0 \end{matrix}$
`"tanh"`	Hyperbolic tangent (tanh) function — Applies the `tanh` function to each input element
`"sigmoid"`	Sigmoid function — Performs the following operation on each input element: $f (x) = \frac{1}{1 + e^{- x}}$
`"none"`	Identity function — Returns each input element without performing any transformation, that is, f(x) = x

If you specify one activation function only, then Activations is the activation function for every fully connected layer of the neural network model, excluding the final fully connected layer (see Quantile Neural Network Structure).
If you specify an array of activation functions, then element i of Activations is the activation function for layer i of the neural network model.

Example: Activations="sigmoid"

Data Types: char | string | cell

`LayerWeightsInitializer` — Function to initialize fully connected layer weights
`"glorot"` (default) | `"he"`

Function to initialize the fully connected layer weights, specified as "glorot" or "he".

Value	Description
`"glorot"`	Initialize the weights with the Glorot initializer [1] (also known as the Xavier initializer). For each layer, the Glorot initializer independently samples from a uniform distribution with zero mean and variance `2/(I+O)`, where `I` is the input size and `O` is the output size for the layer.
`"he"`	Initialize the weights with the He initializer [2]. For each layer, the He initializer samples from a normal distribution with zero mean and variance `2/I`, where `I` is the input size for the layer.

Example: LayerWeightsInitializer="he"

Data Types: char | string

`LayerBiasesInitializer` — Type of initial fully connected layer biases
`"zeros"` (default) | `"ones"`

Type of initial fully connected layer biases, specified as "zeros" or "ones".

If you specify the value "zeros", then each fully connected layer has an initial bias of 0.
If you specify the value "ones", then each fully connected layer has an initial bias of 1.

Example: LayerBiasesInitializer="ones"

Data Types: char | string

`ObservationsIn` — Predictor data observation dimension
`"rows"` (default) | `"columns"`

Predictor data observation dimension, specified as "rows" or "columns".

Note

If you orient your predictor matrix so that observations correspond to columns and specify ObservationsIn="columns", then you might experience a significant reduction in computation time. You cannot specify ObservationsIn="columns" for predictor data in a table.

Example: ObservationsIn="columns"

Data Types: char | string

`Lambda` — Regularization term strength
`0` (default) | nonnegative scalar

Regularization term strength, specified as a nonnegative scalar. The software constructs the objective function for minimization from the quantile loss averaged over the quantiles (see Quantile Loss) and the ridge (L2) penalty term.

Example: Lambda=1e-4

Data Types: single | double

`Standardize` — Flag to standardize predictor data
`false` or `0` (default) | `true` or `1`

Flag to standardize the predictor data, specified as a numeric or logical 0 (false) or 1 (true). If you set Standardize to true, then the software centers and scales each numeric predictor variable by the corresponding column mean and standard deviation. The software does not standardize categorical predictors.

Example: Standardize=true

Data Types: single | double | logical

Convergence Control Options

expand all

`Verbose` — Verbosity level
`0` (default) | `1`

Verbosity level, specified as 0 or 1. The Verbose name-value argument controls the display of diagnostic information at the command line.

Value	Description
`0`	`fitrqnet` does not display diagnostic information.
`1`	`fitrqnet` periodically displays diagnostic information.

fitrqnet stores the diagnostic information in Mdl. Use Mdl.ConvergenceInfo.History to access the diagnostic information.

Example: Verbose=1

Data Types: single | double

`VerboseFrequency` — Frequency of verbose printing
`1` (default) | positive integer scalar

Frequency of verbose printing, which is the number of iterations between printing diagnostic information at the command line, specified as a positive integer scalar. A value of 1 indicates to print diagnostic information at every iteration.

Note

To use this name-value argument, you must set Verbose to 1.

Example: VerboseFrequency=5

Data Types: single | double

`InitialStepSize` — Initial step size
`[]` (default) | positive scalar | `"auto"`

Initial step size, specified as a positive scalar or "auto". By default, fitrqnet does not use the initial step size to determine the initial Hessian approximation used in training the model. However, if you specify an initial step size ${‖ s_{0} ‖}_{\infty}$ , then the initial inverse-Hessian approximation is $\frac{{‖ s_{0} ‖}_{\infty}}{{‖ \nabla ℒ_{0} ‖}_{\infty}} I$ . $\nabla ℒ_{0}$ is the initial gradient vector, and $I$ is the identity matrix.

To have fitrqnet determine an initial step size automatically, specify the value as "auto". In this case, the function determines the initial step size by using ${‖ s_{0} ‖}_{\infty} = 0.5 {‖ η_{0} ‖}_{\infty} + 0.1$ . $s_{0}$ is the initial step vector, and $η_{0}$ is the vector of unconstrained initial weights and biases.

Example: InitialStepSize="auto"

Data Types: single | double | char | string

`IterationLimit` — Maximum number of training iterations
`1e3` (default) | positive integer scalar

Maximum number of training iterations, specified as a positive integer scalar.

The software returns a trained model regardless of whether the training routine successfully converges. Mdl.ConvergenceInfo.ConvergenceCriterion contains convergence information.

Example: IterationLimit=1e8

Data Types: single | double

`GradientTolerance` — Relative gradient tolerance
`1e-6` (default) | nonnegative scalar

Relative gradient tolerance, specified as a nonnegative scalar.

Let $ℒ_{t}$ be the loss function at training iteration t, $\nabla ℒ_{t}$ be the gradient of the loss function with respect to the weights and biases at iteration t, and $\nabla ℒ_{0}$ be the gradient of the loss function at an initial point. If $\max | \nabla ℒ_{t} | \leq a \cdot GradientTolerance$ , where $a = \max (1, \min | ℒ_{t} |, \max | \nabla ℒ_{0} |)$ , then the training process terminates.

Example: GradientTolerance=1e-5

Data Types: single | double

`LossTolerance` — Loss tolerance
`1e-6` (default) | nonnegative scalar

Loss tolerance, specified as a nonnegative scalar.

If the function loss at some iteration is smaller than LossTolerance, then the training process terminates.

Example: LossTolerance=1e-8

Data Types: single | double

`StepTolerance` — Step size tolerance
`1e-6` (default) | nonnegative scalar

Step size tolerance, specified as a nonnegative scalar.

If the step size at some iteration is smaller than StepTolerance, then the training process terminates.

Example: StepTolerance=1e-4

Data Types: single | double

`ValidationData` — Validation data for training convergence detection
cell array | table

Validation data for training convergence detection, specified as a cell array or a table.

During the training process, the software periodically estimates the validation loss by using ValidationData. If the validation loss increases more than ValidationPatience times consecutively, then the software terminates the training.

You can specify ValidationData as a table if you use a table Tbl of predictor data that contains the response variable. In this case, ValidationData must contain the same predictors and response contained in Tbl. The software does not apply weights to observations, even if Tbl contains a vector of weights. To specify weights, you must specify ValidationData as a cell array.

If you specify ValidationData as a cell array, then it must have the following format:

ValidationData{1} must have the same data type and orientation as the predictor data. That is, if you use a predictor matrix X, then ValidationData{1} must be an m-by-p or p-by-m matrix of predictor data that has the same orientation as X. The predictor variables in the training data X and ValidationData{1} must correspond. Similarly, if you use a predictor table Tbl of predictor data, then ValidationData{1} must be a table containing the same predictor variables contained in Tbl. The number of observations in ValidationData{1} and the predictor data can vary.
ValidationData{2} must match the data type and format of the response variable, either Y or ResponseVarName. If ValidationData{2} is an array of responses, then it must have the same number of elements as the number of observations in ValidationData{1}. If ValidationData{1} is a table, then ValidationData{2} can be the name of the response variable in the table. If you want to use the same ResponseVarName or formula, you can specify ValidationData{2} as [].
Optionally, you can specify ValidationData{3} as an m-dimensional numeric vector of observation weights or the name of a variable in the table ValidationData{1} that contains observation weights. The software normalizes the weights with the validation data so that they sum to 1.

If you specify ValidationData and want to display the validation loss at the command line, set Verbose to 1.

Data Types: table | cell

`ValidationFrequency` — Number of iterations between validation evaluations
`1` (default) | positive integer scalar

Number of iterations between validation evaluations, specified as a positive integer scalar. A value of 1 indicates to evaluate validation metrics at every iteration.

Note

To use this name-value argument, you must specify ValidationData.

Example: ValidationFrequency=5

Data Types: single | double

`ValidationPatience` — Stopping condition for validation evaluations
`6` (default) | nonnegative integer scalar

Stopping condition for validation evaluations, specified as a nonnegative integer scalar. Training stops if the validation loss is greater than or equal to the minimum validation loss computed so far, ValidationPatience times consecutively. You can check the Mdl.ConvergenceInfo.History table to see the running total of times that the validation loss is greater than or equal to the minimum (Validation Checks).

Example: ValidationPatience=10

Data Types: single | double

Other Regression Options

expand all

`CategoricalPredictors` — Categorical predictors list
vector of positive integers | logical vector | character matrix | string array | cell array of character vectors | `"all"`

Categorical predictors list, specified as one of the values in this table. The descriptions assume that the predictor data has observations in rows and predictors in columns.

Value	Description
Vector of positive integers	Each entry in the vector is an index value indicating that the corresponding predictor is categorical. The index values are between 1 and `p`, where `p` is the number of predictors used to train the model. If `fitrqnet` uses a subset of input variables as predictors, then the function indexes the predictors using only the subset. The `CategoricalPredictors` values do not count any response variable, observation weights variable, or other variable that the function does not use.
Logical vector	A `true` entry means that the corresponding predictor is categorical. The length of the vector is `p`.
Character matrix	Each row of the matrix is the name of a predictor variable. The names must match the entries in `PredictorNames`. Pad the names with extra blanks so each row of the character matrix has the same length.
String array or cell array of character vectors	Each element in the array is the name of a predictor variable. The names must match the entries in `PredictorNames`.
`"all"`	All predictors are categorical.

By default, if the predictor data is in a table (Tbl), fitrqnet assumes that a variable is categorical if it is a logical vector, categorical vector, character array, string array, or cell array of character vectors. If the predictor data is a matrix (X), fitrqnet assumes that all predictors are continuous. To identify any other predictors as categorical predictors, specify them by using the CategoricalPredictors name-value argument.

For the identified categorical predictors, fitrqnet creates dummy variables using two different schemes, depending on whether a categorical variable is unordered or ordered. For an unordered categorical variable, fitrqnet creates one dummy variable for each level of the categorical variable. For an ordered categorical variable, fitrqnet creates one less dummy variable than the number of categories. For details, see Automatic Creation of Dummy Variables.

Example: CategoricalPredictors="all"

`PredictorNames` — Predictor variable names
string array of unique names | cell array of unique character vectors

Predictor variable names, specified as a string array of unique names or cell array of unique character vectors. The functionality of PredictorNames depends on the way you supply the training data.

If you supply X and Y, then you can use PredictorNames to assign names to the predictor variables in X.
- The order of the names in PredictorNames must correspond to the predictor order in X. Assuming that X has the default orientation, with observations in rows and predictors in columns, PredictorNames{1} is the name of X(:,1), PredictorNames{2} is the name of X(:,2), and so on. Also, size(X,2) and numel(PredictorNames) must be equal.
- By default, PredictorNames is {'x1','x2',...}.
If you supply Tbl, then you can use PredictorNames to choose which predictor variables to use in training. That is, fitrqnet uses only the predictor variables in PredictorNames and the response variable during training.
- PredictorNames must be a subset of Tbl.Properties.VariableNames and cannot include the name of the response variable.
- By default, PredictorNames contains the names of all predictor variables.
- A good practice is to specify the predictors for training using either PredictorNames or formula, but not both.

Example: PredictorNames=["SepalLength","SepalWidth","PetalLength","PetalWidth"]

Data Types: string | cell

`ResponseName` — Response variable name
`"Y"` (default) | character vector | string scalar

Response variable name, specified as a character vector or string scalar.

If you supply Y, then you can use ResponseName to specify a name for the response variable.
If you supply ResponseVarName or formula, then you cannot use ResponseName.

Example: ResponseName="response"

Data Types: char | string

`ResponseTransform` — Function for transforming raw response values
`"none"` (default) | function handle | function name

Function for transforming raw response values, specified as a function handle or function name. The default is "none", which means @(y)y, or no transformation. The function should accept a vector (the original response values) and return a vector of the same size (the transformed response values).

Example: Suppose you create a function handle that applies an exponential transformation to an input vector by using myfunction = @(y)exp(y). Then, you can specify the response transformation as ResponseTransform=myfunction.

Data Types: char | string | function_handle

`Weights` — Observation weights
nonnegative numeric vector | name of variable in `Tbl`

Observation weights, specified as a nonnegative numeric vector or the name of a variable in Tbl. The software weights each observation in X or Tbl with the corresponding value in Weights. The length of Weights must equal the number of observations in X or Tbl.

If you specify the input data as a table Tbl, then Weights can be the name of a variable in Tbl that contains a numeric vector. In this case, you must specify Weights as a character vector or string scalar. For example, if the weights vector W is stored as Tbl.W, then specify it as "W". Otherwise, the software treats all columns of Tbl, including W, as predictors when training the model.

By default, Weights is ones(n,1), where n is the number of observations in X or Tbl.

fitrqnet normalizes the weights to sum to 1.

Data Types: single | double | char | string

Cross-Validation Options

expand all

`CrossVal` — Flag to train cross-validated model
`"off"` (default) | `"on"`

Since R2025a

Flag to train a cross-validated model, specified as "on" or "off".

If you specify "on", then the software trains a cross-validated model with 10 folds.

You can override this cross-validation setting using the CVPartition, Holdout, KFold, or Leaveout name-value argument. You can use only one cross-validation name-value argument at a time to create a cross-validated model.

Alternatively, cross-validate later by passing Mdl to the crossval function.

Example: CrossVal="on"

Data Types: char | string

`CVPartition` — Cross-validation partition
`[]` (default) | `cvpartition` object

Since R2025a

Cross-validation partition, specified as a cvpartition object that specifies the type of cross-validation and the indexing for the training and validation sets.

To create a cross-validated model, you can specify only one of these four name-value arguments: CVPartition, Holdout, KFold, or Leaveout.

Example: Suppose you create a random partition for 5-fold cross-validation on 500 observations by using cvp = cvpartition(500,KFold=5). Then, you can specify the cross-validation partition by setting CVPartition=cvp.

`Holdout` — Fraction of data for holdout validation
scalar value in the range (0,1)

Since R2025a

Fraction of the data used for holdout validation, specified as a scalar value in the range (0,1). If you specify Holdout=p, then the software completes these steps:

Randomly select and reserve p*100% of the data as validation data, and train the model using the rest of the data.
Store the compact trained model in the Trained property of the cross-validated model.

To create a cross-validated model, you can specify only one of these four name-value arguments: CVPartition, Holdout, KFold, or Leaveout.

Example: Holdout=0.1

Data Types: double | single

`KFold` — Number of folds
`10` (default) | positive integer value greater than 1

Since R2025a

Number of folds to use in the cross-validated model, specified as a positive integer value greater than 1. If you specify KFold=k, then the software completes these steps:

Randomly partition the data into k sets.
For each set, reserve the set as validation data, and train the model using the other k – 1 sets.
Store the k compact trained models in a k-by-1 cell vector in the Trained property of the cross-validated model.

To create a cross-validated model, you can specify only one of these four name-value arguments: CVPartition, Holdout, KFold, or Leaveout.

Example: KFold=5

Data Types: single | double

`Leaveout` — Leave-one-out cross-validation flag
`"off"` (default) | `"on"`

Since R2025a

Leave-one-out cross-validation flag, specified as "on" or "off". If you specify Leaveout="on", then for each of the n observations (where n is the number of observations, excluding missing observations, specified in the NumObservations property of the model), the software completes these steps:

Reserve the one observation as validation data, and train the model using the other n – 1 observations.
Store the n compact trained models in an n-by-1 cell vector in the Trained property of the cross-validated model.

To create a cross-validated model, you can specify only one of these four name-value arguments: CVPartition, Holdout, KFold, or Leaveout.

Example: Leaveout="on"

Data Types: char | string

Note

You cannot use any cross-validation name-value argument together with the OptimizeHyperparameters name-value argument. You can modify the cross-validation for OptimizeHyperparameters only by using the HyperparameterOptimizationOptions name-value argument.

Hyperparameter Optimization

expand all

`OptimizeHyperparameters` — Parameters to optimize
`"none"` (default) | `"auto"` | `"all"` | string array or cell array of eligible parameter names | vector of `optimizableVariable` objects

Since R2025a

Parameters to optimize, specified as one of the following:

"none" — Do not optimize.
"auto" — Use ["Activations","Lambda","LayerSizes","Standardize"].
"all" — Optimize all eligible parameters.
String array or cell array of eligible parameter names.
Vector of optimizableVariable objects, typically the output of hyperparameters.

The optimization attempts to minimize the cross-validation loss (error) for fitrqnet by varying the parameters. To control the cross-validation type and other aspects of the optimization, use the HyperparameterOptimizationOptions name-value argument. When you use HyperparameterOptimizationOptions, you can use the (compact) model size instead of the cross-validation loss as the optimization objective by setting the ConstraintType and ConstraintBounds options.

Note

The values of OptimizeHyperparameters override any values you specify using other name-value arguments. For example, setting OptimizeHyperparameters to "auto" causes fitrqnet to optimize hyperparameters corresponding to the "auto" option and to ignore any specified values for the hyperparameters.

The eligible parameters for fitrqnet are:

Activations — fitrqnet optimizes Activations over the set ["relu","tanh","sigmoid","none"].
Lambda — fitrqnet optimizes Lambda over log-scaled values in the range [1e-5/NumObservations,1e5/NumObservations].
LayerBiasesInitializer — fitrqnet optimizes LayerBiasesInitializer over the two values ["zeros","ones"].
LayerWeightsInitializer — fitrqnet optimizes LayerWeightsInitializer over the two values ["glorot","he"].
LayerSizes — fitrqnet optimizes over the values 1, 2, and 3 representing the number of fully connected layers, excluding the final fully connected layer. fitrqnet optimizes each fully connected layer separately over 1 through 300 sizes in the layer, sampled on a logarithmic scale.
Note
When you use the LayerSizes argument, the iterative display shows the size of each relevant layer. For example, if the current number of fully connected layers is 3, and the three layers are of size 10, 79, and 44 (respectively), the iterative display shows LayerSizes for that iteration as [10 79 44].
Note
To access up to five fully connected layers or a different range of sizes in a layer, use hyperparameters to select the optimizable parameters and ranges.
Standardize — fitrqnet optimizes Standardize over the two values [true,false].

Set nondefault parameters by passing a vector of optimizableVariable objects that have nondefault values. For example, this code sets the range of NumLayers to [1 5] and optimizes Layer_4_Size and Layer_5_Size:

load carsmall
params = hyperparameters("fitrqnet",[Horsepower,Weight],MPG);
params(1).Range = [1 5];
params(10).Optimize = true;
params(11).Optimize = true;

Pass params as the value of OptimizeHyperparameters.

By default, the iterative display appears at the command line, and plots appear according to the number of hyperparameters in the optimization. For the optimization and plots, the objective function is log(1 + cross-validation loss). To control the iterative display, set the Verbose option of the HyperparameterOptimizationOptions name-value argument. To control the plots, set the ShowPlots option of the HyperparameterOptimizationOptions name-value argument.

Example: OptimizeHyperparameters="auto"

`HyperparameterOptimizationOptions` — Options for optimization
`HyperparameterOptimizationOptions` object | structure

Since R2025a

Options for optimization, specified as a HyperparameterOptimizationOptions object or a structure. This argument modifies the effect of the OptimizeHyperparameters name-value argument. If you specify HyperparameterOptimizationOptions, you must also specify OptimizeHyperparameters. All the options listed in the following table are optional. However, you must set ConstraintBounds and ConstraintType to return AggregateOptimizationResults. The options that you can set in a structure are the same as those in the HyperparameterOptimizationOptions object.

Option	Values	Default
`Optimizer`	`"bayesopt"` — Use Bayesian optimization. Internally, this setting calls `bayesopt`. `"gridsearch"` — Use grid search with `NumGridDivisions` values per dimension. `"gridsearch"` searches in a random order, using uniform sampling without replacement from the grid. After optimization, you can get a table in grid order by using the `sortrows` function. `"randomsearch"` — Search at random among `MaxObjectiveEvaluations` points.	`"bayesopt"`
`ConstraintBounds`	Constraint bounds for N optimization problems, specified as an N-by-2 numeric matrix or `[]`. The columns of `ConstraintBounds` contain the lower and upper bound values of the optimization problems. If you specify `ConstraintBounds` as a numeric vector, the software assigns the values to the second column of `ConstraintBounds`, and zeros to the first column. If you specify `ConstraintBounds`, you must also specify `ConstraintType`.	`[]`
`ConstraintTarget`	Constraint target for the optimization problems, specified as `"matlab"` or `"coder"`. If `ConstraintBounds` and `ConstraintType` are `[]` and you set `ConstraintTarget`, then the software sets `ConstraintTarget` to `[]`. The values of `ConstraintTarget` and `ConstraintType` determine the objective and constraint functions. For more information, see `HyperparameterOptimizationOptions`.	If you specify `ConstraintBounds` and `ConstraintType`, then the default value is `"matlab"`. Otherwise, the default value is `[]`.
`ConstraintType`	Constraint type for the optimization problems, specified as `"size"` or `"loss"`. If you specify `ConstraintType`, you must also specify `ConstraintBounds`. The values of `ConstraintTarget` and `ConstraintType` determine the objective and constraint functions. For more information, see `HyperparameterOptimizationOptions`.	`[]`
`AcquisitionFunctionName`	Type of acquisition function: `"expected-improvement-per-second-plus"` `"expected-improvement"` `"expected-improvement-plus"` `"expected-improvement-per-second"` `"lower-confidence-bound"` `"probability-of-improvement"` Acquisition functions whose names include `per-second` do not yield reproducible results, because the optimization depends on the run time of the objective function. Acquisition functions whose names include `plus` modify their behavior when they overexploit an area. For more details, see Acquisition Function Types.	`"expected-improvement-per-second-plus"`
`LossFun`	Type of validation loss to optimize, specified as `"auto"` or `"quantile"`. In the case of `fitrqnet`, the two options are equivalent, and the software uses the quantile loss averaged across the quantiles.	`"auto"`
`MaxObjectiveEvaluations`	Maximum number of objective function evaluations. If you specify multiple optimization problems using `ConstraintBounds`, the value of `MaxObjectiveEvaluations` applies to each optimization problem individually.	`30` for `"bayesopt"` and `"randomsearch"`, and the entire grid for `"gridsearch"`
`MaxTime`	Time limit for the optimization, specified as a nonnegative real scalar. The time limit is in seconds, as measured by `tic` and `toc`. The software performs at least one optimization iteration, regardless of the value of `MaxTime`. The run time can exceed `MaxTime` because `MaxTime` does not interrupt function evaluations. If you specify multiple optimization problems using `ConstraintBounds`, the time limit applies to each optimization problem individually.	`Inf`
`NumGridDivisions`	For `Optimizer="gridsearch"`, the number of values in each dimension. The value can be a vector of positive integers giving the number of values for each dimension, or a scalar that applies to all dimensions. The software ignores this option for categorical variables.	`10`
`ShowPlots`	Logical value indicating whether to show plots of the optimization progress. If this option is `true`, the software plots the best observed objective function value against the iteration number. If you use Bayesian optimization (`Optimizer="bayesopt"`), the software also plots the best estimated objective function value. The best observed objective function values and best estimated objective function values correspond to the values in the `BestSoFar (observed)` and `BestSoFar (estim.)` columns of the iterative display, respectively. You can find these values in the properties `ObjectiveMinimumTrace` and `EstimatedObjectiveMinimumTrace` of the `SupervisedLearningBayesianOptimization` object. If the problem includes one or two optimization parameters for Bayesian optimization, then `ShowPlots` also plots a model of the objective function against the parameters.	`true`
`SaveIntermediateResults`	Logical value indicating whether to save the optimization results. If this option is `true`, the software overwrites a workspace variable named `SupervisedLearningBayesoptResults` at each iteration. The variable is a `SupervisedLearningBayesianOptimization` object. If you specify multiple optimization problems using `ConstraintBounds`, the workspace variable is an `AggregateBayesianOptimization` object named `AggregateBayesoptResults`.	`false`
`Verbose`	Display level at the command line: `0` — No iterative display `1` — Iterative display `2` — Iterative display with additional information For details, see the `bayesopt` `Verbose` name-value argument and the example Optimize Classifier Fit Using Bayesian Optimization.	`1`
`UseParallel`	Logical value indicating whether to run the Bayesian optimization in parallel, which requires Parallel Computing Toolbox™. Due to the nonreproducibility of parallel timing, parallel Bayesian optimization does not necessarily yield reproducible results. For details, see Parallel Bayesian Optimization.	`false`
`Repartition`	Logical value indicating whether to repartition the cross-validation at every iteration. If this option is `false`, the optimizer uses a single partition for the optimization. A value of `true` usually gives the most robust results because this setting takes partitioning noise into account. However, for optimal results, `true` requires at least twice as many function evaluations.	`false`
Specify only one of the following three options.
`CVPartition`	`cvpartition` object created by `cvpartition`	`KFold=5` if you do not specify a cross-validation option
`Holdout`	Scalar in the range `(0,1)` representing the holdout fraction
`KFold`	Integer greater than 1

Example: HyperparameterOptimizationOptions=struct(UseParallel=true)

Output Arguments

collapse all

`Mdl` — Trained quantile neural network model
`RegressionQuantileNeuralNetwork` object | `RegressionPartitionedQuantileNeuralNetwork` object | cell array of model objects

Trained quantile neural network model, returned as a RegressionQuantileNeuralNetwork object, a RegressionPartitionedQuantileNeuralNetwork object, or a cell array of model objects.

If you set any of the name-value arguments CrossVal, CVPartition, Holdout, KFold, or Leaveout, then Mdl is a RegressionPartitionedQuantileNeuralNetwork object.
If you specify OptimizeHyperparameters and set the ConstraintType and ConstraintBounds options of HyperparameterOptimizationOptions, then Mdl is an N-by-1 cell array of model objects, where N is equal to the number of rows in ConstraintBounds. If none of the optimization problems yields a feasible model, then each cell array value is [].
Otherwise, Mdl is a RegressionQuantileNeuralNetwork model object.

To reference properties of a model object, use dot notation.

`AggregateOptimizationResults` — Aggregate optimization results
`AggregateBayesianOptimization` object

Since R2025a

Aggregate optimization results for multiple optimization problems, returned as an AggregateBayesianOptimization object. To return AggregateOptimizationResults, you must specify OptimizeHyperparameters and HyperparameterOptimizationOptions. You must also specify the ConstraintType and ConstraintBounds options of HyperparameterOptimizationOptions. For an example that shows how to produce this output, see Hyperparameter Optimization with Multiple Constraint Bounds.

More About

collapse all

Quantile Neural Network Structure

The default quantile neural network regression model has the following layer structure.

Structure	Description
	Input — This layer corresponds to the predictor data in `Tbl` or `X`.
	First fully connected layer — This layer has 10 outputs, by default. You can widen the layer or add more fully connected layers to the network by specifying the `LayerSizes` name-value argument. You can find the weights and biases for this layer in the `Mdl.LayerWeights{1}` and `Mdl.LayerBiases{1}` properties of `Mdl`, respectively.
	ReLU activation function — `fitrqnet` applies this activation function to the first fully connected layer. You can change the activation function by specifying the `Activations` name-value argument.
	Final fully connected layer — This layer has one output for each quantile specified by the `Quantiles` name-value argument. You can find the weights and biases for this layer in the `Mdl.LayerWeights{end}` and `Mdl.LayerBiases{end}` properties of `Mdl`, respectively.
	Output — This layer corresponds to the predicted response values.

Tips

You can use the α/2 and 1 – α/2 quantiles to create a prediction interval that captures an estimated 100*(1 – α) percent of the variation in the response. For an example, see Create Prediction Interval Using Quantiles.
You can use quantile regression models to fit models that are robust to outliers. For an example, see Fit Regression Models to Data with Outliers.

Algorithms

collapse all

Training Solver

fitrqnet uses a limited-memory Broyden-Fletcher-Goldfarb-Shanno quasi-Newton algorithm (LBFGS) [3] as its loss function minimization technique, where the software minimizes the quantile loss averaged over the quantiles (see Quantile Loss). The LBFGS solver uses a standard line-search method with an approximation to the Hessian.

Extended Capabilities

expand all

Automatic Parallel Support
Accelerate code by automatically running computation in parallel using Parallel Computing Toolbox™. (since R2025a)

To perform parallel hyperparameter optimization, use the UseParallel=true option in the HyperparameterOptimizationOptions name-value argument in the call to the fitrqnet function.

For more information on parallel hyperparameter optimization, see Parallel Bayesian Optimization.

For general information about parallel computing, see Run MATLAB Functions with Automatic Parallel Support (Parallel Computing Toolbox).

Version History

Introduced in R2024b

expand all

R2026a: A cross-validated quantile neural network regression model is a `RegressionPartitionedQuantileNeuralNetwork` object

A cross-validated quantile neural network regression model is a RegressionPartitionedQuantileNeuralNetwork object. In previous releases, a cross-validated quantile neural network regression model is a RegressionPartitionedQuantileModel object.

You can create a RegressionPartitionedQuantileNeuralNetwork object in two ways:

Create a cross-validated model from a quantile neural network regression model RegressionQuantileNeuralNetwork by using the crossval function.
Create a cross-validated model by using the fitrqnet function and specifying one of the name-value arguments CrossVal, CVPartition, Holdout, KFold, or Leaveout.

R2026a: Perform hyperparameter optimization with multiple quantiles

You can optimize the hyperparameters of a quantile regression model with multiple quantiles. Specify the OptimizeHyperparameters and Quantiles name-value arguments.

R2025a: Optimize or cross-validate

You can optimize or cross-validate quantile regression models created using fitrqnet.

To optimize the hyperparameters of a quantile regression model, specify the OptimizeHyperparameters name-value argument.
To cross-validate a quantile regression model, specify one of these name-value arguments: CrossVal, CVPartition, Holdout, KFold, or Leaveout. Alternatively, create a full quantile regression model and then use the crossval object function.

fitrqnet

Syntax

Description

Examples

Fit Quantile Neural Network Regression Model

Prevent Quantile Crossing Using Regularization

Optimize Hyperparameters of Quantile Regression Model

Input Arguments

Tbl — Sample data table

ResponseVarName — Response variable name name of variable in Tbl

formula — Explanatory model of response variable and subset of predictor variables character vector | string scalar

Y — Response data numeric vector

X — Predictor data numeric matrix

Name-Value Arguments

Neural Network Options

Quantiles — Quantiles 0.5 (default) | vector of values in the range [0,1]

LayerSizes — Sizes of fully connected layers 10 (default) | positive integer vector

Activations — Activation functions for fully connected layers "relu" (default) | "tanh" | "sigmoid" | "none" | string array | cell array of character vectors

LayerWeightsInitializer — Function to initialize fully connected layer weights "glorot" (default) | "he"

LayerBiasesInitializer — Type of initial fully connected layer biases "zeros" (default) | "ones"

ObservationsIn — Predictor data observation dimension "rows" (default) | "columns"

Lambda — Regularization term strength 0 (default) | nonnegative scalar

Standardize — Flag to standardize predictor data false or 0 (default) | true or 1

Convergence Control Options

Verbose — Verbosity level 0 (default) | 1

VerboseFrequency — Frequency of verbose printing 1 (default) | positive integer scalar

InitialStepSize — Initial step size [] (default) | positive scalar | "auto"

IterationLimit — Maximum number of training iterations 1e3 (default) | positive integer scalar

GradientTolerance — Relative gradient tolerance 1e-6 (default) | nonnegative scalar

LossTolerance — Loss tolerance 1e-6 (default) | nonnegative scalar

StepTolerance — Step size tolerance 1e-6 (default) | nonnegative scalar

ValidationData — Validation data for training convergence detection cell array | table

ValidationFrequency — Number of iterations between validation evaluations 1 (default) | positive integer scalar

ValidationPatience — Stopping condition for validation evaluations 6 (default) | nonnegative integer scalar

Other Regression Options

CategoricalPredictors — Categorical predictors list vector of positive integers | logical vector | character matrix | string array | cell array of character vectors | "all"

PredictorNames — Predictor variable names string array of unique names | cell array of unique character vectors

ResponseName — Response variable name "Y" (default) | character vector | string scalar

ResponseTransform — Function for transforming raw response values "none" (default) | function handle | function name

Weights — Observation weights nonnegative numeric vector | name of variable in Tbl

Cross-Validation Options

CrossVal — Flag to train cross-validated model "off" (default) | "on"

CVPartition — Cross-validation partition [] (default) | cvpartition object

Holdout — Fraction of data for holdout validation scalar value in the range (0,1)

KFold — Number of folds 10 (default) | positive integer value greater than 1

Leaveout — Leave-one-out cross-validation flag "off" (default) | "on"

Hyperparameter Optimization

OptimizeHyperparameters — Parameters to optimize "none" (default) | "auto" | "all" | string array or cell array of eligible parameter names | vector of optimizableVariable objects

HyperparameterOptimizationOptions — Options for optimization HyperparameterOptimizationOptions object | structure

Output Arguments

Mdl — Trained quantile neural network model RegressionQuantileNeuralNetwork object | RegressionPartitionedQuantileNeuralNetwork object | cell array of model objects

AggregateOptimizationResults — Aggregate optimization results AggregateBayesianOptimization object

More About

Quantile Neural Network Structure

Tips

Algorithms

Training Solver

Extended Capabilities

Automatic Parallel Support Accelerate code by automatically running computation in parallel using Parallel Computing Toolbox™. (since R2025a)

Version History

R2026a: A cross-validated quantile neural network regression model is a RegressionPartitionedQuantileNeuralNetwork object

R2026a: Perform hyperparameter optimization with multiple quantiles

R2025a: Optimize or cross-validate

See Also

Topics

`Tbl` — Sample data
table

`ResponseVarName` — Response variable name
name of variable in `Tbl`

`formula` — Explanatory model of response variable and subset of predictor variables
character vector | string scalar

`Y` — Response data
numeric vector

`X` — Predictor data
numeric matrix

`Quantiles` — Quantiles
`0.5` (default) | vector of values in the range [0,1]

`LayerSizes` — Sizes of fully connected layers
`10` (default) | positive integer vector

`Activations` — Activation functions for fully connected layers
`"relu"` (default) | `"tanh"` | `"sigmoid"` | `"none"` | string array | cell array of character vectors

`LayerWeightsInitializer` — Function to initialize fully connected layer weights
`"glorot"` (default) | `"he"`

`LayerBiasesInitializer` — Type of initial fully connected layer biases
`"zeros"` (default) | `"ones"`

`ObservationsIn` — Predictor data observation dimension
`"rows"` (default) | `"columns"`

`Lambda` — Regularization term strength
`0` (default) | nonnegative scalar

`Standardize` — Flag to standardize predictor data
`false` or `0` (default) | `true` or `1`

`Verbose` — Verbosity level
`0` (default) | `1`

`VerboseFrequency` — Frequency of verbose printing
`1` (default) | positive integer scalar

`InitialStepSize` — Initial step size
`[]` (default) | positive scalar | `"auto"`

`IterationLimit` — Maximum number of training iterations
`1e3` (default) | positive integer scalar

`GradientTolerance` — Relative gradient tolerance
`1e-6` (default) | nonnegative scalar

`LossTolerance` — Loss tolerance
`1e-6` (default) | nonnegative scalar

`StepTolerance` — Step size tolerance
`1e-6` (default) | nonnegative scalar

`ValidationData` — Validation data for training convergence detection
cell array | table

`ValidationFrequency` — Number of iterations between validation evaluations
`1` (default) | positive integer scalar

`ValidationPatience` — Stopping condition for validation evaluations
`6` (default) | nonnegative integer scalar

`CategoricalPredictors` — Categorical predictors list
vector of positive integers | logical vector | character matrix | string array | cell array of character vectors | `"all"`

`PredictorNames` — Predictor variable names
string array of unique names | cell array of unique character vectors

`ResponseName` — Response variable name
`"Y"` (default) | character vector | string scalar

`ResponseTransform` — Function for transforming raw response values
`"none"` (default) | function handle | function name

`Weights` — Observation weights
nonnegative numeric vector | name of variable in `Tbl`

`CrossVal` — Flag to train cross-validated model
`"off"` (default) | `"on"`

`CVPartition` — Cross-validation partition
`[]` (default) | `cvpartition` object

`Holdout` — Fraction of data for holdout validation
scalar value in the range (0,1)

`KFold` — Number of folds
`10` (default) | positive integer value greater than 1

`Leaveout` — Leave-one-out cross-validation flag
`"off"` (default) | `"on"`

`OptimizeHyperparameters` — Parameters to optimize
`"none"` (default) | `"auto"` | `"all"` | string array or cell array of eligible parameter names | vector of `optimizableVariable` objects

`HyperparameterOptimizationOptions` — Options for optimization
`HyperparameterOptimizationOptions` object | structure

`Mdl` — Trained quantile neural network model
`RegressionQuantileNeuralNetwork` object | `RegressionPartitionedQuantileNeuralNetwork` object | cell array of model objects

`AggregateOptimizationResults` — Aggregate optimization results
`AggregateBayesianOptimization` object

Automatic Parallel Support
Accelerate code by automatically running computation in parallel using Parallel Computing Toolbox™. (since R2025a)

R2026a: A cross-validated quantile neural network regression model is a `RegressionPartitionedQuantileNeuralNetwork` object