Tuning hyperparameters using Bayesian optimisation

55 views (last 30 days)
luke on 1 May 2024
Commented: Dinu on 5 Feb 2025 at 15:21
Hi all,
I have configured an artificial neural network using 'fitnet' and need to tune the hyperparameters within the network, since it is performing rather poorly. I am still rather new to MATLAB and this is all new to me. Any help with understanding the Bayesian optimisation process in basic terms would be greatly appreciated. I am potentially looking to use the 'bayesopt' function but don't understand how it works.

Answers (1)

Katja Mogalle on 6 May 2024
Hi Luke,
In simple terms, Bayesian optimization is an algorithm that helps you choose the best hyperparameters, i.e. the values that define the structure or training circumstances of a neural network. You typically define a set of values for the algorithm to explore (e.g. network size, activation functions, training parameters), and Bayesian optimization takes care of figuring out which combinations of parameters to try in order to reach an optimal outcome (the objective function is typically the cross-validation loss).
Using the bayesopt function is one approach (it is the most flexible, but perhaps also the most difficult to use). Depending on the complexity and size of your task, I can propose two directions that might be easier for getting started:
A) If you think a smaller, less complex network is able to solve the task, you can try fitcnet (for classification) or fitrnet (for regression) from the Statistics and Machine Learning Toolbox. Both functions have built-in support for Bayesian hyperparameter optimization. For example, with the OptimizeHyperparameters option, the software attempts to minimize the cross-validation loss (error) by varying parameters such as the activation functions, layer sizes, pre-processing options, and more. A minimal sketch is shown below.
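Here is a minimal sketch, using the carsmall sample data that ships with MATLAB as a stand-in for your own predictors and response (an assumption, since your data isn't shown):

load carsmall
tbl = rmmissing(table(Acceleration, Displacement, Horsepower, Weight, MPG));

rng('default')  % for reproducibility
% 'auto' searches over activation functions, layer sizes, regularization
% strength, and standardization; the objective is the cross-validation loss.
Mdl = fitrnet(tbl, 'MPG', ...
    'OptimizeHyperparameters', 'auto', ...
    'HyperparameterOptimizationOptions', ...
    struct('MaxObjectiveEvaluations', 30, 'ShowPlots', false));

After the search finishes, the returned model is retrained on all the data using the best hyperparameter combination found.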
B) Alternatively, if you need a bigger and more flexible network architecture, or you are working with image data, it might be better to start in the Deep Learning area. Here I can recommend the route via the Experiment Manager app.
Or if, in the end, you need more flexibility over the Bayesian optimization settings, here is an example using the bayesopt function: https://www.mathworks.com/help/deeplearning/ug/deep-learning-using-bayesian-optimization.html .
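If you do end up using bayesopt directly, here is a hedged sketch of the general pattern (again with carsmall as placeholder data): each hyperparameter is described as an optimizableVariable, and the objective function returns the cross-validation loss for one candidate combination.

load carsmall
tbl = rmmissing(table(Acceleration, Displacement, Horsepower, Weight, MPG));

% Search space: one optimizableVariable per hyperparameter.
layerSize = optimizableVariable('layerSize', [5 50], 'Type', 'integer');
lambda    = optimizableVariable('lambda', [1e-5 1e-1], 'Transform', 'log');

% bayesopt passes the current candidate values in as a one-row table.
objFcn = @(params) kfoldLoss(fitrnet(tbl, 'MPG', ...
    'LayerSizes', params.layerSize, ...
    'Lambda', params.lambda, ...
    'KFold', 5));

results = bayesopt(objFcn, [layerSize lambda], ...
    'MaxObjectiveEvaluations', 20, 'Verbose', 0);
best = bestPoint(results)  % best hyperparameter values found

bayesopt fits a probabilistic model of the objective after each evaluation and uses it to pick the next candidate, which is why it usually needs far fewer trials than a grid search.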
Hope this helps.
  1 Comment
Dinu on 5 Feb 2025 at 15:21
Hello Katja,
Thank you for your detailed explanation on this.
I had a follow-up question: when I use the 'OptimizeHyperparameters' option with 'fitrnet', the optimization takes a long time to complete. I assume this is because each trial runs up to the default limit of 1000 training iterations/epochs. To reduce the time the optimization takes, I have tried the following:
i. separating my data into training and validation sets, as one normally would with fitrnet, to enable early stopping; this did not help.
ii. setting the iteration limit to < 1000, to reduce the number of epochs each trial runs during the optimization step. I'm a bit worried about doing this, as I feel it would mean that the loss curve has not flattened out enough.
Could you please suggest a better way to bring the hyperparameter optimization time down?
Thanks
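A sketch of settings that can shorten the search, assuming the documented fitrnet options ('HyperparameterOptimizationOptions', 'MaxObjectiveEvaluations', 'UseParallel', 'IterationLimit') fit this setup: capping the number of objective evaluations and running them in parallel usually saves more time than shortening each training run.

load carsmall  % placeholder data; substitute your own table
tbl = rmmissing(table(Acceleration, Displacement, Horsepower, Weight, MPG));

% Fewer optimization trials (default budget is 30), run in parallel.
opts = struct('MaxObjectiveEvaluations', 15, ...
              'UseParallel', true, ...   % requires Parallel Computing Toolbox
              'ShowPlots', false);

Mdl = fitrnet(tbl, 'MPG', ...
    'OptimizeHyperparameters', 'auto', ...
    'HyperparameterOptimizationOptions', opts, ...
    'IterationLimit', 400);  % fewer epochs per trial

Lowering 'MaxObjectiveEvaluations' trades search thoroughness for time, while 'UseParallel' keeps the same budget and spreads it across workers. If you also lower 'IterationLimit', inspecting Mdl.TrainingHistory afterwards helps confirm that the loss had levelled off.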
