How to select the number of samples to train a Machine Learning algorithm?

Question

Jose Marques on 31 Jan 2019

https://it.mathworks.com/matlabcentral/answers/442422-how-to-select-the-number-of-samples-to-train-a-machine-learning-algorithm

Commented: Greg Heath on 4 Feb 2019

I am working with a dataset of 12000 samples covering about 5 years of an industrial process.

It is likely that the plant has undergone changes during this time (equipment, performance degradation, chemical products).

Is there a tool for identifying the best subset of this data? In my view, a temporal cut in the data could increase the quality of the models created.
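One way to test the temporal-cut idea is to hold out the most recent block of samples, train on progressively shorter recent windows, and compare the hold-out error. The sketch below does this with a one-variable least-squares line on synthetic data that contains an assumed process change. The data, the candidate start points, and the helper names (`fit_line`, `mse`, `best_start`) are illustrative assumptions, not part of the question or of any MATLAB tool.

```python
import random

def fit_line(xs, ys):
    """Ordinary least squares for y = a*x + b with a single input."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    a = sxy / sxx
    return a, my - a * mx

def mse(model, xs, ys):
    """Mean squared error of the fitted line on (xs, ys)."""
    a, b = model
    return sum((a * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def best_start(xs, ys, starts, n_holdout):
    """Train on xs[start:-n_holdout] for each start; score on the last n_holdout samples."""
    x_val, y_val = xs[-n_holdout:], ys[-n_holdout:]
    scores = {}
    for s in starts:
        model = fit_line(xs[s:-n_holdout], ys[s:-n_holdout])
        scores[s] = mse(model, x_val, y_val)
    return min(scores, key=scores.get), scores

# Synthetic, time-ordered data: the input-output relation changes at sample 600
rng = random.Random(0)
xs = [rng.uniform(0, 10) for _ in range(1200)]
ys = [(2.0 if t < 600 else 3.0) * x + rng.gauss(0, 0.5) for t, x in enumerate(xs)]

start, scores = best_start(xs, ys, starts=[0, 300, 600, 900], n_holdout=200)
print(start, scores)
```

With real data the same loop could wrap any of the model types being compared; the key design choice is that the hold-out block is always the newest data, since that is what the model will face in use.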

3 Comments

Jose Marques on 31 Jan 2019

Thanks for the comment!

The dataset has 426 inputs (I am using techniques for feature selection too).

I am using four algorithms to create the models: Regression Tree, Bagged Trees, SVM and Neural Networks.

Greg Heath on 4 Feb 2019

As a common sense rule of thumb I try to use at least 10 to 30 times as many training points as unknown parameters that have to be estimated.
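To make the rule of thumb concrete for the neural network case, the sketch below counts the weights of a network with a single hidden layer and finds the largest hidden layer that keeps the ratio of training samples to parameters at or above a target. The single-hidden-layer architecture, the single output, the 70% training fraction, and the reduction to 20 inputs are assumptions for illustration; only the 426 inputs and 12000 samples come from this thread.

```python
def n_weights(n_in, n_hidden, n_out):
    """Weights and biases of a one-hidden-layer feedforward network."""
    return (n_in + 1) * n_hidden + (n_hidden + 1) * n_out

def max_hidden(n_train, n_in, n_out, ratio):
    """Largest hidden-layer size with n_train >= ratio * n_weights (0 if none fits)."""
    h = 0
    while n_train >= ratio * n_weights(n_in, h + 1, n_out):
        h += 1
    return h

n_samples, n_in, n_out = 12000, 426, 1   # n_out = 1 is an assumption
n_train = n_samples * 7 // 10            # assumed 70% training split

# All 426 inputs
for ratio in (10, 30):
    print(ratio, max_hidden(n_train, n_in, n_out, ratio))

# The same check after a hypothetical feature selection down to 20 inputs
for ratio in (10, 30):
    print(ratio, max_hidden(n_train, 20, n_out, ratio))
```

Under these assumptions, all 426 inputs leave room for at most one hidden node at a 10:1 ratio and none at 30:1, which is one reason feature selection matters for this dataset.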

In addition I use 10 to 20 sets of random initial weights.
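The multi-start idea can be sketched as follows: train the same model from several random initial weights and keep the run with the lowest validation error. The model here is a deliberately tiny stand-in, y = a*tanh(b*x), trained by plain gradient descent on synthetic data. The model, the data, the learning rate, and the function names are all assumptions for illustration.

```python
import math
import random

def predict(w, x):
    a, b = w
    return a * math.tanh(b * x)

def mse(w, data):
    """Mean squared error of weights w on a list of (x, y) pairs."""
    return sum((predict(w, x) - y) ** 2 for x, y in data) / len(data)

def train(w, data, lr=0.05, epochs=300):
    """Batch gradient descent on the mean squared error."""
    a, b = w
    n = len(data)
    for _ in range(epochs):
        ga = gb = 0.0
        for x, y in data:
            t = math.tanh(b * x)
            err = a * t - y
            ga += 2 * err * t / n
            gb += 2 * err * a * (1 - t * t) * x / n
        a -= lr * ga
        b -= lr * gb
    return a, b

def multi_start(train_set, val_set, n_starts=10, seed=0):
    """Train from n_starts random initializations; return the best run by validation MSE."""
    rng = random.Random(seed)
    runs = []
    for _ in range(n_starts):
        w0 = (rng.uniform(-1, 1), rng.uniform(-1, 1))
        w = train(w0, train_set)
        runs.append((mse(w, val_set), w))
    return min(runs), runs

# Synthetic data from y = 1.5*tanh(0.8*x) plus noise
rng = random.Random(1)
points = [(x, 1.5 * math.tanh(0.8 * x) + rng.gauss(0, 0.05))
          for x in (rng.uniform(-3, 3) for _ in range(200))]
train_set, val_set = points[:150], points[150:]

(best_err, best_w), runs = multi_start(train_set, val_set)
print(best_err, best_w)
```

Selecting on a validation set rather than on training error keeps the choice among restarts from simply rewarding the run that overfits most.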

I assume, of course, that you have examined plots of the data to initialize your common sense.

Hope this Helps

Greg


Answer 1

BERGHOUT Tarek on 3 Feb 2019

https://it.mathworks.com/matlabcentral/answers/442422-how-to-select-the-number-of-samples-to-train-a-machine-learning-algorithm#answer_359276

You can use deep belief networks; they work very well for feature selection and mapping. Train your network on chunks of data drawn by randomly choosing (input, target) pairs, and at the same time pay attention to your approximation function: you must keep your error function at a local minimum. Deep belief nets are built from a stack of layer-wise pretrained modules (restricted Boltzmann machines or, in closely related architectures, auto-encoders), which allows all the parameters of the network to be tuned with a small amount of training data.
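Training on randomly chosen chunks of (input, target) pairs is commonly called mini-batch training. Below is a minimal sketch of the sampling step only, assuming plain Python lists and a hypothetical `minibatches` helper; the deep belief network training itself is not shown.

```python
import random

def minibatches(inputs, targets, batch_size, seed=None):
    """Yield shuffled (inputs, targets) chunks that cover the data once."""
    idx = list(range(len(inputs)))
    random.Random(seed).shuffle(idx)
    for i in range(0, len(idx), batch_size):
        chunk = idx[i:i + batch_size]
        # Index inputs and targets with the same chunk so pairs stay aligned
        yield [inputs[j] for j in chunk], [targets[j] for j in chunk]

# Toy data: 10 samples with 2 inputs each and one target per sample
X = [[i, i * i] for i in range(10)]
y = [2 * i for i in range(10)]

for xb, yb in minibatches(X, y, batch_size=4, seed=0):
    print(xb, yb)
```

Shuffling indices rather than the data itself keeps each input matched to its target, and calling the generator again with a different seed gives a fresh random partition for the next pass.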

https://www.youtube.com/watch?v=E2Mt_7qked0
