Relatively easy optimization problem in Excel it's hard to implement on Matlab

Question

Barbab il 6 Lug 2022

0
Link

Link diretto a questa domanda

https://it.mathworks.com/matlabcentral/answers/1754460-relatively-easy-optimization-problem-in-excel-it-s-hard-to-implement-on-matlab

Modificato: Torsten il 7 Lug 2022

rng('default')
% creating fake data
data = randi([-1000 +1000],30,500);
yt = randi([-1000 1000],30,1);
% creating fake missing values
row = randi([1 15],1,500);
col = rand(1,500) < .5;
% imputing missing fake values
for i = 1:500
    if col(i) == 1
        data(1:row(i),i) = nan;
    end
end
%% here starts my problem
wgts = ones(1,500); % optimal weights needs to be binary (only zero or one)
% this would be easy with matrix formulas but I have missing values at the
% beginning of the series
for j = 1:30
    xt(j,:) = sum(data(j,:) .* wgts,2,'omitnan');
end
X = [xt(3:end) xt(2:end-1) xt(1:end-2)];
y = yt(3:end);
% from here I basically need to:
% maximize the Adjusted R squared of the regression fitlm(X,y)
% by changing wgts
% subject to wgts = 1 or wgts = 0
% and optionally to impose sum(wgts,'all') = some number;
% basically I need to select the data cols with the highest explanatory
% power, omitting missing data

This is easy to implement with Excel solver, but it only can handle 200 decision variables and it takes a lot of time. Thank you in advance.

2 Commenti
Mostra NessunoNascondi Nessuno

Sam Chak il 6 Lug 2022

@Barbab,

Unsure what went wrong. Can you show your results in Excel? It is probably better to compare the performances of having the same data.

Barbab il 6 Lug 2022

There is nothing wrong. I am not able to implement it in Matlab. I am familiar with Matlab but I don’t know where to start with this problem

Accedi per commentare.

Accedi per rispondere a questa domanda.

Answer 1

Torsten il 6 Lug 2022

0
Link

Link diretto a questa risposta

https://it.mathworks.com/matlabcentral/answers/1754460-relatively-easy-optimization-problem-in-excel-it-s-hard-to-implement-on-matlab#answer_1001385

Modificato: Torsten il 6 Lug 2022

Apri in MATLAB Online

If you accept a MATLAB solution for this problem:

min: norm(X*p-y)
s.c.
xt = data*w
0<=w<=1
w integer

where p is (3x1) and "norm" is either 1-norm or max-norm, you can use intlinprog.

If you insist at maximizing adjustable r-squared, I think you will have to use ga.

3 Commenti
Mostra 1 commento meno recenteNascondi 1 commento meno recente

Barbab il 6 Lug 2022

My problem is that I have two lags of the indpendent variable and the independent variable is a linear combination in what is contained in "data"

Torsten il 7 Lug 2022

Modificato: Torsten il 7 Lug 2022

Then you will have to use ga:

https://de.mathworks.com/help/gads/ga.html

And - contrary to the title - I find your optimization problem is quite hard.

Accedi per commentare.

Relatively easy optimization problem in Excel it's hard to implement on Matlab

2 Commenti
Mostra NessunoNascondi Nessuno

Risposte (1)

3 Commenti
Mostra 1 commento meno recenteNascondi 1 commento meno recente

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

Relatively easy optimization problem in Excel it's hard to implement on Matlab

2 Commenti Mostra NessunoNascondi Nessuno

Risposte (1)

3 Commenti Mostra 1 commento meno recenteNascondi 1 commento meno recente

Vedere anche

Categorie

Tag

Prodotti

Release

Community Treasure Hunt

2 Commenti
Mostra NessunoNascondi Nessuno

3 Commenti
Mostra 1 commento meno recenteNascondi 1 commento meno recente