downsample data adapively/​"intellige​ntly"

62 visualizzazioni (ultimi 30 giorni)
Andreas
Andreas il 16 Set 2021
Commentato: Wolfie il 28 Lug 2022
I have a set of data that contains ~7800 data points (the red curve in the figure). I have reduced the number of data points with downsample() to 19 data points (blue curve). However, I would like to position the points more "intelligently". Is there a function in Matlab where the points are placed with some kind of least square minimization/adaptive spacing? The data is currently stored in a struct (data.Stress and data.Strain).
The number of data points must be possible to fix by the user (e.g. =19 in this case).
The most important part of the curve is the beginning, but I think that a general minimization would give a good enough result.
Adding a point between the first and second point and remoing one point where the curve is more or less linear would give a good-enough result. But modifying the curve manually isn't so attractive as there are quite a few curves to process.
  1 Commento
Mathieu NOE
Mathieu NOE il 17 Set 2021
hello
I would suggest you do a linear interpolation, but the new x vector should have more points in the beginning compared to the end of the curve
you can try different x axis distributions (linear decreasing or exponential) ....

Accedi per commentare.

Risposta accettata

Jan
Jan il 17 Set 2021
Modificato: Jan il 17 Set 2021
This is not a trivial problem. In the general case it is a global optimization problem and there can be a huge number of equivalent solutions.
The Douglas-Peuker-algorithm is a fair approach, if the input is given as discrete set of points, see: https://www.mathworks.com/help/images/ref/reducepoly.html or https://www.mathworks.com/matlabcentral/fileexchange/61046-douglas-peucker-algorithm
An alternative: Calculate the cumulative sum of the absolute values of the 2nd derivative. Now find rounded equidistant steps on this line. The higher the curvature, the more points you get:
x = linspace(0, 4*pi, 500);
y = sin(x);
n = 21; % Number of points
ddy = gradient(gradient(y, x), x); % 2nd derivative
sy = cumsum(abs(ddy));
idx = interp1(sy, 1:numel(x), linspace(sy(1), sy(end), n), 'nearest');
plot(x, y);
hold on
plot(x(idx), y(idx), 'ro');
This looks very nice even for a small number of points. To my surprise it looks such good, that this must be a standard approach. Does anybody know, how this algorithm is called?
  1 Commento
Wolfie
Wolfie il 28 Lug 2022
This is nice, but if it's possible for the 2nd derivative to be zero then interp1 will complain about grid vectors not being unique. Adding some small steady increment to sy can avoid this e.g.
sy = sy + linspace(0,max(sy)/10,numel(sy));

Accedi per commentare.

Più risposte (1)

Andreas
Andreas il 17 Set 2021
Modificato: Andreas il 17 Set 2021
Thanks @Jan and @Mathieu NOE, I used bits and pieces from your input to this (principal) code
x = linspace(0,10,500);
y = log(x);
plot(x,y,'-r','LineWidth',2)
hold on
n = 12;
pot = 2.0;
% logarithmic
scale = x(end)/(10^pot-1.0);
red_x = (logspace(0,pot,n)-1.0)*scale;
red_y = interp1(x,y,red_x,'nearest');
red_y(1) = -3.9; %"cheating" for this example, as ln(0) -> -inf
plot(red_x,red_y,'-b*','LineWidth',2)
% linear
red_x = linspace(0,x(end),n);
red_y = interp1(x,y,red_x,'nearest');
red_y(1) = -3.9;
plot(red_x,red_y,'-co','LineWidth',2)
plot(x,y,'-r','LineWidth',2)
legend('Function log(x)','logarithmic distribution','linear distribution')
It gives a better interpolation for my types of curves with logarithmic distribution. Below is an example of my "real" curves.

Categorie

Scopri di più su Colormaps in Help Center e File Exchange

Prodotti


Release

R2021a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by