Replace extreme outliers in dataset
1 visualizzazione (ultimi 30 giorni)
I want to replace "extreme" outliers in a data set. The data set varies quite a lot in terms of values, i.e. the maximum values are 50 times larger than the minimum values. However, in circa 0.05 % of the data points there are obvious mistakes depending on measurements errors. These values are at least 10 times larger than the maximum values in the data set. My question is therefore:
- How can I replace the wrong value with the average of the value to the left and to the right of the wrong one? I know about the function "filloutliers" but the function finds values more than three local standard deviations away from the local mean. I need 5 or 6 means because of the large spread in data.
Have you considered hacking the source of filloutliers.m? If it is only a matter of scaling, then it is probably a simple one line change.
Modificato: David Wilson il 10 Apr 2019
Not difficult to hack, but slightly more complex to hack correctly. Just copy (and rename) the function into your work directory, and
>> which filloutliers
>> !copy "C:\Program Files\MATLAB\R2018b\toolbox\matlab\datafun\filloutliers.m" myFillOutliers.m
1 file(s) copied.
>> edit myFilloutliers
I'd start hacking on about line 335.
But there seems to be a problem calling the (invisible) function locateoutliers. You might have better luck carefully hacking the original function.