Why vectorized calculations are faster than for loops?

18 visualizzazioni (ultimi 30 giorni)
Why it's faster in Matlab? Is it because better memory treating, or paralleling? If this is only due parallel computation, on single-core laptop it will be now difference between? Thanks<

Risposte (2)

Jan
Jan il 26 Ott 2014
Modificato: Jan il 26 Ott 2014
At first: There is no evidence that vectorized code is faster in general.
If a build-in function can be applied to a complete array, a vectorization is much faster than a loop appraoch. When large temporary arrays are required, the benefits of the vectorization can be dominated by the expensive allocation of the memory, when it does not match into the processor cache.
A secondray effect of vectorizing is that the code looks more clear, at least as a rule of thumb. A trivial example:
% Loops:
A = rand(10);
B = rand(10);
C = zeros(size(A));
for i2 = 1:size(A, 2)
for i1 = 1:size(A, 1) % Columns in the inner loop
C(i1, i2) = A(i1, i2) + B(i1, i2);
end
end
% Vectorized:
C = A + B;
The 2nd method is faster concerning the runtime, but also for the programming and debug time. There is almost no chance to create a bug and it will be very easy to understand the code, when the program needs changes in the future.
  3 Commenti
Mikhail
Mikhail il 27 Ott 2014
Modificato: Mikhail il 27 Ott 2014
I still didn't understand why it happens? Why build-in function are faster? For what (fundamental) reason it happens?
Keldon Alleyne
Keldon Alleyne il 1 Ott 2018
Multiplying matrices in loops is O(N^3), while the fastest algorithms using other methods are O(N^2.3) - O(N^2.8), which can easily explain the differences in performance.
On my laptop with Matlab 2018b I get:
Elapsed time is 0.112362 seconds.
Elapsed time is 0.214544 seconds.
Elapsed time is 0.007935 seconds.

Accedi per commentare.


Vibhav
Vibhav il 24 Apr 2025
Spostato: Walter Roberson il 24 Apr 2025
Looks like your question about why vectorizing is fundamentally faster (in most cases) was never answered.
The answer is complicated and I won't provide all the details, but in addition to optimized memory access and multithreading, vectorized code uses Single Instruction Multiple Data (SIMD) instructions which are specific CPU instructions that perform the same computation on multiple data, thus speeding up the computational throughput of the same CPU / core. You can look up "SIMD" for more information on how this is done, but the TL;DR is that vectorized MATLAB core gets "Just-In-Time" (JIT) compiled to BLAS and LAPACK subroutines that exploit SIMD instructions.
Vectorization also exploits multithreaded routines on machines that support it. It is possible to have "multithreaded" code on a single core CPU, but that is not true parallelism as only one instruction can execute at a time, but from a user's standpoint it appears to be concurrent.

Categorie

Scopri di più su Performance and Memory in Help Center e File Exchange

Prodotti

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by