Skip to content

Commit 6eb46e2

Browse files
committed
When running MAT_MUL, divide the work up into small chunks, and execute the chunks.
Currently if one thread stalls, it'll delay the final result. Testing shows that with this change, threads finish within a few nanoseconds of each other instead of being spread out over 1-2 ms, the total time is also faster and more consistent.
1 parent 4dba7e8 commit 6eb46e2

File tree

1 file changed

+540
-470
lines changed

1 file changed

+540
-470
lines changed

0 commit comments

Comments
 (0)