
[ENH] functional n_jobs parameter for knn classifier #2478


Open
baraline opened this issue Jan 1, 2025 · 12 comments · May be fixed by #2687
baraline (Member) commented Jan 1, 2025

Describe the feature or idea you want to propose

The current n_jobs parameter does not do anything in the KNN classifier.

Describe your proposed solution

Make use of it! How exactly is TBD. If that is not possible, a warning should at least be raised.

Describe alternatives you've considered, if relevant

No response

Additional context

I was curious why we were so slow after looking at the sequentia benchmark...

baraline added the enhancement and classification labels Jan 1, 2025
Ramana-Raja (Contributor) commented Jan 6, 2025

I think parallelization could potentially be applied in the _kneighbors method, but, as shown in the screenshots below, it actually worsens execution time. This is likely because the task itself is too lightweight to benefit from parallel processing. So I'd say it's safe to conclude that parallelization isn't really useful here.

As for the warning, I’m not entirely sure what you mean, but maybe we can just remove the n_jobs parameter and turn off multithreading?
[screenshots: timing results with and without parallelization]

Testing on data: [screenshot]

baraline (Member, Author) commented Jan 7, 2025

Thanks for the benchmarking job! Could you try switching the joblib backend to threading and see if it makes a difference? As the distance functions are njit numba functions, we shouldn't need to use the loky backend. Otherwise, defining the kneighbors function as a numba function with parallel=True might work better.
Additionally, using more computationally intensive functions like dtw instead of euclidean might change the results.
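
A rough sketch of the threading-backend experiment being suggested; `pairwise_threaded` and `_query_row` are illustrative names, not aeon API, and any speed-up assumes the compiled distance kernel does not hold the GIL:

```python
import numpy as np
from joblib import Parallel, delayed

from aeon.distances import dtw_distance  # njit-compiled distance


def _query_row(q, X_train):
    # distances from one query series to every training series
    return np.array([dtw_distance(q, t) for t in X_train])


def pairwise_threaded(X_query, X_train, n_jobs=1):
    # "threading" avoids loky's process start-up cost; it only pays off
    # if the compiled distance kernel runs without holding the GIL
    rows = Parallel(n_jobs=n_jobs, backend="threading")(
        delayed(_query_row)(q, X_train) for q in X_query
    )
    return np.stack(rows)
```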

aadya940 (Contributor) commented Jan 7, 2025

@baraline @Ramana-Raja How about parallelizing the loop in _predict in addition to what @Ramana-Raja has already done? It doesn't seem to have any loop dependencies, so we could try?
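
A minimal sketch of that idea, with illustrative names rather than the actual class internals (a plain k-NN majority vote stands in for the real per-sample logic):

```python
from collections import Counter

import numpy as np
from joblib import Parallel, delayed

from aeon.distances import dtw_distance


def _predict_one(x, X_train, y_train, k):
    # stand-in for the real per-sample logic: find k nearest, majority vote
    dists = np.array([dtw_distance(x, t) for t in X_train])
    nearest = np.argsort(dists)[:k]
    return Counter(np.asarray(y_train)[nearest]).most_common(1)[0][0]


def predict_parallel(X_test, X_train, y_train, k=1, n_jobs=1):
    # each test sample is independent, so this loop is embarrassingly parallel
    return np.array(
        Parallel(n_jobs=n_jobs)(
            delayed(_predict_one)(x, X_train, y_train, k) for x in X_test
        )
    )
```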

baraline (Member, Author) commented Jan 7, 2025

It could work as well, yes, but then you need to tune the number of threads (for kneighbors) per process (for each sample or group of samples to predict) to find the right balance.

aadya940 (Contributor) commented Jan 7, 2025

Maybe we can limit the number of threads per process to the max CPU count divided by n_jobs?
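
A sketch of that budgeting, assuming numba's set_num_threads is the knob each worker applies:

```python
import os

from numba import set_num_threads

n_jobs = 4  # joblib workers requested by the user

# cap numba threads per worker so workers * threads does not oversubscribe
# the machine; each worker would need to apply this at the start of its task
threads_per_worker = max(1, (os.cpu_count() or 1) // n_jobs)
set_num_threads(threads_per_worker)
```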

Ramana-Raja (Contributor) commented Jan 7, 2025

> @baraline @Ramana-Raja How about parallelizing the loop in _predict in addition to what @Ramana-Raja has already done? It doesn't seem to have any loop dependencies, so we could try?

This approach also doesn't seem to work as intended when the data is small. It actually ends up making the execution time worse, as you can see below.
[screenshots: timing results]
Testing done with dtw: [screenshot]
But if the data is large, it performs better:
[screenshots: timing results]
Testing done on data: [screenshot]

aadya940 (Contributor) commented Jan 7, 2025

@Ramana-Raja That behavior is expected. It occurs because the time required for process creation and context switching exceeds the compute time. To address this, analyze how the problem scales and plot execution times with and without parallelization; the intersection point indicates the input size beyond which parallelism becomes beneficial. Ideally, this function should switch dynamically between single-threaded and multithreaded execution.
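
A rough way to run that scaling study, reusing the `predict_parallel` sketch from earlier in the thread (the sizes and job counts below are arbitrary):

```python
import time

import numpy as np

rng = np.random.default_rng(0)
for n in (50, 100, 200, 400, 800):
    # synthetic data just to probe scaling behaviour
    X_train = rng.normal(size=(n, 100))
    y_train = np.arange(n) % 2
    X_test = X_train[: max(1, n // 5)]
    timings = {}
    for jobs in (1, 4):
        t0 = time.perf_counter()
        predict_parallel(X_test, X_train, y_train, k=1, n_jobs=jobs)
        timings[jobs] = time.perf_counter() - t0
    # the crossover is where the parallel time drops below the serial time
    print(f"n={n}: serial {timings[1]:.3f}s, parallel {timings[4]:.3f}s")
```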

Ramana-Raja (Contributor) commented

> @Ramana-Raja That behavior is expected. It occurs because the time required for process creation and context switching exceeds the compute time. To address this, analyze how the problem scales and plot execution times with and without parallelization; the intersection point indicates the input size beyond which parallelism becomes beneficial. Ideally, this function should switch dynamically between single-threaded and multithreaded execution.

This seems to depend on the CPU, right? The optimal data size might vary for users with different CPUs. I think the best approach is to leave it configurable as a hyperparameter.

aadya940 (Contributor) commented Jan 7, 2025

Makes sense. @baraline, wdyt?

baraline (Member, Author) commented Jan 7, 2025

I think the simplest option would be to offload such hassle to the numba compiler. We could use the existing functions of the distance module (e.g. euclidean_pairwise_distance) to compute the distance matrix in parallel, by adding an n_jobs=1 optional parameter, using numba's set_num_threads function, and making _euclidean_pairwise_distance and _euclidean_from_multiple_to_multiple_distance use parallel=True with prange on both loops. This would not change the default behaviour but would allow us to make use of n_jobs without much work.

This would necessitate a change in such functions for all distances, though. @chrisholder, what do you think?
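
A rough sketch of the shape of that change, using a standalone squared-Euclidean example rather than the actual aeon internals:

```python
import numpy as np
from numba import njit, prange, set_num_threads


@njit(parallel=True)
def _pairwise_sq_euclidean(X, Y):
    # numba parallelises the outermost prange; the inner loops stay serial
    out = np.zeros((X.shape[0], Y.shape[0]))
    for i in prange(X.shape[0]):
        for j in range(Y.shape[0]):
            acc = 0.0
            for t in range(X.shape[1]):
                diff = X[i, t] - Y[j, t]
                acc += diff * diff
            out[i, j] = acc
    return out


def pairwise_sq_euclidean(X, Y, n_jobs=1):
    # n_jobs maps onto numba's thread count; the default of 1 leaves
    # behaviour unchanged, as proposed above
    if n_jobs > 1:
        set_num_threads(n_jobs)
    return _pairwise_sq_euclidean(X, Y)
```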

TonyBagnall changed the title from "[ENH] functionnal n_jobs parameter for knn classifier" to "[ENH] functional n_jobs parameter for knn classifier" Jan 10, 2025
steenrotsman (Contributor) commented

I've tried two changes to KNeighborsTimeSeriesClassifier and profiled both:

  1. Use n_jobs in _predict
  2. Use dtaidistance instead of Aeon's dtw_distance

[screenshots: code changes and profiling script]

The left and middle screens show the parallelization and the dtaidistance integration into the class; the right screen shows the profiling code. These are the results of profiling on my 11th Gen Intel Core i7-1165G7 @ 2.80GHz × 4 CPU:

| name | avg | std | min |
| --- | --- | --- | --- |
| Aeon ACSF1 (1) | 110.2201 | 2.4291 | 107.7615 |
| Aeon ACSF1 (8) | 49.2053 | 1.0859 | 48.3538 |
| Aeon ArrowHead (1) | 1.8508 | 0.0506 | 1.7939 |
| Aeon ArrowHead (8) | 0.5634 | 0.0114 | 0.5545 |
| Aeon GunPoint (1) | 0.8112 | 0.0238 | 0.7918 |
| Aeon GunPoint (8) | 1.0206 | 1.6650 | 0.2700 |
| DTAI ACSF1 (1) | 9.7684 | 0.3365 | 9.5679 |
| DTAI ACSF1 (8) | 3.2668 | 0.0646 | 3.1645 |
| DTAI ArrowHead (1) | 0.2433 | 0.0058 | 0.2370 |
| DTAI ArrowHead (8) | 0.0840 | 0.0090 | 0.0744 |
| DTAI GunPoint (1) | 0.1359 | 0.0022 | 0.1332 |
| DTAI GunPoint (8) | 0.0551 | 0.0016 | 0.0530 |

Conclusion: dtaidistance helps more than parallelization, but parallelization certainly helps on larger data sets.
dtaidistance is implemented in C, but its performance gains are almost entirely due to it allowing an upper bound. I've parallelized in _predict instead of _kneighbors because the upper bound means _kneighbors is no longer embarrassingly parallel, while _predict still is.

For the issue at hand, I'd put the parallelization in _predict and open a new issue to either include dtaidistance as a (soft) dependency or implement max_dist in dtw_distance and possibly others. Let me know what you think!
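
A sketch of how the upper bound prunes work in a 1-NN scan; `nearest_with_bound` is an illustrative helper, not the classifier's actual code:

```python
import numpy as np
from dtaidistance import dtw


def nearest_with_bound(query, X_train):
    # keep the best distance so far as the bound; dtw.distance returns
    # infinity as soon as the accumulated cost exceeds max_dist, so
    # hopeless candidates are abandoned early
    best, best_idx = np.inf, -1
    for i, cand in enumerate(X_train):
        d = dtw.distance(query, cand, max_dist=best)
        if d < best:
            best, best_idx = d, i
    return best_idx, best
```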

MatthewMiddlehurst (Member) commented

Sounds good generally. There is some discussion to be had on how exactly to implement the soft-dependency distance and whether we want to go that route to begin with, but we can leave that to its own issue.

There is a PR to parallelise the aeon distances in #2545. This is only being held up due to a notebook issue, so it may be worth testing that as well.
