docs

Jul 5, 2024

eddb9e8 · Jul 5, 2024

Name	Name	Last commit message	Last commit date
parent directory ..
README.md	README.md	Refactoring of benchmarks (#133 )	Jul 5, 2024

README.md

Developer Guide

This document covers topics useful for contributors to Scikit-learn_bench:

Developer Guide
- High-level workflow of Scikit-learn_bench
- Configs parser workflow

High-Level Workflow of Scikit-learn_bench

stateDiagram-v2
    classDef inputOutput fill:#33b,color:white,stroke-width:2px,stroke:white;

    user_arguments:::inputOutput --> ArgumentParser
    BenchmarksRunner --> raw_results[JSON]:::inputOutput
    raw_results[JSON] --> ReportGenerator
    ReportGenerator --> benchmarks_report[Excel]:::inputOutput

    state BenchmarksRunner {
        ArgumentParser --> ConfigParser: config_arguments
        ArgumentParser --> Benchmarks: other_arguments
        ConfigParser --> Benchmarks: benchmark_cases\n[JSON-formatted string]
        ConfigParser --> Benchmarks: benchmark_filters\n[JSON-formatted string]

        state Benchmarks {
            SklearnLikeEstimator --> raw_results[JSON]
            ... --> raw_results[JSON]
            Functional --> raw_results[JSON]
        }
    }

Scikit-learn_bench consists of three main parts:

Benchmarks runner:
1. Consumes user-provided high-level arguments (argument parser).
2. Transforms arguments to benchmark cases as parameters for individual benchmarks (config parser).
3. Combines the raw outputs.
Individual benchmarks wrapping specific entities or workloads (sklearn-like estimators, custom functions, etc.)
Report generator which consumes benchmarks' outputs and generates high-level report with aggregated stats

Runner is responsible for orchestration of benchmarking cases, individual benchmarks - for actual run of each case, report generator - for human-readable output.

Configs parser workflow

Benchmarking configuration exists as two stages:

Benchmarking template where parameters or group of them might be defined as a range of values
Benchmarking case with deducted scalar values of parameters

In other words, the template has the Dict[str, AnyJSONSerializable] type, while the case has Dict[str, Dict[str, ... [str, Scalar] ... ]].

Configs parser steps:

Find all config files from the user-provided config argument or use globally defined parameters as a standalone config
Convert configs to templates
Expand template-special values and ranges to all possible cases
Remove duplicated cases and assign case-special values if possible

Special values might be assigned on three stages:

During template reading in runner
During benchmarking cases generation in runner
During run of individual benchmark

Benchmark parameters the following overwriting priority:

CLI parameters
Config template parameters
Parameters set

Documentation tree

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

docs

docs

README.md

Developer Guide

High-Level Workflow of Scikit-learn_bench

Configs parser workflow

Files

docs

Directory actions

More options

Directory actions

More options

Latest commit

History

docs

Folders and files

parent directory

README.md

Developer Guide

High-Level Workflow of Scikit-learn_bench

Configs parser workflow