LLM NPUW tests for PR checks #30051

AsyaPronina · 2025-04-09T21:50:43Z

Details:

Added LLM NPUW tests for accuracy and behavior using SimpleLLMPipeline helper class

Tickets:

EISW-144182

AsyaPronina · 2025-04-09T21:59:46Z

src/plugins/intel_npu/tests/functional/behavior/npuw/test_engine/simple_llm_pipeline.hpp

+
+class SimpleLLMPipeline {
+public:
+    void initialize(const std::string& model_path, ov::Core& core, const ov::AnyMap& config);


AsyaPronina · 2025-04-09T22:00:08Z

src/plugins/intel_npu/tests/functional/behavior/npuw/llm_behavior_tests.cpp

+    register_mock_plugins_in_ov();
+
+    // Do the actual test:
+


AsyaPronina · 2025-04-09T22:00:16Z

src/plugins/intel_npu/tests/functional/behavior/npuw/llm_behavior_tests.cpp

+    // ------------------------ Prefill model ---------------------------
+    // 1 infer request for head:
+    EXPECT_CREATE_SYNC_INFER_REQ(mock_npu_for_prefill, MODEL(0), TIMES(1));  
+    // 2 infer requests for function, `create_sync_infer_request()`


dmatveev · 2025-04-10T11:46:12Z

src/plugins/intel_npu/tests/functional/behavior/npuw/llm_accuracy_tests.cpp

+        ov::AnyMap config;
+        std::tie(model_path, config, input_ids, reference_ids) = param;
+        config["NPUW_DEVICES"] = "CPU";
+        config["NPUW_LLM_MIN_RESPONSE_LEN"] = 4;


what's the prompt length? Not sure why you limit this one here, but if your test prompt is short you may want to limit the PROMPT_LEN as well to improve the prefill's performance on CPU.

dmatveev · 2025-04-10T11:46:32Z

src/plugins/intel_npu/tests/functional/behavior/npuw/llm_accuracy_tests.cpp

+                                 std::vector<int64_t>, std::vector<int64_t>>;
+} // anonymous namespace
+
+class LLMAccuracyTestsNPUW : public ::testing::TestWithParam<LLMTestParams> {


This shouldn't be called an ACCURACY test to avoid confusion with the real accuracy tests.

dmatveev · 2025-04-10T11:46:54Z

src/plugins/intel_npu/tests/functional/behavior/npuw/llm_accuracy_tests.cpp

+namespace {
+const std::vector<int64_t> What_is_OpenVINO =
+    {529, 29989, 1792, 29989, 29958, 13, 5618, 338, 4673, 29963, 1177,
+     29949, 29973, 2, 29871, 13, 29966, 29989, 465, 22137, 29989, 29958, 13};
+const std::vector<int64_t> OpenVINO =
+    {6585, 29963, 1177, 29949}; 


this namespace can have a name token_ids.

AsyaPronina added 4 commits April 7, 2025 13:39

First accuracy test

29aa6ee

First implementation of E2E LLM test

26ac169

Added test

76d7066

Refactored LLM NPUW accuracy tests

ac980f6

AsyaPronina requested review from a team as code owners April 9, 2025 21:50

github-actions bot added category: build OpenVINO cmake script / infra category: NPU OpenVINO NPU plugin category: NPUW NPUW plugin labels Apr 9, 2025

LLM Behavior tests

4f4186e

AsyaPronina force-pushed the add_new_npuw_tests branch from 53a65f7 to 4f4186e Compare April 9, 2025 21:55

AsyaPronina commented Apr 9, 2025

View reviewed changes

dmatveev reviewed Apr 10, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

LLM NPUW tests for PR checks #30051

LLM NPUW tests for PR checks #30051

AsyaPronina commented Apr 9, 2025 •

edited

Loading

Uh oh!

AsyaPronina Apr 9, 2025

Uh oh!

AsyaPronina Apr 9, 2025

Uh oh!

AsyaPronina Apr 9, 2025

Uh oh!

dmatveev Apr 10, 2025

Uh oh!

dmatveev Apr 10, 2025

Uh oh!

dmatveev Apr 10, 2025

Uh oh!

Uh oh!

LLM NPUW tests for PR checks #30051

Are you sure you want to change the base?

LLM NPUW tests for PR checks #30051

Conversation

AsyaPronina commented Apr 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Details:

Tickets:

Uh oh!

AsyaPronina Apr 9, 2025

Choose a reason for hiding this comment

Uh oh!

AsyaPronina Apr 9, 2025

Choose a reason for hiding this comment

Uh oh!

AsyaPronina Apr 9, 2025

Choose a reason for hiding this comment

Uh oh!

dmatveev Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

dmatveev Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

dmatveev Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

AsyaPronina commented Apr 9, 2025 •

edited

Loading