Static llm pipeline dynamic shape model #1240

Draft
AsyaPronina wants to merge 3 commits into master from at/static-llm-pipeline-dynamic-shape-model

Conversation

@AsyaPronina AsyaPronina commented Nov 20, 2024

@github-actions github-actions bot added the "category: LLM" and "category: samples" labels Nov 20, 2024
@AsyaPronina AsyaPronina marked this pull request as draft November 20, 2024 19:26
Comment on lines 806 to 784
-    int64_t position_ids_data = prompt_len -1;
-    std::vector<int64_t> attention_mask_data(1, prompt_len);
+    int64_t position_ids_data = prompt_len - 1;
+    std::vector<int64_t> attention_mask_data(prompt_len - 1, 1);
LOL, @TolyaTalamanov !!
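
Worth spelling out what the swap above changes: `std::vector<int64_t>(count, value)` is the fill constructor, so reordering the arguments changes both the size and the contents of the mask. A tiny standalone illustration, nothing pipeline-specific assumed:

```cpp
#include <cstdint>
#include <iostream>
#include <vector>

int main() {
    const int64_t prompt_len = 5;

    // Old line: the fill constructor builds ONE element whose value is prompt_len.
    std::vector<int64_t> old_mask(1, prompt_len);       // {5}

    // New line: prompt_len - 1 elements, each equal to 1 (an all-ones mask).
    std::vector<int64_t> new_mask(prompt_len - 1, 1);   // {1, 1, 1, 1}

    std::cout << old_mask.size() << " vs " << new_mask.size() << '\n';  // 1 vs 4
}
```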

@AsyaPronina AsyaPronina force-pushed the at/static-llm-pipeline-dynamic-shape-model branch from 6cdd518 to cc34616 Compare November 27, 2024 15:41
@@ -10,7 +10,7 @@ int main(int argc, char* argv[]) try {
     std::string prompt;
     std::string models_path = argv[1];
 
-    std::string device = "CPU"; // GPU, NPU can be used as well
+    std::string device = "NPU"; // GPU, NPU can be used as well

I believe the default device should remain CPU
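
One way to keep CPU as the sample's default while still making NPU easy to try is an optional device argument. This is only a hedged sketch with a hypothetical second CLI parameter, not the sample's current interface:

```cpp
#include <cstdlib>
#include <string>

int main(int argc, char* argv[]) try {
    if (argc < 2) {
        return EXIT_FAILURE;  // the model directory is required, as in the original sample
    }
    std::string models_path = argv[1];
    // CPU stays the default; GPU or NPU can still be requested explicitly, e.g. `sample model_dir NPU`.
    std::string device = (argc > 2) ? argv[2] : "CPU";
    // ... construct ov::genai::LLMPipeline(models_path, device) exactly as the sample already does ...
    return EXIT_SUCCESS;
} catch (...) {
    return EXIT_FAILURE;
}
```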

@@ -472,7 +480,7 @@ std::optional<NPUDesc> extract_npu_descriptor(ov::Core& core) {
 ov::AnyMap get_baseline_common_config() {
     ov::AnyMap config = {
         { "NPU_COMPILATION_MODE_PARAMS", "compute-layers-with-higher-precision=Sqrt,Power,ReduceMean,Add_RMSNorm" },
-        { "NPUW_DEVICES", "NPU" },
+        { "NPUW_DEVICES", "NPU,CPU" },

I believe this shouldn't be changed.
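
If the intent is to let NPUW spill some subgraphs to CPU for this experiment, that override could arguably be passed per run through the pipeline properties instead of editing the baseline config. A hedged sketch, assuming the LLMPipeline constructor overload that takes an ov::AnyMap and that NPUW_DEVICES is forwarded to the plugin (worth double-checking), with the generate call mirroring the README usage:

```cpp
#include <iostream>
#include "openvino/genai/llm_pipeline.hpp"

int main() {
    // Per-run override; get_baseline_common_config() stays untouched.
    ov::AnyMap properties = { { "NPUW_DEVICES", "NPU,CPU" } };
    ov::genai::LLMPipeline pipe("path/to/model_dir", "NPU", properties);  // hypothetical model path

    std::cout << pipe.generate("The Sun is yellow because", ov::genai::max_new_tokens(20));
}
```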

}

ov::genai::TokenizedInputs tokenized_input;
//if (m_is_chat_conversation) {

Why is it commented out?
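
For context, the guard being asked about typically drives chat-history handling before tokenization. A hedged, free-standing sketch of that logic (the history/is_chat_conversation names are illustrative, not necessarily the pipeline's actual members):

```cpp
#include <string>
#include "openvino/genai/tokenizer.hpp"

ov::genai::TokenizedInputs tokenize_prompt(ov::genai::Tokenizer& tokenizer,
                                           ov::genai::ChatHistory& history,
                                           bool is_chat_conversation,
                                           const std::string& prompt) {
    if (is_chat_conversation) {
        // In chat mode the new user turn joins the running history and the whole
        // conversation is re-templated before tokenization.
        history.push_back({{"role", "user"}, {"content", prompt}});
        constexpr bool add_generation_prompt = true;
        std::string templated = tokenizer.apply_chat_template(history, add_generation_prompt);
        return tokenizer.encode(templated);
    }
    // Outside of a chat, the raw prompt is tokenized directly.
    return tokenizer.encode(prompt);
}
```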

DecodedResults decoded_results = {m_tokenizer.decode(encoded_results.tokens), encoded_results.scores};
auto decode_stop_time = std::chrono::steady_clock::now();

//if (m_is_chat_conversation) {

Why is it commented out?
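
Likewise, the post-decode guard usually records the model's reply so the next turn sees the whole conversation. A minimal hedged sketch (names illustrative):

```cpp
#include <string>
#include "openvino/genai/tokenizer.hpp"

void append_answer_to_history(ov::genai::ChatHistory& history,
                              bool is_chat_conversation,
                              const std::string& answer) {
    if (is_chat_conversation) {
        // The decoded answer becomes the assistant turn for the next round of chat.
        history.push_back({{"role", "assistant"}, {"content", answer}});
    }
}
```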

}

template <typename T>
void print_tensor(ov::Tensor t) {

Not needed

const ov::genai::Tokenizer& tokenizer,
const std::string& device,
const ov::AnyMap& config) {
//return std::make_unique<StaticLLMPipeline>(models_path, tokenizer, device, config);

Remove
