clip : use smart pointer (⚠️ breaking change) #12869

ngxson · 2025-04-10T09:21:29Z

Summarize this change

Breaking change to clip.cpp

The struct clip_image_f32_batch now become an opaque pointer. To access the underlay data, use these API calls:

size_t clip_image_f32_batch_n_images(const struct clip_image_f32_batch * batch); // equivalent to batch->size()
size_t clip_image_f32_batch_nx(const struct clip_image_f32_batch * batch, int idx); // equivalent to batch[idx]->nx
size_t clip_image_f32_batch_ny(const struct clip_image_f32_batch * batch, int idx); // equivalent to batch[idx]->ny
clip_image_f32 * clip_image_f32_get_img(const struct clip_image_f32_batch * batch, int idx); // equivalent to batch[idx]->data

Note: The struct clip_image_u8_batch is also became an opaque pointer, but I cannot find any code actually using it. If this change affect you, please leave a comment in this PR.

Note 2: If your existing code initialize clip_image_f32_batch on heap, here is how to migrate:

// old code:
clip_image_f32_batch batch_f32;
clip_image_preprocess(ctx.ctx_clip, img_u8, &batch_f32);
clip_image_batch_encode(ctx.ctx_clip, ctx.n_threads, &batch_f32, ...);
clip_image_f32_batch_free(&batch_f32);

// new code:
clip_image_f32_batch * batch_f32 = clip_image_f32_init();
clip_image_preprocess(ctx.ctx_clip, img_u8, batch_f32); // no more "&"
clip_image_batch_encode(ctx.ctx_clip, ctx.n_threads, batch_f32, ...); // no more "&"
clip_image_f32_batch_free(batch_f32); // no more "&"

Test

OK:   llama-gemma3-cli ggml-org/gemma-3-4b-it-GGUF:Q4_K_M
OK:   llama-llava-cli cmp-nct/Yi-VL-6B-GGUF:Q5_K
OK:   llama-llava-cli guinmoon/MobileVLM-3B-GGUF:Q4_K_M
OK:   llama-llava-cli THUDM/glm-edge-v-5b-gguf:Q4_K_M
OK:   llama-llava-cli second-state/Llava-v1.5-7B-GGUF:Q2_K
OK:   llama-llava-cli cjpais/llava-1.6-mistral-7b-gguf:Q3_K
OK:   llama-llava-cli ibm-research/granite-vision-3.2-2b-GGUF:Q4_K_M
OK:   llama-minicpmv-cli second-state/MiniCPM-Llama3-V-2_5-GGUF:Q2_K
OK:   llama-minicpmv-cli openbmb/MiniCPM-V-2_6-gguf:Q2_K
OK:   llama-minicpmv-cli openbmb/MiniCPM-o-2_6-gguf:Q4_0
OK:   llama-qwen2vl-cli bartowski/Qwen2-VL-2B-Instruct-GGUF:Q4_K_M

zhouwg · 2025-04-10T09:31:28Z

you are a highly-skilled and an interesting AI tech expert.hope can get help/guidance from you or collaborate with you in the future.

ggerganov · 2025-04-10T09:54:07Z

examples/llava/clip-impl.h

+struct clip_image_u8_ptr : std::unique_ptr<clip_image_u8, clip_image_u8_deleter> {
+    clip_image_u8_ptr() : std::unique_ptr<clip_image_u8, clip_image_u8_deleter>(clip_image_u8_init()) {}
+};
+
+struct clip_image_f32_ptr : std::unique_ptr<clip_image_f32, clip_image_f32_deleter> {
+    clip_image_f32_ptr() : std::unique_ptr<clip_image_f32, clip_image_f32_deleter>(clip_image_f32_init()) {}
+};
+
+typedef std::unique_ptr<clip_image_size, clip_image_size_deleter> clip_image_size_ptr;
+
+// these need to be struct to maintain compatibility with C interface
+struct clip_image_u8_batch : std::vector<clip_image_u8_ptr> {
+    clip_image_u8_batch() : std::vector<clip_image_u8_ptr>() {}
+};
+struct clip_image_f32_batch : std::vector<clip_image_f32_ptr> {
+    clip_image_f32_batch() : std::vector<clip_image_f32_ptr>() {}
+    ~clip_image_f32_batch() {}
+};


Inheritance from STL classes is generally not recommended. Although it's tempting, I think it would be better to avoid establishing this practice, as it can lead to subtle issues.

The recommended way is to use composition instead.

Thanks, that's good to know! (And I will also need to do the same on my llama_batch_ext PR)

With the help of gemini 3.5 pro, I added the composite in f8c5394

I was thinking about something much simpler like:

struct clip_image_u8_batch { std::vector<clip_image_u8_ptr> entries; }; // ... // end then using directly the `entries` member in the code batch.entries.clear(); batch.entries.push_back(...);

Yes thanks, I simplify this in f727124

The clip_image_u8_ptr is also no longer being initialized by default, I think it's not a big deal, reduce the complexity of the wrapper

(And I will also need to do the same on my llama_batch_ext PR)

Yup, I will get back to this PR after some refactoring in libllama. We had a discussion with @slaren how to improve on it.

Thanks, that's good to know! (And I will also need to do the same on my llama_batch_ext PR)

With the help of gemini 3.5 pro, I added the composite in f8c5394

Inheriting from std::unique_ptr is very strange usage of C++ pointers.

Also, std::shared_ptr might be a better option than std::unique_ptr as the former avoids dangling references. The overhead is usually not significant.

Tbh unique_ptr gives me more confident when writing code, kinda rust-like experience

Tbh unique_ptr gives me more confident when writing code, kinda rust-like experience

No. They are not equivalent to rust semantics.

I said rust-like experience, not rust-equivalent semantics. unique_ptr is useful in my case because the compiler will complain about unique_ptr being non-copiable, so I have total control over the life cycle of the object owned by unique_ptr

Makes sense. Just be aware of
.get() usage as it could create dangling pointers despite unique ownership.

* clip : use smart pointers * fix warmup * add forward declaration * misisng include * fix include (2) * composite * simplify batch ptr * fix conflict

ngxson added 2 commits April 10, 2025 00:07

clip : use smart pointers

9f2920d

fix warmup

4ab55e8

ngxson requested a review from ggerganov April 10, 2025 09:21

github-actions bot added the examples label Apr 10, 2025

ngxson changed the title ~~clip : use smart pointer everywhere (⚠️ breaking change)~~ clip : use smart pointer (⚠️ breaking change) Apr 10, 2025

add forward declaration

554256b

misisng include

39941d7

ngxson added the breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility. label Apr 10, 2025

fix include (2)

f306d59

ggerganov reviewed Apr 10, 2025

View reviewed changes

composite

f8c5394

ngxson mentioned this pull request Apr 10, 2025

server: Bring back multimodal support #8010

Open

19 tasks

simplify batch ptr

f727124

ggerganov approved these changes Apr 11, 2025

View reviewed changes

ngxson added 2 commits April 11, 2025 09:20

Merge branch 'master' into xsn/clip_smart_ptr

fcc4608

fix conflict

6dbfb30

ngxson merged commit 0c50923 into ggml-org:master Apr 11, 2025
51 checks passed

LostRuins mentioned this pull request Apr 30, 2025

clip : refactor clip_init, add tests #12757

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

clip : use smart pointer (⚠️ breaking change) #12869

clip : use smart pointer (⚠️ breaking change) #12869

ngxson commented Apr 10, 2025 •

edited

Loading

zhouwg commented Apr 10, 2025 •

edited

Loading

ggerganov Apr 10, 2025

ngxson Apr 10, 2025

ggerganov Apr 10, 2025

ngxson Apr 10, 2025

ggerganov Apr 11, 2025

acbits Apr 11, 2025

ngxson Apr 11, 2025

acbits Apr 11, 2025

ngxson Apr 11, 2025 •

edited

Loading

acbits Apr 11, 2025 •

edited

Loading

clip : use smart pointer (⚠️ breaking change) #12869

clip : use smart pointer (⚠️ breaking change) #12869

Conversation

ngxson commented Apr 10, 2025 • edited Loading

Summarize this change

Breaking change to clip.cpp

Test

zhouwg commented Apr 10, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ngxson Apr 11, 2025 • edited Loading

Choose a reason for hiding this comment

acbits Apr 11, 2025 • edited Loading

Choose a reason for hiding this comment

ngxson commented Apr 10, 2025 •

edited

Loading

zhouwg commented Apr 10, 2025 •

edited

Loading

ngxson Apr 11, 2025 •

edited

Loading

acbits Apr 11, 2025 •

edited

Loading