Add LFM2-VL support #40259
Conversation
zucchini-nlp left a comment
Yay, happy to see a VLM release from LiquidAI! I left a few comments to refine and clean up the PR. It would be nice to use modular, because the model arch is very similar to existing VLMs and it makes the review process easier/faster.
```python
def _smart_resize(
    self,
    image: Image.Image,
    downsample_factor: int,
    min_image_tokens: int,
    max_image_tokens: int,
    encoder_patch_size: int,
) -> Image.Image:
```
maybe we can use modular and copy LfmV2ImageProcessor from Qwen-VL with minor changes, since that looks to be the closest processor
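For reference, a minimal sketch of how such a modular file might look if the copy-from-Qwen-VL route were taken (the Lfm2Vl class name and the overridden method are assumptions, not code from this PR):

```python
# modular_lfm2_vl.py -- hypothetical sketch of the modular approach.
# The modular converter (utils/modular_model_converter.py in transformers)
# expands files like this into a standalone image-processing module.
from PIL import Image

from transformers import Qwen2VLImageProcessor


class Lfm2VlImageProcessor(Qwen2VLImageProcessor):
    # Override only the resize logic; everything else is inherited
    # from the Qwen-VL image processor.
    def _smart_resize(
        self,
        image: Image.Image,
        downsample_factor: int,
        min_image_tokens: int,
        max_image_tokens: int,
        encoder_patch_size: int,
    ) -> Image.Image:
        ...
```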
I'm afraid there will be many changes to the Qwen-VL implementation, as we treat images up to 512x512 pixels differently from larger ones
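A rough illustration of that split (the 512-pixel threshold comes from the comment above; the helper body is an assumption, not the PR's actual code):

```python
from PIL import Image

# Hypothetical sketch: images at or below 512x512 take a simple
# single-tile resize path, while larger images would go through a
# separate tiling / token-budget path.
SMALL_IMAGE_THRESHOLD = 512


def resize_for_encoder(image: Image.Image, encoder_patch_size: int) -> Image.Image:
    width, height = image.size
    if width <= SMALL_IMAGE_THRESHOLD and height <= SMALL_IMAGE_THRESHOLD:
        # Small images: snap both dimensions to a multiple of the patch size.
        new_w = max(encoder_patch_size, round(width / encoder_patch_size) * encoder_patch_size)
        new_h = max(encoder_patch_size, round(height / encoder_patch_size) * encoder_patch_size)
        return image.resize((new_w, new_h))
    # Large images: tiling / token-budget resize would go here; omitted.
    raise NotImplementedError("large-image path not sketched")
```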
okay, maybe Qwen-VL isn't that close to LFM-VL. It's nice to copy from a similar processor when one exists, but we can make a separate class if there isn't any. In that case, we don't need modular and it's easier to just keep it as is in processing_xxx.py
```python
        return list(dict.fromkeys(image_processor_input_names + tokenizer_input_names))


__all__ = ["Lfm2VlProcessor"]
```
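(Side note on the idiom above: `dict.fromkeys` deduplicates while preserving first-occurrence order, unlike `set`. The name lists below are illustrative only:)

```python
# dict.fromkeys keeps the first occurrence of each key, in order,
# so the merged processor input names stay deterministic.
image_processor_input_names = ["pixel_values", "image_sizes"]
tokenizer_input_names = ["input_ids", "attention_mask", "image_sizes"]

merged = list(dict.fromkeys(image_processor_input_names + tokenizer_input_names))
print(merged)  # ['pixel_values', 'image_sizes', 'input_ids', 'attention_mask']
```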
it would be nice if you could add a few helpers here to make the model vLLM compatible out of the box. We have a doc page on which helpers are needed here
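As one hedged illustration of what such a helper does (the name, signature, and formula below are hypothetical, not taken from the referenced doc page): serving engines like vLLM need to compute how many placeholder tokens an image expands to without running the full processor.

```python
# Hypothetical helper -- name, signature, and formula are assumptions.
# An engine needs the placeholder-token count per image up front so it
# can pre-allocate slots for the multimodal embeddings.
def get_num_image_tokens(
    image_height: int,
    image_width: int,
    encoder_patch_size: int,
    downsample_factor: int,
) -> int:
    patches_h = -(-image_height // encoder_patch_size)  # ceil division
    patches_w = -(-image_width // encoder_patch_size)
    return (patches_h * patches_w) // (downsample_factor**2)
```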
Given that our backbone is a hybrid model and we're not sure this functionality is supported, could we postpone the vLLM integration until the next update?
@zucchini-nlp thank you for the review! Sorry, the PR was still a draft and wasn't quite ready. I have addressed most of your comments. Some model and processor tests are failing due to failing language-backbone tests and some kwargs-merging tests that I'm not sure how to resolve.
Ah yeah, I just wanted to do a preliminary review for general format. No worries, ping me when you need another review :) The tests seem to be failing due to typing
[For maintainers] Suggested jobs to run (before merge): run-slow: auto, lfm2_vl
@zucchini-nlp Hi, let me know if any more changes are required. I'd appreciate your help with resolving some of the failing CI.
Merged in #40624
Add support for LFM2-VL models.
LFM2-VL is Liquid AI's first series of multimodal models, designed to process text and images at variable resolutions. Built on the LFM2 backbone, it is optimized for low-latency and edge AI applications.
Checkpoints are available here.
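A minimal usage sketch once the model is in transformers (the checkpoint ID and chat-template fields are assumptions based on how comparable VLMs on the Hub are loaded):

```python
from transformers import AutoModelForImageTextToText, AutoProcessor

# Checkpoint ID is an assumption; see the LiquidAI Hub page for actual names.
model_id = "LiquidAI/LFM2-VL-1.6B"

processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForImageTextToText.from_pretrained(model_id)

messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://example.com/cat.png"},
            {"type": "text", "text": "Describe this image."},
        ],
    },
]
inputs = processor.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
)
outputs = model.generate(**inputs, max_new_tokens=64)
print(processor.batch_decode(outputs, skip_special_tokens=True)[0])
```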