Preserve integer/bool tensor dtype in to_device() (fixes #36) #80
Open
lonexreb wants to merge 2 commits into NVlabs:main from
Conversation
When `dtype` is provided, `to_device` previously cast every tensor in the payload, including `input_ids` and `attention_mask`, to that dtype. With mixed-precision inference (e.g. `dtype=torch.bfloat16`) this turns token IDs into floats and breaks subsequent embedding lookups.

Only apply `dtype` to floating-point tensors; integer and boolean tensors keep their original dtype and are still moved to `device`. Behavior is unchanged when `dtype` is `None` (the path used by `test_inference.py`).

Fixes NVlabs#36

Signed-off-by: lonexreb <[email protected]>
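A minimal sketch of the guarded cast described above (the recursion over mappings and sequences follows the contract stated in this PR, but the exact structure of the real helper may differ):

```python
import torch

def to_device(data, device, dtype=None):
    """Recursively move tensors in a payload to `device`; when `dtype`
    is given, cast only floating-point tensors (sketch, not the exact diff)."""
    if isinstance(data, torch.Tensor):
        if dtype is None or not torch.is_floating_point(data):
            # int64 input_ids / attention_mask and bool masks: move only
            return data.to(device=device)
        return data.to(device=device, dtype=dtype)
    if isinstance(data, dict):
        return {k: to_device(v, device, dtype) for k, v in data.items()}
    if isinstance(data, (list, tuple)):
        return type(data)(to_device(v, device, dtype) for v in data)
    return data  # non-tensor leaves (str, int) pass through unchanged
```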
Five scenarios covering the issue NVlabs#36 contract:

- `dtype=None` preserves all original dtypes
- `dtype=<float>` keeps integer tensors as-is and casts floats
- bool tensors are preserved when a float dtype is requested
- recursion descends into nested mappings and sequences
- non-tensor leaves (str, int) pass through unchanged

Signed-off-by: lonexreb <[email protected]>
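As a sketch, those five scenarios could look like the following (the `helper` import path and test names are illustrative; `"cpu"` stands in for `"cuda"` so the snippet runs anywhere):

```python
import torch
from helper import to_device  # hypothetical import path

def test_dtype_none_preserves_dtypes():
    out = to_device({"ids": torch.ones(2, dtype=torch.int64)}, "cpu")
    assert out["ids"].dtype == torch.int64

def test_float_dtype_keeps_ints_and_casts_floats():
    batch = {
        "input_ids": torch.ones(2, dtype=torch.int64),
        "pixel_values": torch.ones(2, dtype=torch.float32),
    }
    out = to_device(batch, "cpu", torch.bfloat16)
    assert out["input_ids"].dtype == torch.int64
    assert out["pixel_values"].dtype == torch.bfloat16

def test_bool_preserved_under_float_dtype():
    out = to_device({"mask": torch.ones(2, dtype=torch.bool)}, "cpu", torch.bfloat16)
    assert out["mask"].dtype == torch.bool

def test_recursion_into_nested_containers():
    out = to_device({"a": [torch.ones(1, dtype=torch.float32)]}, "cpu", torch.bfloat16)
    assert out["a"][0].dtype == torch.bfloat16

def test_non_tensor_leaves_pass_through():
    out = to_device({"name": "vlm", "n": 3}, "cpu", torch.bfloat16)
    assert out == {"name": "vlm", "n": 3}
```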
Summary
Fixes #36.
`helper.to_device()` currently casts every tensor it encounters to the requested `dtype`. Under mixed-precision inference (e.g. `dtype=torch.bfloat16`), this also rewrites `input_ids` and `attention_mask` from `int64` to `bfloat16`, which then crashes the embedding lookup in the VLM. This PR restricts the dtype cast to floating-point tensors. Integer and boolean tensors keep their original dtype and are still moved to `device`.

Behavior

| Call | `dtype` arg | Result |
| --- | --- | --- |
| `test_inference.py:52` (`to_device(model_inputs, "cuda")`) | `None` | unchanged: all tensors keep their original dtype |
| `to_device(batch, "cuda", torch.bfloat16)` | `torch.bfloat16` | floats cast to `bfloat16`; ints/bools preserved |

The reporter (@aniekannn) suggested the same shape of fix in the issue.
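For illustration, the failure mode reproduces in a few standalone lines (this is not the repo's code, just the underlying PyTorch behavior):

```python
import torch

# Embedding lookup requires integer indices; a blanket bfloat16 cast breaks it.
emb = torch.nn.Embedding(10, 4)
ids = torch.tensor([1, 2, 3])           # int64 token IDs
print(emb(ids).shape)                   # torch.Size([3, 4])
try:
    emb(ids.to(torch.bfloat16))         # what the old unconditional cast produced
except RuntimeError as e:
    print("embedding lookup fails:", e)
```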
Test plan
- The device-only call path in `test_inference.py` is unaffected (dtype is `None`, so the original `data.to(device=device)` path is taken).
- Ran a payload with a float tensor and an `int64` tensor through `to_device(..., dtype=torch.bfloat16)` and confirmed the float is cast and the int stays `int64`.
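A quick manual check along those lines (again with the hypothetical `helper` import path; swap `"cpu"` for `"cuda"` on a GPU machine):

```python
import torch
from helper import to_device  # hypothetical import path

batch = {
    "input_ids": torch.tensor([1, 2, 3], dtype=torch.int64),
    "pixel_values": torch.randn(1, 3),
}
out = to_device(batch, "cpu", torch.bfloat16)
assert out["input_ids"].dtype == torch.int64        # preserved
assert out["pixel_values"].dtype == torch.bfloat16  # cast
```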