
Conversation

@sanchit-gandhi (Contributor)
What does this PR do?

Currently, only 2-dimensional convolutional layers are renamed and reshaped in the PyTorch to Flax conversion script. This PR handles the case of 1-dimensional convolution layers in an entirely equivalent way to their 2-dimensional counterparts.
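For context, the reshaping in question comes down to a transpose of the weight tensor. A minimal NumPy sketch (not the actual conversion script; the helper name is hypothetical): PyTorch's `Conv1d` stores its weight as `(out_channels, in_channels, kernel_size)`, while a Flax `nn.Conv` kernel is laid out as `(kernel_size, in_channels, out_channels)`.

```python
import numpy as np

def pt_conv1d_weight_to_flax(pt_weight: np.ndarray) -> np.ndarray:
    """Transpose a PyTorch Conv1d weight into the Flax kernel layout.

    PyTorch layout: (out_channels, in_channels, kernel_size)
    Flax layout:    (kernel_size, in_channels, out_channels)
    """
    assert pt_weight.ndim == 3, "expected a 1D-conv weight"
    return pt_weight.transpose(2, 1, 0)

pt_weight = np.zeros((16, 8, 5))              # (out, in, kernel)
flax_kernel = pt_conv1d_weight_to_flax(pt_weight)
print(flax_kernel.shape)                      # (5, 8, 16) -> (kernel, in, out)
```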

@HuggingFaceDocBuilder commented Feb 4, 2022

The documentation is not available anymore as the PR was closed or merged.

@patil-suraj (Contributor) left a comment


LGTM! Thanks for handling this!

Note for the future: We should try to find a more robust way of detecting Conv layers.

Also cc @patrickvonplaten

@patil-suraj patil-suraj merged commit 854a0d5 into huggingface:master Feb 4, 2022
@patrickvonplaten (Contributor)

I'm a bit surprised that we needed this. We already had 1D conv layers in Flax in Wav2Vec2, and the conversion worked:


```python
# conv1d layer
renamed_pt_tuple_key = pt_tuple_key[:-1] + ("kernel",)
if pt_tuple_key[-1] == "weight" and pt_tensor.ndim == 3 and not is_key_or_prefix_key_in_dict(pt_tuple_key):
```

What we do here is equivalent to what is done below for `# linear layer`, so I don't understand why we've added this. Was some code failing before?

`pt_tensor.transpose(2, 1, 0)` is the same as `pt_tensor.T`, and if `pt_tuple_key[-1] == "weight" and pt_tensor.ndim == 3 and not is_key_or_prefix_key_in_dict(pt_tuple_key)` is true, then `pt_tuple_key[-1] == "weight" and not is_key_or_prefix_key_in_dict(pt_tuple_key)` is also true.
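The first claim is easy to verify in isolation. A quick NumPy sketch (not from the PR itself): for an N-dimensional array, `.T` reverses all axes, so for `ndim == 3` it is exactly `transpose(2, 1, 0)`.

```python
import numpy as np

# Dummy 1D-conv weight in PyTorch layout: (out_channels, in_channels, kernel_size)
pt_tensor = np.arange(2 * 3 * 4).reshape(2, 3, 4)

# .T reverses all axes; for a 3-dimensional array that is transpose(2, 1, 0),
# i.e. both produce the same (kernel_size, in_channels, out_channels) layout.
assert np.array_equal(pt_tensor.transpose(2, 1, 0), pt_tensor.T)
print(pt_tensor.T.shape)  # (4, 3, 2)
```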

We should avoid at all costs adding more complexity to these weight conversion statements and keep things as simple as possible. Remember that all PT<->Flax conversions depend on this code, so we should not clutter it and should be extra careful here. If we add a new statement like this, there has to be at least a test that shows the change is needed. At the moment I cannot think of a single use case where this is the case -> we already had 1D-conv layer conversions working for Flax Wav2Vec2 <-> PT Wav2Vec2.

=> IMO we should revert this PR. The comments could indeed be improved, however, so I'm happy to change the comment `# conv layer` to `# 2D conv layer` and `# linear layer` to `# linear and 1D conv layer`.

Contributor

Completely agree, we actually don't need this. My bad, not my best review. Thanks a lot!

