Skip to content

[feature] add a frontend module in wespeaker and support wavlm#344

Merged
czy97 merged 13 commits intomasterfrom
hongji-frontend-wavlm
Aug 19, 2024
Merged

[feature] add a frontend module in wespeaker and support wavlm#344
czy97 merged 13 commits intomasterfrom
hongji-frontend-wavlm

Conversation

@JiJiJiang
Copy link
Copy Markdown
Collaborator

@JiJiJiang JiJiJiang commented Jul 29, 2024

All pre-trained models and configs in the pretrained page can be loaded and used normally after this update!

image

@JiJiJiang JiJiJiang requested review from czy97 and wsstriving July 29, 2024 13:10
@JiJiJiang JiJiJiang marked this pull request as draft July 29, 2024 13:25
@JiJiJiang JiJiJiang force-pushed the hongji-frontend-wavlm branch from b9b8fb2 to a85085f Compare July 29, 2024 13:47
with torch.cuda.amp.autocast(enabled=configs['enable_amp']):
features, _ = model.module.frontend(wavs, wavs_len)

with torch.cuda.amp.autocast(enabled=configs['enable_amp']):
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think it is necessary to add amp context here. There is no pytorch model involved.

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

solved

def spec_aug(feats, num_t_mask=1, num_f_mask=1, max_t=10, max_f=8, prob=0.6):
# feats batch: (B,T,F)
# do spec_aug on all batch samples using a same group of params randomly
# TODO (hongji): do spec_aug on each sample separately
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good idea. I will try it later.

@czy97
Copy link
Copy Markdown
Collaborator

czy97 commented Jul 31, 2024

Hello @JiJiJiang , I have listed some comments. Besides, there seems no independent recipe with run.sh.

@JiJiJiang JiJiJiang requested a review from czy97 August 18, 2024 15:52
@JiJiJiang JiJiJiang marked this pull request as ready for review August 18, 2024 15:52
Copy link
Copy Markdown
Collaborator

@czy97 czy97 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Well Done !!!!

@czy97 czy97 merged commit 655039e into master Aug 19, 2024
@czy97 czy97 deleted the hongji-frontend-wavlm branch August 20, 2024 04:52
@312513065
Copy link
Copy Markdown

do you have any checkpoint on wavlm+ecapa-tdnn ?

@JiJiJiang
Copy link
Copy Markdown
Collaborator Author

do you have any checkpoint on wavlm+ecapa-tdnn ?

Sorry, I have lost the access to my exp dir, as well as the checkpoint.

JunyiPeng00 pushed a commit to JunyiPeng00/wespeaker_hubert that referenced this pull request Jul 31, 2025
…-e2e#344)

* [feature] add a frontend module in wespeaker and support wavlm

* update .gitignore

* update wavlm configs

* update wespeaker/frontend/__init__.py

* [fix] remove trailing whitespaces

* [fix] fix lint errors

* [fix] fix lint errors

* [fix] fix lint errors

* [fix] fix spelling mistakes

* update run.sh

* update wavlm configs and add run_wavlm.sh

* update README.md
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants