Skip to content

[inductor][cpu]inductor_max_autotune float32 models accuracy crash in 2024-08-26 nightly releaseΒ #134998

@zxd1997066

Description

@zxd1997066

πŸ› Describe the bug

fp32 static shape default wrapper

suite name thread accuracy perf reason(reference only)
huggingface AlbertForMaskedLM multiple X √ AlbertForMaskedLM, KeyError: '_frozen_param0'
huggingface ElectraForCausalLM multiple X √ ElectraForCausalLM, KeyError: '_frozen_param0'
huggingface T5ForConditionalGeneration multiple X √ T5ForConditionalGeneration, KeyError: '_frozen_param0'
huggingface T5Small multiple X √ T5Small, KeyError: '_frozen_param0'
huggingface XGLMForCausalLM multiple X √ XGLMForCausalLM, KeyError: '_frozen_param0'
torchbench hf_Albert multiple X √ hf_Albert, KeyError: '_frozen_param0'
torchbench hf_Bert multiple X √ hf_Bert, KeyError: '_frozen_param0'
torchbench hf_DistilBert multiple X √ hf_DistilBert, KeyError: '_frozen_param0'
torchbench hf_T5 multiple X √ hf_T5, KeyError: '_frozen_param0'
torchbench speech_transformer multiple X X speech_transformer, KeyError: '_frozen_param0'
huggingface AlbertForMaskedLM single X √ AlbertForMaskedLM, KeyError: '_frozen_param0'
torchbench hf_Albert single X √ hf_Albert, KeyError: '_frozen_param0'
torchbench speech_transformer single X √ speech_transformer, KeyError: '_frozen_param0'

fp32 dynamic shape default wrapper

suite name thread accuracy perf reason(reference only)
huggingface AlbertForMaskedLM multiple X √ AlbertForMaskedLM, KeyError: '_frozen_param0'
huggingface ElectraForCausalLM multiple X √ ElectraForCausalLM, KeyError: '_frozen_param0'
torchbench speech_transformer multiple X X speech_transformer, KeyError: '_frozen_param0'
huggingface AlbertForMaskedLM single X √ AlbertForMaskedLM, KeyError: '_frozen_param0'
torchbench hf_Albert single X √ hf_Albert, KeyError: '_frozen_param0'

Versions

SW info

name target_branch target_commit refer_branch refer_commit
torchbench main 23512dbe main 23512dbe
torch main 2553278 main b4a1673
torchvision main 0.19.0a0+d23a6e1 main 0.19.0a0+d23a6e1
torchtext main 0.16.0a0+b0ebddc main 0.16.0a0+b0ebddc
torchaudio main 2.4.0a0+b3f6f51 main 2.4.0a0+b3f6f51
torchdata main 0.7.0a0+11bb5b8 main 0.7.0a0+11bb5b8
dynamo_benchmarks main nightly main nightly

Repro:
inductor_single_run.sh
bash inductor_single_run.sh thread inference accuracy suite name float32 first static/dynamic default 0 inductor_max_autotune
Suspected guilty commit: c42ac54
torchbench-hf_T5-inference-float32-static-default-multiple-accuracy-crash_guilty_commit.log
cc @ezyang @chauhang @penguinwu @WeizhuoZhang-intel @chuanqi129 @chunyuan-w

### Tasks

Metadata

Metadata

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions