Wav2vec2 xlsr nan train loss

tadf · June 11, 2021, 5:43am

Hi,
I’m running into a NaN training_loss when training wav2vec2 xlsr on my custom dataset.
The weird thing is that even though training_loss goes to NaN, eval_loss still goes down, and the error rates (CER and WER) go down as well.
I’ve experimented with a lower learning_rate, but I still get similar behavior. I’m logging with wandb.

My graphs look like the following:

There’s no value for train/loss after ~60 steps because it is NaN, but eval/loss is still decreasing.

Has anyone experienced similar behavior?

tadf · June 14, 2021, 4:22am

I’ve let it train over the weekend. The train loss is still NaN, but eval loss and both WER and CER continue to decrease.
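
One possible explanation for this pattern (train loss stuck at NaN while eval loss and WER/CER keep improving) is that only a few batches produce a non-finite loss, and the logged train loss is derived from a running sum that never recovers once a NaN has entered it. That is an assumption about how the logging works, not something confirmed in this thread. The sketch below uses a hypothetical `windowed_train_loss` helper and made-up loss values purely to illustrate the effect; it is not the actual training or logging code.

```python
import math

def windowed_train_loss(batch_losses, log_every, skip_non_finite=False):
    """Hypothetical logger: report the mean batch loss every `log_every` steps.

    The accumulator is reset by subtracting it from itself, so once it holds
    NaN it stays NaN (NaN - NaN is NaN) unless non-finite values are skipped.
    """
    running = 0.0
    logged = []
    for step, loss in enumerate(batch_losses, start=1):
        if skip_non_finite and not math.isfinite(loss):
            loss = 0.0  # drop the bad batch from the running sum
        running += loss
        if step % log_every == 0:
            logged.append(running / log_every)
            running -= running  # in-place reset; does not clear a NaN
    return logged

# Made-up per-batch losses: only the third batch is NaN.
losses = [2.0, 1.8, float("nan"), 1.6, 1.5, 1.4]

poisoned = windowed_train_loss(losses, log_every=2)
filtered = windowed_train_loss(losses, log_every=2, skip_non_finite=True)

print(poisoned)  # every window from the bad batch onward is NaN
print(filtered)  # all windows stay finite
```

If something like this is happening, the model can still be learning from the finite batches, which would be consistent with the eval metrics improving. Checking the per-batch loss with `math.isfinite` (or the tensor equivalent) would show whether only some batches are affected.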
