I am confused about two observations regarding beam search:
- From reading @patrickvonplaten's blog post [How to generate text: using different decoding methods for language generation with Transformers](https://huggingface.co/blog/how-to-generate), my understanding is that each beam in a `BeamSearchEncoderDecoderOutput` should begin with a different token. Am I wrong in that assertion?
- The documentation for the `sequences` attribute of `BeamSearchEncoderDecoderOutput` states that "The second dimension (sequence_length) is either equal to `max_length` or shorter if all batches finished early due to the `eos_token_id`." In my observations it has always been longer than `max_length`. How come? (A minimal sketch of the kind of call I mean is below.)
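For reference, here is roughly how I am producing and inspecting these outputs. The model name, prompt, and generation settings are just placeholders, not my exact setup:

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Placeholder encoder-decoder checkpoint and prompt
tokenizer = AutoTokenizer.from_pretrained("t5-small")
model = AutoModelForSeq2SeqLM.from_pretrained("t5-small")

inputs = tokenizer(
    "translate English to German: The house is wonderful.",
    return_tensors="pt",
)

# Beam search; return_dict_in_generate=True makes generate() return a
# BeamSearchEncoderDecoderOutput instead of a plain tensor of token ids
outputs = model.generate(
    **inputs,
    num_beams=4,
    num_return_sequences=4,
    max_length=20,
    return_dict_in_generate=True,
)

print(type(outputs))            # expected: BeamSearchEncoderDecoderOutput
print(outputs.sequences.shape)  # (num_return_sequences, sequence_length) -- compare to max_length
print(outputs.sequences[:, 0])  # first token of each returned beam
```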
Thanks 