Any plans for gpt oss 20b?
#1
by
andhakanoon
- opened
It would be really great in local setup to use gpt oss 20b with faster through put/
I personally believe that It does not make sense to use speculative models to accelerate decoding process of a 20b model.