llmat
/

Mistral-Small-24B-Instruct-2501-NVFP4

Text Generation

8-bit precision

compressed-tensors

Model card Files Files and versions

Mistral-Small-24B-Instruct-2501-NVFP4 / generation_config.json

llmat's picture

Add NVFP4 quantized model (llmcompressor oneshot).

a77c13c verified 5 months ago

history blame contribute delete

155 Bytes

	{
	"_from_model_config": true,
	"bos_token_id": 1,
	"do_sample": true,
	"eos_token_id": 2,
	"temperature": 0.15,
	"transformers_version": "4.55.4"
	}