RedHatAI
/

Llama-2-7b-chat-quantized.w4a16

Text Generation

text-generation-inference

4-bit precision

Model card Files Files and versions

Llama-2-7b-chat-quantized.w4a16

3.9 GB

2 contributors

History: 12 commits

alexmarques's picture

Update README.md

49f4794 verified over 1 year ago

.gitattributes
1.52 kB

initial commit over 1 year ago
README.md
7.47 kB

Update README.md over 1 year ago
config.json
1.03 kB

Upload config.json with huggingface_hub over 1 year ago
model.safetensors
3.89 GB
xet

Upload model.safetensors with huggingface_hub over 1 year ago
quantize_config.json
269 Bytes

Upload quantize_config.json with huggingface_hub over 1 year ago
special_tokens_map.json
414 Bytes

Upload special_tokens_map.json with huggingface_hub over 1 year ago
tokenizer.json
1.84 MB

Upload tokenizer.json with huggingface_hub over 1 year ago
tokenizer.model
500 kB
xet

Upload tokenizer.model with huggingface_hub over 1 year ago
tokenizer_config.json
1.76 kB

Upload tokenizer_config.json with huggingface_hub over 1 year ago