Yhyu13/phi-2-sft-dpo-gpt4_en-ep1-lora
Tags: PEFT, TensorBoard, Safetensors, llama-factory, lora, Generated from Trainer
License: microsoft-research-license
Branch: main
Size: 145 MB
2 contributors
History: 4 commits
Latest commit: yhyu13, "Fix base mode" (fdbe358), almost 2 years ago
Name                       Size           Last commit     Updated
Predict_20                 -              Upload          almost 2 years ago
checkpoint-1000            -              Fix base mode   almost 2 years ago
checkpoint-2000            -              Fix base mode   almost 2 years ago
checkpoint-3000            -              Fix base mode   almost 2 years ago
checkpoint-4000            -              Fix base mode   almost 2 years ago
runs                       -              Upload          almost 2 years ago
.gitattributes             1.52 kB        initial commit  almost 2 years ago
README.md                  2.95 kB        Fix base mode   almost 2 years ago
adapter_config.json        570 Bytes      Fix base mode   almost 2 years ago
adapter_model.safetensors  10.5 MB (xet)  Upload          almost 2 years ago
added_tokens.json          1.08 kB        Upload          almost 2 years ago
all_results.json           690 Bytes      Upload          almost 2 years ago
eval_results.json          543 Bytes      Upload          almost 2 years ago
merges.txt                 456 kB         Upload          almost 2 years ago
special_tokens_map.json    473 Bytes      Upload          almost 2 years ago
tokenizer_config.json      7.48 kB        Upload          almost 2 years ago
train_eval_log.txt         886 kB         Upload          almost 2 years ago
train_results.json         167 Bytes      Upload          almost 2 years ago
trainer_log.jsonl          2.46 kB        Upload          almost 2 years ago
trainer_state.json         5.06 kB        Upload          almost 2 years ago
training_args.bin          4.54 kB (xet)  Upload          almost 2 years ago
training_eval_loss.png     36.5 kB        Upload          almost 2 years ago
training_loss.png          38.4 kB        Upload          almost 2 years ago
vocab.json                 999 kB         Upload          almost 2 years ago