AI & ML interests
None yet
Organizations
None yet
zeliang0426/Distill_Llama_Darpo-cache-adapter-3k
Text Generation
•
8B
•
Updated
•
2
zeliang0426/Distill_Llama_Darpo-cache-lora-3k
Updated
zeliang0426/8k_Distill_Llama_Darpo-cache-adapter-3k
Text Generation
•
8B
•
Updated
•
2
zeliang0426/SuperLong_Distill_Llama_Darpo-cache-lora-3k
Updated
zeliang0426/SuperLong_Distill_Llama_Darpo-cache-adapter-3k
Updated
zeliang0426/QKV_Qwen25-7-full-lora-3k
Updated
zeliang0426/QKV_Qwen25-7-cache-lora-3k
Updated
zeliang0426/qwen25_code_r1_grpo_cache
Updated
zeliang0426/qwen25_code_r1_grpo_think
Text Generation
•
3B
•
Updated
•
1
zeliang0426/qwen25_code_r1_grpo_full
Updated
zeliang0426/Gemma3-Darpo-full-lora-3k
Updated
zeliang0426/Limited_Base-Qwen25-7-Think-adapter-3k
Text Generation
•
8B
•
Updated
•
1
zeliang0426/Limted_Base-Qwen25-7-cache-lora-3k
Updated
zeliang0426/Gemma3-Darpo-cache-adapter-3k
Text Generation
•
4B
•
Updated
•
1
zeliang0426/Base-Qwen25-7-full-lora-3k
Updated
zeliang0426/Base-Qwen25-7-Think-adapter-3k
Text Generation
•
8B
•
Updated
•
1
zeliang0426/Llama_Darpo-full-lora-3k
Updated
zeliang0426/Base-Qwen25-7-cache-lora-3k
Updated
zeliang0426/Llama_Darpo-cache-adapter-3k
Text Generation
•
3B
•
Updated
•
1
zeliang0426/Gemma3-Darpo-cache-lora-3k
Updated
zeliang0426/Llama_Darpo-cache-lora-3k
Updated
zeliang0426/1e-6-Qwen25-7-Think-adapter-3k
Text Generation
•
8B
•
Updated
zeliang0426/Qwen25-7-Think-adapter-3k
Text Generation
•
8B
•
Updated
•
2
zeliang0426/1e-6-Qwen25-7-full-lora-3k
Updated
zeliang0426/Qwen25-7-full-lora-3k
Updated
zeliang0426/1e-6-Qwen25-7-cache-lora-3k
Updated
zeliang0426/Qwen25-7-cache-lora-3k
Updated
zeliang0426/ddp-GSM8K-CacheTraining-LORA
Updated
zeliang0426/tofu_qwen-2.5-3b
Updated
zeliang0426/Fix-Strict_Darpo-cache-adapter-3k
Text Generation
•
3B
•
Updated
•
1