exdysa commited on
Commit
67956a5
·
verified ·
1 Parent(s): 31aeb05

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +17 -31
README.md CHANGED
@@ -1,39 +1,25 @@
1
  ---
 
 
2
  license: mit
3
- ---
4
-
5
- # Introduction to TraDo
6
-
7
- [Paper](https://arxiv.org/abs/2509.06949) | [Code](https://github.com/Gen-Verse/dLLM-RL)
8
-
9
- We introduce **TraDo**, SOTA diffusion language model, trained with **TraceRL**.
10
-
11
- * **TraDo-4B-Instruct** and **TraDo-8B-Instruct** outperform similarly sized strong AR models across math reasoning tasks.
12
- * **TraDo-8B-Thinking** is the first Long-CoT diffusion language model.
13
-
14
-
15
-
16
- <p align="center">
17
- <img src="https://github.com/yinjjiew/Data/raw/main/dllm-rl/figure1.png" width="100%"/>
18
- </p>
19
 
20
-
21
- <p align="center">
22
- <img src="https://github.com/yinjjiew/Data/raw/main/dllm-rl/maintable.png" width="100%"/>
23
- </p>
24
-
25
-
26
-
27
-
28
- # Citation
29
 
30
  ```
31
- @article{wang2025trado,
32
- title={Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models},
33
- author={Wang, Yinjie and Yang, Ling and Li, Bowen and Tian, Ye and Shen, Ke and Wang, Mengdi},
34
- journal={arXiv preprint arXiv:2509.06949},
35
- year={2025}
36
- }
37
  ```
38
 
39
 
 
 
 
1
  ---
2
+ name: TraDo-4B-Instruct
3
+ base_model: Gen-Verse/TraDo-4B-Instruct
4
  license: mit
5
+ pipeline_tag: text-generation
6
+ tasks: text-generation
7
+ language: en
 
 
 
 
 
 
 
 
 
 
 
 
 
8
 
9
+ ---
10
+ > [!IMPORTANT]
11
+ > Original Model Link : [https://huggingface.co/Gen-Verse/TraDo-4B-Instruct](https://huggingface.co/Gen-Verse/TraDo-4B-Instruct)
12
+ >
 
 
 
 
 
13
 
14
  ```
15
+ name: TraDo-4B-Instruct
16
+ base_model: Gen-Verse/TraDo-4B-Instruct
17
+ license: mit
18
+ pipeline_tag: text-generation
19
+ tasks: text-generation
20
+ language: en
21
  ```
22
 
23
 
24
+ # TraDo-4B-Instruct
25
+ TraDo-4B-Instruct is a diffusion language model. This repo makes the `flash_attn` and CUDA libraries optional so that all frameworks can be supported.