Huiqiang Jiang
iofu728
AI & ML interests
None yet
Recent Activity
authored
a paper
7 days ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
upvoted
a
paper
8 days ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
authored
a paper
7 months ago
Chain-of-Model Learning for Language Model