Kevin Xu
hellomonkey318
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
PORTool: Tool-Use LLM Training with Rewarded Tree
upvoted a paper about 1 month ago
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
Organizations
None yet