Zhongyi Han

zhyhan

https://zhyhan.github.io/

AI & ML interests

OOD Generalization & Detection, AI for Science

Organizations

None yet

Recent Activity

upvoted a paper about 2 months ago

Self-Improving LLM Agents at Test-Time

Paper • 2510.07841 • Published Oct 9 • 9

upvoted 2 papers 3 months ago

MachineLearningLM: Continued Pretraining Language Models on Millions of Synthetic Tabular Prediction Tasks Scales In-Context ML

Paper • 2509.06806 • Published Sep 8 • 63

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Paper • 2509.04292 • Published Sep 4 • 57

upvoted a paper 4 months ago

Can LLM-Generated Textual Explanations Enhance Model Classification Performance? An Empirical Study

Paper • 2508.09776 • Published Aug 13 • 3

upvoted a paper 5 months ago

What Has a Foundation Model Found? Using Inductive Bias to Probe for World Models

Paper • 2507.06952 • Published Jul 9 • 7

upvoted a paper 7 months ago

NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification

Paper • 2505.16938 • Published May 22 • 120

upvoted 2 papers 8 months ago

Exploring Expert Failures Improves LLM Agent Tuning

Paper • 2504.13145 • Published Apr 17 • 12

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 303

upvoted a paper 10 months ago

Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation

Paper • 2501.17749 • Published Jan 29 • 14

upvoted a paper about 1 year ago

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11, 2024 • 56

upvoted 9 papers over 1 year ago

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8, 2024 • 66

upvoted a paper almost 2 years ago

Wukong: Towards a Scaling Law for Large-Scale Recommendation

Paper • 2403.02545 • Published Mar 4, 2024 • 17
