LLM in the Loop: Creating the PARADEHATE Dataset for Hate Speech Detoxification Paper • 2506.01484 • Published Jun 2 • 6
Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering Paper • 2505.15805 • Published May 21 • 3
Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models Paper • 2505.02847 • Published May 1 • 28
Llama-3.1-FoundationAI-SecurityLLM-Base-8B Technical Report Paper • 2504.21039 • Published Apr 28 • 15
Open Deep Search: Democratizing Search with Open-source Reasoning Agents Paper • 2503.20201 • Published Mar 26 • 48
AI Policy @🤗: Response to the White House AI Action Plan RFI Article • Published Mar 19 • 30
SafeArena: Evaluating the Safety of Autonomous Web Agents Paper • 2503.04957 • Published Mar 6 • 21
Exploiting Instruction-Following Retrievers for Malicious Information Retrieval Paper • 2503.08644 • Published Mar 11 • 16
The Hidden Risks of Large Reasoning Models: A Safety Assessment of R1 Paper • 2502.12659 • Published Feb 18 • 7
Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region Paper • 2502.13946 • Published Feb 19 • 10
SAFE-SQL: Self-Augmented In-Context Learning with Fine-grained Example Selection for Text-to-SQL Paper • 2502.11438 • Published Feb 17 • 8
GuardReasoner: Towards Reasoning-based LLM Safeguards Paper • 2501.18492 • Published Jan 30 • 88