https://arxiv.org/pdf/2601.18081
韩沛煊
HakHan
·
AI & ML interests
None yet
Recent Activity
upvoted a paper about 7 hours ago
PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems upvoted a paper 20 days ago
Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues