山田太郎

user-2009

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 2 days ago

AnyGroundBench: A Specialized-Domain Benchmark for Video Grounding in Vision-Language Models

Paper • 2607.02269 • Published 4 days ago • 7

upvoted 2 papers 15 days ago

Looped World Models

Paper • 2606.18208 • Published 20 days ago • 476

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs

Paper • 2605.30611 • Published May 28 • 250

upvoted 4 papers about 1 month ago

Reflective Prompt Tuning through Language Model Function-Calling

Paper • 2605.21781 • Published May 20 • 9

How can embedding models bind concepts?

Paper • 2605.31503 • Published May 29 • 8

Mean Mode Screaming: Mean-Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published May 7 • 238

Ethical Hyper-Velocity (EHV): A Provably Deterministic Governance-Aware JIT Compiler Architecture for Agentic Systems

Paper • 2605.17909 • Published May 18 • 4

upvoted 2 papers about 2 months ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published May 12 • 196

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

Paper • 2605.11739 • Published May 13 • 60

upvoted 2 papers 2 months ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published May 3 • 171

Universal statistical signatures of evolution in artificial intelligence architectures

Paper • 2604.10571 • Published Apr 12 • 4

upvoted 6 papers 3 months ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 509

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 638

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Paper • 2604.08546 • Published Apr 9 • 116

ClawArena: Benchmarking AI Agents in Evolving Information Environments

Paper • 2604.04202 • Published Apr 5 • 37

TokenDial: Continuous Attribute Control in Text-to-Video via Spatiotemporal Token Offsets

Paper • 2603.27520 • Published Mar 29 • 4

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 353

upvoted 3 papers 4 months ago

From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Paper • 2602.22859 • Published Feb 26 • 150

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 526

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 221