D. Zhang

How Far Are Video Models from True Multimodal Reasoning? featured image

How Far Are Video Models from True Multimodal Reasoning?

Evaluating video models for true multimodal reasoning.

x.-zhang
ROSE: A Reward-Oriented Data Selection Framework for LLM Task-Specific Instruction Tuning featured image

ROSE: A Reward-Oriented Data Selection Framework for LLM Task-Specific Instruction Tuning

Reward-oriented data selection for task-specific LLM instruction tuning.

y.-wu