Evaluating video models for true multimodal reasoning.
Reward-oriented data selection for task-specific LLM instruction tuning.