H. Zhang

ROSE: A Reward-Oriented Data Selection Framework for LLM Task-Specific Instruction Tuning

Reward-oriented data selection for task-specific LLM instruction tuning.

y.-wu