ROSE: A Reward-Oriented Data Selection Framework for LLM Task-Specific Instruction Tuning
Reward-oriented data selection for task-specific LLM instruction tuning.
y.-wu
Reward-oriented data selection for task-specific LLM instruction tuning.
LSTM modeling for 30-day hospital readmission prediction.