Shuojia Fu

Direct Preference Optimization for Chatbot Fine-Tuning: An Empirical Study featured image

Direct Preference Optimization for Chatbot Fine-Tuning: An Empirical Study

Empirical study of Direct Preference Optimization for chatbot fine-tuning.

avatar
Dezhi Yu