Feature Request: Full Support for Direct Preference Optimization (DPO)

Hello, I'm interested in knowing if there are any plans to implement Full support for Direct Preference Optimization (DPO) in the upcoming releases.

Are there any current efforts or roadmap items related to this, or is it something that might be considered in future updates?

Thank you for your time and consideration.