feat: multiple draft generations and rlhf data collection #96

New Issue

GiteaMirror · 2025-11-11T14:05:20-06:00

GiteaMirror commented

2025-11-11 14:05:20 -06:00

Originally created by @nivibilla on GitHub (Dec 15, 2023).

Is your feature request related to a problem? Please describe.
Recent DPO models being so good shows the impact of having good rlhf datasets.

Describe the solution you'd like

Similar to lymsys's arena battle. The ability to make multiple draft generations per prompt generations (from the same model, or different models/endpoints) and then the user would be able to click which one is better(possibly even edit the answer and fix). This would provide very rich and useful conversation trees along with rlhf data.

Describe alternatives you've considered
Lymsys ui kind of does this. With the battle mode but limited to one prompt, no follow up. I was looking for the ability to make multiple drafts every turn in the conversation.

Originally created by @nivibilla on GitHub (Dec 15, 2023). **Is your feature request related to a problem? Please describe.** Recent DPO models being so good shows the impact of having good rlhf datasets. **Describe the solution you'd like** Similar to lymsys's arena battle. The ability to make multiple draft generations per prompt generations (from the same model, or different models/endpoints) and then the user would be able to click which one is better(possibly even edit the answer and fix). This would provide very rich and useful conversation trees along with rlhf data. **Describe alternatives you've considered** Lymsys ui kind of does this. With the battle mode but limited to one prompt, no follow up. I was looking for the ability to make multiple drafts every turn in the conversation.

GiteaMirror closed this issue

2025-11-11 14:05:20 -06:00

GiteaMirror commented

2025-11-11 14:05:21 -06:00

@tjbck commented on GitHub (Dec 15, 2023):

Hi, I'm not sure if I understand what your feature request, could you elaborate a bit more? You can regenerate the responses and pick the one you like the most to continue the conversation if that's the feature you're requesting. Thanks.

@tjbck commented on GitHub (Dec 15, 2023): Hi, I'm not sure if I understand what your feature request, could you elaborate a bit more? You can regenerate the responses and pick the one you like the most to continue the conversation if that's the feature you're requesting. Thanks.

GiteaMirror commented

2025-11-11 14:05:22 -06:00

@nivibilla commented on GitHub (Dec 15, 2023):

Hi, yeah sorry im not explaining properly.

In addition to be able to generate multiple drafts like you have already. I would like a way to keep record of that data. As in be able to export human preference data for prompt response based on which one is better. (I see you already have a thumbs up button but not sure where this data is recorded)

@nivibilla commented on GitHub (Dec 15, 2023): Hi, yeah sorry im not explaining properly. In addition to be able to generate multiple drafts like you have already. I would like a way to keep record of that data. As in be able to export human preference data for prompt response based on which one is better. (I see you already have a thumbs up button but not sure where this data is recorded)

GiteaMirror commented

2025-11-11 14:05:23 -06:00

@tjbck commented on GitHub (Dec 15, 2023):

You can export all the chat logs if thats the feature you're looking for!

@tjbck commented on GitHub (Dec 15, 2023): You can export all the chat logs if thats the feature you're looking for!

GiteaMirror commented

2025-11-11 14:05:27 -06:00

@nivibilla commented on GitHub (Dec 16, 2023):

Does that include if multiple generations were made for the same prompt and the thumbs up flag?

@nivibilla commented on GitHub (Dec 16, 2023): Does that include if multiple generations were made for the same prompt and the thumbs up flag?

GiteaMirror commented

2025-11-11 14:05:28 -06:00

@tjbck commented on GitHub (Dec 23, 2023):

Yep! It should have everything, let me know if it doesn't, I'll reopen this issue. Thanks!

@tjbck commented on GitHub (Dec 23, 2023): Yep! It should have everything, let me know if it doesn't, I'll reopen this issue. Thanks!

GiteaMirror commented

2025-11-11 14:05:28 -06:00

@briancleland commented on GitHub (Dec 27, 2023):

Hi @tjbck , can you point me to where the thumbs-up/thumbs-down is recorded in the exported json please?

@briancleland commented on GitHub (Dec 27, 2023): Hi @tjbck , can you point me to where the thumbs-up/thumbs-down is recorded in the exported json please?

GiteaMirror commented

2025-11-11 14:05:29 -06:00

@tjbck commented on GitHub (Dec 27, 2023):

@briancleland, I'd suggest you wait for #216 to get merged to main. I think there might've been a bug with the message rating annotation feature. Stay tuned!

@tjbck commented on GitHub (Dec 27, 2023): @briancleland, I'd suggest you wait for #216 to get merged to main. I think there might've been a bug with the message rating annotation feature. Stay tuned! ![image](https://github.com/ollama-webui/ollama-webui/assets/25473318/bdece95a-bf0e-4b0d-9cc4-d37dfc5d1dc4)

GiteaMirror commented

2025-11-11 14:05:29 -06:00

@briancleland commented on GitHub (Dec 28, 2023):

Perfect, thanks!

@briancleland commented on GitHub (Dec 28, 2023): Perfect, thanks!

GiteaMirror referenced this issue

2025-11-11 17:12:37 -06:00

[PR #96] [MERGED] feat: improved chat history support (response regeneration history) #6950

GiteaMirror referenced this issue

2026-04-20 02:46:46 -05:00

[PR #96] [MERGED] feat: improved chat history support (response regeneration history) #20154

GiteaMirror referenced this issue