mirror of
https://github.com/open-webui/open-webui.git
synced 2026-03-22 14:13:08 -05:00
pdf text extraction #71
Originally created by @JohnZolton on GitHub (Dec 3, 2023).
I want to drag PDFs into the chat window and talk with the AI about them by loading the PDF text into the prompt. I'm trying to implement it myself but am having issues with the PDF libraries and Svelte/Vite. The PDFs would already be OCR'd, so it's just a matter of extracting the text.
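A minimal sketch of the "load the PDF text into the prompt" step, assuming the per-page text has already been extracted by some PDF library. The names `PdfPage`, `buildPdfPrompt`, and `maxChars` are hypothetical and are not part of open-webui; the character budget is only a crude stand-in for a real token limit.

```typescript
// Hypothetical types and helper for illustration only; not open-webui code.
interface PdfPage {
  pageNumber: number;
  text: string;
}

function buildPdfPrompt(
  fileName: string,
  pages: PdfPage[],
  question: string,
  maxChars: number = 8000,
): string {
  // Label each page so the model can refer back to where text came from.
  const body = pages
    .map((p) => `[Page ${p.pageNumber}]\n${p.text.trim()}`)
    .join("\n\n");

  // Clip to a character budget so a large PDF does not overflow the context.
  const clipped =
    body.length > maxChars ? body.slice(0, maxChars) + "\n[truncated]" : body;

  return `Document: ${fileName}\n\n${clipped}\n\nQuestion: ${question}`;
}
```

The page texts could come from any extractor that runs in the browser, for example the items returned by pdf.js's `page.getTextContent()`; that wiring is left out here so the sketch stays independent of any bundler setup.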
@DanMyers300 commented on GitHub (Dec 4, 2023):
I would like this too, but with the ability to integrate with LangChain. I'm attempting to work on it; if successful, I'll make a PR.
@tjbck commented on GitHub (Dec 5, 2023):
Hi, Thanks for the feature request! I'm actively working on a RAG feature for the webui, so stay tuned! Let's continue our discussion here: #31
@bhaidar commented on GitHub (May 27, 2024):
Hi @tjbck
I've uploaded a PDF file (~2 MB) and asked LLAMA3 what the document is about, and received "I am not sure ..". Is there a special model to use with documents? Thanks
@dmvieira commented on GitHub (Sep 12, 2024):
It's not working for me either, @bhaidar. I'm trying to summarize a large PDF and it's not working because it uses RAG to retrieve the PDF text. Why not add an option to use the whole PDF when generating a response?
@thiswillbeyourgithub commented on GitHub (Sep 12, 2024):
@bhaidar and @dmvieira this might be related to my ongoing PR #5378