feat: nltk_data taggers and tokenizers to docker image #5955

Closed
opened 2025-11-11 16:40:04 -06:00 by GiteaMirror · 0 comments

Originally created by @artokarj on GitHub (Aug 4, 2025).

Check Existing Issues

  • I have searched the existing issues and discussions.

Problem Description

When using Open WebUI in an air-gapped environment, attaching MS Office documents (docx and pptx) to a prompt does not work, because nltk_data cannot be downloaded from the internet.

Desired Solution you'd like

nltk_data, including the taggers and tokenizers, is pre-loaded into the Docker image.
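As a rough illustration of the request, the sketch below shows one way the data could be fetched at image build time and then checked offline. It is not taken from the Open WebUI Dockerfile. The package names `punkt_tab` and `averaged_perceptron_tagger_eng` are assumptions (the issue only says "taggers and tokenizers"), and the helper names `missing_packages` and `preload` are hypothetical.

```python
from pathlib import Path

# ASSUMPTION: the issue does not name specific packages; these two are
# illustrative guesses, mapped to their nltk_data category directories.
ASSUMED_PACKAGES = {
    "punkt_tab": "tokenizers",
    "averaged_perceptron_tagger_eng": "taggers",
}


def missing_packages(data_dir, packages=ASSUMED_PACKAGES):
    """Return the sorted names of packages not found under data_dir.

    A package counts as present if either an unpacked directory or a
    zip archive exists at <data_dir>/<category>/<name>.
    """
    root = Path(data_dir)
    missing = []
    for name, category in packages.items():
        unpacked = root / category / name
        archive = root / category / f"{name}.zip"
        if not unpacked.is_dir() and not archive.is_file():
            missing.append(name)
    return sorted(missing)


def preload(data_dir, packages=ASSUMED_PACKAGES):
    """Download any missing packages into data_dir (needs network access).

    Intended for the image build step only. Returns whatever is still
    missing afterwards, so the build can fail loudly if a download failed.
    """
    import nltk  # imported lazily so missing_packages() works without nltk

    for name in missing_packages(data_dir, packages):
        nltk.download(name, download_dir=str(data_dir), quiet=True)
    return missing_packages(data_dir, packages)
```

A build step could call `preload()` on a directory of its choosing and set the `NLTK_DATA` environment variable to that directory, so that NLTK searches it at runtime. In the air-gapped container, `missing_packages()` can then confirm the data is present without touching the network.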

Alternatives Considered

No response

Additional Context

No response


Reference: github-starred/open-webui#5955