[PR #4786] [MERGED] chore(deps): bump unstructured from 0.15.5 to 0.15.7 in /backend #8358

Closed
opened 2025-11-11 17:51:50 -06:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/open-webui/open-webui/pull/4786
Author: @dependabot[bot]
Created: 8/21/2024
Status: Merged
Merged: 8/22/2024
Merged by: @tjbck

Base: devHead: dependabot/pip/backend/dev/unstructured-0.15.7


📝 Commits (1)

  • ccf5bd1 chore(deps): bump unstructured from 0.15.5 to 0.15.7 in /backend

📊 Changes

1 file changed (+1 additions, -1 deletions)

View changed files

📝 backend/requirements.txt (+1 -1)

📄 Description

Bumps unstructured from 0.15.5 to 0.15.7.

Release notes

Sourced from unstructured's releases.

0.15.7

Enhancements

Features

Fixes

  • Fix NLTK data download path to prevent nested directories. Resolved an issue where a nested "nltk_data" directory was created within the parent "nltk_data" directory when it already existed. This fix prevents errors in checking for existing downloads and loading models from NLTK data.

0.15.6

Enhancements

Features

Fixes

  • Bump to NLTK 3.9.x Bumps to the latest nltk version to resolve CVE.
  • Update CI for ingest-test-fixture-update-pr to resolve NLTK model download errors.
  • Synchronized text and html on TableChunk splits. When a Table element is divided during chunking to fit the chunking window, TableChunk.text corresponds exactly with the table text in TableChunk.metadata.text_as_html, .text_as_html is always parseable HTML, and the table is split on even row boundaries whenever possible.
Changelog

Sourced from unstructured's changelog.

0.15.7

Enhancements

Features

Fixes

  • Fix NLTK data download path to prevent nested directories. Resolved an issue where a nested "nltk_data" directory was created within the parent "nltk_data" directory when it already existed. This fix prevents errors in checking for existing downloads and loading models from NLTK data.

0.15.6

Enhancements

Features

Fixes

  • Bump to NLTK 3.9.x Bumps to the latest nltk version to resolve CVE.
  • Update CI for ingest-test-fixture-update-pr to resolve NLTK model download errors.
  • Synchronized text and html on TableChunk splits. When a Table element is divided during chunking to fit the chunking window, TableChunk.text corresponds exactly with the table text in TableChunk.metadata.text_as_html, .text_as_html is always parseable HTML, and the table is split on even row boundaries whenever possible.
Commits
  • 01dbc7b fix: nltk data download path to prevent redundant nested directories (#3546)
  • 1f8030d fix(CVE-2024-39705): bump to nltk 3.9.1; correct model download issues (#3541)
  • a861ed8 feat(chunk): split tables on even row boundaries (#3504)
  • 99f72d6 ci: fix ingest test fixtures update (#3532)
  • See full diff in compare view

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/open-webui/open-webui/pull/4786 **Author:** [@dependabot[bot]](https://github.com/apps/dependabot) **Created:** 8/21/2024 **Status:** ✅ Merged **Merged:** 8/22/2024 **Merged by:** [@tjbck](https://github.com/tjbck) **Base:** `dev` ← **Head:** `dependabot/pip/backend/dev/unstructured-0.15.7` --- ### 📝 Commits (1) - [`ccf5bd1`](https://github.com/open-webui/open-webui/commit/ccf5bd149271b16b4aa2a90d1d98ad69591bbce1) chore(deps): bump unstructured from 0.15.5 to 0.15.7 in /backend ### 📊 Changes **1 file changed** (+1 additions, -1 deletions) <details> <summary>View changed files</summary> 📝 `backend/requirements.txt` (+1 -1) </details> ### 📄 Description Bumps [unstructured](https://github.com/Unstructured-IO/unstructured) from 0.15.5 to 0.15.7. <details> <summary>Release notes</summary> <p><em>Sourced from <a href="https://github.com/Unstructured-IO/unstructured/releases">unstructured's releases</a>.</em></p> <blockquote> <h2>0.15.7</h2> <h3>Enhancements</h3> <h3>Features</h3> <h3>Fixes</h3> <ul> <li><strong>Fix NLTK data download path to prevent nested directories</strong>. Resolved an issue where a nested &quot;nltk_data&quot; directory was created within the parent &quot;nltk_data&quot; directory when it already existed. This fix prevents errors in checking for existing downloads and loading models from NLTK data.</li> </ul> <h2>0.15.6</h2> <h3>Enhancements</h3> <h3>Features</h3> <h3>Fixes</h3> <ul> <li><strong>Bump to NLTK 3.9.x</strong> Bumps to the latest <code>nltk</code> version to resolve CVE.</li> <li><strong>Update CI for <code>ingest-test-fixture-update-pr</code> to resolve NLTK model download errors.</strong></li> <li><strong>Synchronized text and html on <code>TableChunk</code> splits.</strong> When a <code>Table</code> element is divided during chunking to fit the chunking window, <code>TableChunk.text</code> corresponds exactly with the table text in <code>TableChunk.metadata.text_as_html</code>, <code>.text_as_html</code> is always parseable HTML, and the table is split on even row boundaries whenever possible.</li> </ul> </blockquote> </details> <details> <summary>Changelog</summary> <p><em>Sourced from <a href="https://github.com/Unstructured-IO/unstructured/blob/0.15.7/CHANGELOG.md">unstructured's changelog</a>.</em></p> <blockquote> <h2>0.15.7</h2> <h3>Enhancements</h3> <h3>Features</h3> <h3>Fixes</h3> <ul> <li><strong>Fix NLTK data download path to prevent nested directories</strong>. Resolved an issue where a nested &quot;nltk_data&quot; directory was created within the parent &quot;nltk_data&quot; directory when it already existed. This fix prevents errors in checking for existing downloads and loading models from NLTK data.</li> </ul> <h2>0.15.6</h2> <h3>Enhancements</h3> <h3>Features</h3> <h3>Fixes</h3> <ul> <li><strong>Bump to NLTK 3.9.x</strong> Bumps to the latest <code>nltk</code> version to resolve CVE.</li> <li><strong>Update CI for <code>ingest-test-fixture-update-pr</code> to resolve NLTK model download errors.</strong></li> <li><strong>Synchronized text and html on <code>TableChunk</code> splits.</strong> When a <code>Table</code> element is divided during chunking to fit the chunking window, <code>TableChunk.text</code> corresponds exactly with the table text in <code>TableChunk.metadata.text_as_html</code>, <code>.text_as_html</code> is always parseable HTML, and the table is split on even row boundaries whenever possible.</li> </ul> </blockquote> </details> <details> <summary>Commits</summary> <ul> <li><a href="https://github.com/Unstructured-IO/unstructured/commit/01dbc7b4733e88efd6c1e85930c707009a2a966e"><code>01dbc7b</code></a> fix: <code>nltk</code> data download path to prevent redundant nested directories (<a href="https://redirect.github.com/Unstructured-IO/unstructured/issues/3546">#3546</a>)</li> <li><a href="https://github.com/Unstructured-IO/unstructured/commit/1f8030dd0ee76389c619603197fc5f7f4e546d09"><code>1f8030d</code></a> fix(CVE-2024-39705): bump to <code>nltk</code> 3.9.1; correct model download issues (<a href="https://redirect.github.com/Unstructured-IO/unstructured/issues/3541">#3541</a>)</li> <li><a href="https://github.com/Unstructured-IO/unstructured/commit/a861ed8fe762de72c79372c9656f88143c0901c2"><code>a861ed8</code></a> feat(chunk): split tables on even row boundaries (<a href="https://redirect.github.com/Unstructured-IO/unstructured/issues/3504">#3504</a>)</li> <li><a href="https://github.com/Unstructured-IO/unstructured/commit/99f72d65baaac7b3111a026fa19d7c133c661431"><code>99f72d6</code></a> ci: fix ingest test fixtures update (<a href="https://redirect.github.com/Unstructured-IO/unstructured/issues/3532">#3532</a>)</li> <li>See full diff in <a href="https://github.com/Unstructured-IO/unstructured/compare/0.15.5...0.15.7">compare view</a></li> </ul> </details> <br /> [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=unstructured&package-manager=pip&previous-version=0.15.5&new-version=0.15.7)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`. [//]: # (dependabot-automerge-start) [//]: # (dependabot-automerge-end) --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) </details> --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2025-11-11 17:51:50 -06:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/open-webui#8358