open-webui

mirror of https://github.com/open-webui/open-webui.git synced 2025-12-12 20:35:19 +00:00

Author	SHA1	Message	Date
Timothy Jaeryang Baek	d5fd3b3600	feat: external reranker Co-Authored-By: Brendan Campbell <20541191+bcambs09@users.noreply.github.com>	2025-05-10 18:25:20 +04:00
PVBLIC Foundation	3f58a17e47	Update pinecone.py. Removed the unused Pinecone REST‐client import; we now only import ServerlessSpec and the gRPC client. Enhanced close(): call self.client.close() to explicitly shut down the underlying gRPC channel, log success or a warning on failure, and still tear down the thread‐pool executor afterward. Context‐manager support: added __enter__()/__exit__() so that `with PineconeClient() as client: client.insert(...)` automatically calls client.close().	2025-05-10 06:07:27 -07:00
PVBLIC Foundation	12c2138982	Update pinecone.py Refactor and added debug	2025-05-09 18:15:22 -07:00
PVBLIC Foundation	b38711a581	Update pinecone.py	2025-05-08 16:02:47 -07:00
PVBLIC Foundation	04b9065f08	Update pinecone.py Now supports batched insert, upsert, and delete operations using a default batch size of 100, reducing API strain and improving throughput. All blocking calls to the Pinecone API are wrapped in asyncio.to_thread(...), ensuring async safety and preventing event loop blocking. The implementation includes zero-vector handling for efficient metadata-only queries, normalized cosine distance scores for accurate ranking, and protections against empty input operations. Logs for batch durations have been streamlined to minimize noise, while preserving key info-level success logs.	2025-05-08 15:53:30 -07:00
Matt Harrison	2df9f7fb4d	fix: remove import for os module in milvus.py	2025-05-08 00:28:24 -04:00
Matt Harrison	731251d11a	refac: streamline Milvus index type handling using configuration options	2025-05-07 23:39:56 -04:00
Matt Harrison	5e46c27806	refac: enhance MilvusClient with dynamic index type and improved logging	2025-05-07 21:51:28 -04:00
Timothy Jaeryang Baek	6359cb55fe	chore: format	2025-05-07 02:01:03 +04:00
Tim Jaeryang Baek	ea07e242f5	Merge pull request #13528 from Classic298/dev feat: Enhance YouTube Transcription Loader for multi-language support	2025-05-07 00:44:45 +04:00
Classic298	1dcbec71ec	Update youtube.py	2025-05-06 17:14:00 +02:00
Classic298	87dcbd198c	Update youtube.py	2025-05-06 17:11:03 +02:00
Classic298	d7927506f1	Update youtube.py	2025-05-06 17:06:21 +02:00
Classic298	f65dc715f9	Update youtube.py	2025-05-06 16:30:18 +02:00
Classic298	c69278c13c	Update youtube.py	2025-05-06 16:24:27 +02:00
Classic298	a129e0954e	Update youtube.py	2025-05-06 16:22:40 +02:00
Classic298	5e1cb76b93	Update youtube.py	2025-05-06 16:16:58 +02:00
Timothy Jaeryang Baek	e63b8b3879	refac	2025-05-06 00:46:32 +04:00
Timothy Jaeryang Baek	27da31dc83	fix: tikaloader extract images	2025-05-05 23:40:34 +04:00
Classic298	67a612fe24	Update youtube.py	2025-05-05 20:40:48 +02:00
Classic298	791dd24ace	Update youtube.py	2025-05-05 20:08:25 +02:00
Classic298	9cf3381381	Update youtube.py	2025-05-05 20:07:52 +02:00
Classic298	b0d74a59f1	Update youtube.py	2025-05-05 20:07:37 +02:00
Classic298	1a30b3746e	Update youtube.py	2025-05-05 20:03:00 +02:00
Classic298	0a3817ed86	Update youtube.py	2025-05-05 20:00:10 +02:00
Classic298	0a845db8ec	Update youtube.py	2025-05-05 19:57:21 +02:00
Classic298	7680ac2517	Update youtube.py	2025-05-05 19:57:06 +02:00
Timothy Jaeryang Baek	4cfb99248d	chore: format	2025-05-03 23:48:24 +04:00
Athanasios Oikonomou	657162e96d	feat(ocr): add support for Docling OCR engine and language configuration This commit adds support for configuring the OCR engine and language(s) for Docling. Configuration can be set via the environment variables `DOCLING_OCR_ENGINE` and `DOCLING_OCR_LANG`, or through the UI. Fixes #13133	2025-05-03 00:32:06 +03:00
Tim Jaeryang Baek	7d184c3a14	Merge pull request #13085 from ayan4m1/fix/tika-image-ocr fix: pass extractInlineImages header to Tika if PDF_EXTRACT_IMAGES is true	2025-05-02 03:47:51 -07:00
Tim Jaeryang Baek	61580e9490	Merge pull request #13404 from NoMoreFood/dev fix: Use SHA256 For Query Result Computation	2025-05-01 04:55:16 -07:00
Bryan Berns	32257089f9	Use SHA256 For Query Result Computation	2025-05-01 03:56:20 -04:00
Alexander Grimm	da9966aca1	~ truncate vectors for pgvector if too big	2025-04-30 05:35:17 +00:00
Tim Jaeryang Baek	4ee5dd58b7	Merge pull request #13177 from tth37/fix_firecrawl_loader_default_mode fix: FireCrawlLoader default mode to scrape	2025-04-29 08:39:06 -07:00
Tim Jaeryang Baek	e87f2669fa	Merge pull request #13191 from tth37/feat_firecrawl_search_engine feat: Add Firecrawl search engine	2025-04-29 08:38:28 -07:00
Tim Jaeryang Baek	7b863465a9	Merge pull request #13311 from stephen304/yacy-support feat: Yacy search support	2025-04-29 08:35:10 -07:00
Stephen Smith	ea16426a8d	Remove unused kwargs in yacy, update comments.	2025-04-27 00:41:46 -04:00
Stephen Smith	f9b9217e98	Set Yacy search to text	2025-04-26 23:13:31 -04:00
Stephen Smith	e6d43d70f3	Don't request nav and pass count to Yacy	2025-04-26 23:08:16 -04:00
Stephen Smith	240d91d38d	Add yacy config for user/pass, automatically add yacy json api path	2025-04-26 22:28:30 -04:00
Stephen Smith	0f73b96616	first pass at yacy support copied from searxng	2025-04-26 14:07:13 -04:00
tth37	92dbeb1939	feat: Add Firecrawl search engine	2025-04-24 14:57:28 +08:00
tth37	8f7195ceda	fix: FireCrawlLoader default mode to scrape	2025-04-24 01:17:35 +08:00
Tim Jaeryang Baek	91e758f3ec	Merge pull request #13165 from feddersen-group/perf/parallel_knowledge_search perf: all knowledge searches in parallel in non-hybrid mode	2025-04-23 10:01:06 -07:00
Timothy Jaeryang Baek	09874ab83d	fix: FireCrawlLoader	2025-04-24 01:40:34 +09:00
Alexander Grimm	d182155fac	~ call knowledge searches in parallel in non-hybrid mode	2025-04-23 09:20:51 +00:00
Tim Jaeryang Baek	faa3cac0e4	Merge pull request #13107 from tth37/fix_tavily_max_results fix: `max_results` in Tavily search handler	2025-04-22 23:47:36 -07:00
tth37	bc315bd530	fix: `max_results` in Tavily search api	2025-04-21 20:59:47 +08:00
Athanasios Oikonomou	1e291aff25	feat: Add abstract base class for vector database integration - Created `VectorDBBase` as an abstract base class to standardize vector database operations. - Added required methods for common vector database operations: `has_collection`, `delete_collection`, `insert`, `upsert`, `search`, `query`, `get`, `delete`, `reset`. - The base class can now be extended by any vector database implementation (e.g., Qdrant, Pinecone) to ensure a consistent API across different database systems.	2025-04-21 08:27:27 +03:00
ayan4m1	039dec6820	fix: pass header to Tika if PDF_EXTRACT_IMAGES is true	2025-04-20 17:36:40 +02:00
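
Commit 1e291aff25 in the list above describes an abstract base class, `VectorDBBase`, with the methods `has_collection`, `delete_collection`, `insert`, `upsert`, `search`, `query`, `get`, `delete`, and `reset`. The sketch below shows roughly what such a class could look like. Only the class name and the method names come from the commit message; every signature, the item layout (`id`, `vector`, `metadata`), and the `InMemoryDB` subclass are assumptions made for illustration and are not the project's actual code.

```python
from abc import ABC, abstractmethod
from typing import Any, Dict, List, Optional

Item = Dict[str, Any]  # assumed layout: {"id": str, "vector": [...], "metadata": {...}}


class VectorDBBase(ABC):
    """Hypothetical sketch: method names from the commit message, signatures assumed."""

    @abstractmethod
    def has_collection(self, collection_name: str) -> bool: ...

    @abstractmethod
    def delete_collection(self, collection_name: str) -> None: ...

    @abstractmethod
    def insert(self, collection_name: str, items: List[Item]) -> None: ...

    @abstractmethod
    def upsert(self, collection_name: str, items: List[Item]) -> None: ...

    @abstractmethod
    def search(self, collection_name: str, vector: List[float], limit: int) -> List[Item]: ...

    @abstractmethod
    def query(self, collection_name: str, filter: Dict[str, Any],
              limit: Optional[int] = None) -> List[Item]: ...

    @abstractmethod
    def get(self, collection_name: str) -> List[Item]: ...

    @abstractmethod
    def delete(self, collection_name: str, ids: List[str]) -> None: ...

    @abstractmethod
    def reset(self) -> None: ...


class InMemoryDB(VectorDBBase):
    """Toy backend (assumption) showing that a subclass must implement every method."""

    def __init__(self) -> None:
        self._collections: Dict[str, Dict[str, Item]] = {}

    def has_collection(self, collection_name: str) -> bool:
        return collection_name in self._collections

    def delete_collection(self, collection_name: str) -> None:
        self._collections.pop(collection_name, None)

    def insert(self, collection_name: str, items: List[Item]) -> None:
        bucket = self._collections.setdefault(collection_name, {})
        for item in items:
            if item["id"] in bucket:
                raise ValueError(f"duplicate id: {item['id']}")
            bucket[item["id"]] = item

    def upsert(self, collection_name: str, items: List[Item]) -> None:
        bucket = self._collections.setdefault(collection_name, {})
        bucket.update({item["id"]: item for item in items})

    def search(self, collection_name: str, vector: List[float], limit: int) -> List[Item]:
        # Rank by dot product, highest first.
        def score(item: Item) -> float:
            return sum(a * b for a, b in zip(item["vector"], vector))
        return sorted(self.get(collection_name), key=score, reverse=True)[:limit]

    def query(self, collection_name: str, filter: Dict[str, Any],
              limit: Optional[int] = None) -> List[Item]:
        # Keep items whose metadata matches every key/value pair in the filter.
        hits = [item for item in self.get(collection_name)
                if all(item.get("metadata", {}).get(k) == v for k, v in filter.items())]
        return hits[:limit]

    def get(self, collection_name: str) -> List[Item]:
        return list(self._collections.get(collection_name, {}).values())

    def delete(self, collection_name: str, ids: List[str]) -> None:
        bucket = self._collections.get(collection_name, {})
        for item_id in ids:
            bucket.pop(item_id, None)

    def reset(self) -> None:
        self._collections.clear()
```

Because every method is marked abstract, instantiating `VectorDBBase` directly raises `TypeError`, and so does any subclass that leaves a method out. That is the mechanism that keeps the API consistent across backends.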

225 commits
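
Commits 04b9065f08 and 3f58a17e47 in the list above describe three techniques: splitting writes into batches of 100, wrapping blocking client calls in `asyncio.to_thread(...)`, and adding `__enter__()`/`__exit__()` so that `close()` runs automatically. The sketch below combines the three under stated assumptions. `FakeIndex`, `BatchingClient`, `batched`, and `demo` are hypothetical names invented for this example; none of this is the Pinecone SDK or the actual contents of `pinecone.py`.

```python
import asyncio
from typing import Any, Dict, Iterator, List

BATCH_SIZE = 100  # default batch size named in the commit message


def batched(items: List[Any], size: int = BATCH_SIZE) -> Iterator[List[Any]]:
    """Yield consecutive slices of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


class FakeIndex:
    """Stand-in for a blocking vector-store client (hypothetical)."""

    def __init__(self) -> None:
        self.calls: List[int] = []
        self.closed = False

    def upsert(self, vectors: List[Dict[str, Any]]) -> None:
        self.calls.append(len(vectors))  # record the size of each batch

    def close(self) -> None:
        self.closed = True


class BatchingClient:
    """Hypothetical wrapper: batched async writes plus context-manager cleanup."""

    def __init__(self, index: FakeIndex) -> None:
        self.index = index

    def __enter__(self) -> "BatchingClient":
        return self

    def __exit__(self, exc_type, exc, tb) -> None:
        self.close()  # runs even if the body of the `with` block raised

    def close(self) -> None:
        self.index.close()

    async def upsert(self, items: List[Dict[str, Any]]) -> int:
        if not items:  # guard against empty input
            return 0
        for batch in batched(items):
            # Run the blocking call in a worker thread so the event loop stays free.
            await asyncio.to_thread(self.index.upsert, batch)
        return len(items)


async def demo() -> FakeIndex:
    index = FakeIndex()
    with BatchingClient(index) as client:
        await client.upsert([{"id": str(i)} for i in range(250)])
    return index
```

Running `asyncio.run(demo())` sends 250 items as batches of 100, 100, and 50, and the index is closed once the `with` block exits. `asyncio.to_thread` requires Python 3.9 or later.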