gateway

History

Ida a7f4055130 fix(rag): preserve per-page granularity + remove on-demand extraction fallbacks The default MergeStrategy concatenates every extracted text part into a single ContentPart, collapsing a 500-page PDF into one chunk with a blurred average embedding — RAG retrieval was effectively broken. - ExtractionOptions.mergeStrategy is now Optional[MergeStrategy]; passing None preserves per-part granularity. Default factory kept for backward compatibility. - routeDataFiles._autoIndexFile, _workspaceTools.readFile, and _documentTools.describeImage explicitly pass mergeStrategy=None. - Agent tools no longer carry redundant extraction + requestIngestion fallback paths: the unified ingestion lane owns all corpus writes, and readFile/describeImage are pure consumers of the knowledge store. - Unit test asserts runExtraction(mergeStrategy=None) keeps every part.		2026-04-29 14:39:40 +02:00
..
core	hotfix msft/google login tokens end to end separated from connection	2026-03-21 01:34:40 +01:00
services	fix(rag): preserve per-page granularity + remove on-demand extraction fallbacks	2026-04-29 14:39:40 +02:00
__init__.py	refactor: modules/services/ abgeloest durch serviceCenter + serviceHub	2026-03-14 11:51:45 +01:00
context.py	streamlined billing incl ai and storage budget	2026-03-29 12:18:58 +02:00
registry.py	phase 2 i18n clean	2026-04-10 12:33:27 +02:00
resolver.py	refactor: modules/services/ abgeloest durch serviceCenter + serviceHub	2026-03-14 11:51:45 +01:00