gateway/modules/serviceCenter/services/serviceAgent/coreTools
Ida a7f4055130 fix(rag): preserve per-page granularity + remove on-demand extraction fallbacks
The default MergeStrategy concatenates every extracted text part into a
single ContentPart, collapsing a 500-page PDF into one chunk with a
blurred average embedding — RAG retrieval was effectively broken.

- ExtractionOptions.mergeStrategy is now Optional[MergeStrategy]; passing
  None preserves per-part granularity. Default factory kept for
  backward compatibility.
- routeDataFiles._autoIndexFile, _workspaceTools.readFile, and
  _documentTools.describeImage explicitly pass mergeStrategy=None.
- Agent tools no longer carry redundant extraction + requestIngestion
  fallback paths: the unified ingestion lane owns all corpus writes,
  and readFile/describeImage are pure consumers of the knowledge store.
- Unit test asserts runExtraction(mergeStrategy=None) keeps every part.
2026-04-29 14:39:40 +02:00
..
__init__.py automation unification implemented 2026-04-07 00:49:08 +02:00
_connectionTools.py fix:merge conflicts clickup branch 2026-04-17 13:31:38 +02:00
_crossWorkflowTools.py automation unification implemented 2026-04-07 00:49:08 +02:00
_dataSourceTools.py wired infomaniac to ai adapters and tools 2026-04-29 01:52:47 +02:00
_documentTools.py fix(rag): preserve per-page granularity + remove on-demand extraction fallbacks 2026-04-29 14:39:40 +02:00
_emailTools.py fixes udb, outlook, workflow 2026-04-21 23:49:46 +02:00
_featureSubAgentTools.py fixed trustee 2026-04-26 23:59:09 +02:00
_helpers.py wired infomaniac to ai adapters and tools 2026-04-29 01:52:47 +02:00
_mediaTools.py wired infomaniac to ai adapters and tools 2026-04-29 01:52:47 +02:00
_workspaceTools.py fix(rag): preserve per-page granularity + remove on-demand extraction fallbacks 2026-04-29 14:39:40 +02:00
registerCore.py fixes udb, outlook, workflow 2026-04-21 23:49:46 +02:00