Commit graph

10 commits

Author SHA1 Message Date
Ida
a7f4055130 fix(rag): preserve per-page granularity + remove on-demand extraction fallbacks
The default MergeStrategy concatenates every extracted text part into a
single ContentPart, collapsing a 500-page PDF into one chunk with a
blurred average embedding — RAG retrieval was effectively broken.

- ExtractionOptions.mergeStrategy is now Optional[MergeStrategy]; passing
  None preserves per-part granularity. Default factory kept for
  backward compatibility.
- routeDataFiles._autoIndexFile, _workspaceTools.readFile, and
  _documentTools.describeImage explicitly pass mergeStrategy=None.
- Agent tools no longer carry redundant extraction + requestIngestion
  fallback paths: the unified ingestion lane owns all corpus writes,
  and readFile/describeImage are pure consumers of the knowledge store.
- Unit test asserts runExtraction(mergeStrategy=None) keeps every part.
2026-04-29 14:39:40 +02:00
ValueOn AG
d9fcea54ff trustee agent fix 2026-04-27 08:07:37 +02:00
ValueOn AG
30ea8bbefe fixed trustee 2026-04-26 23:59:09 +02:00
ValueOn AG
d505ffd9cd Graph and data class falignment strict 2026-04-26 22:53:44 +02:00
ValueOn AG
b6be8f391e fixes 2026-04-23 23:09:38 +02:00
ValueOn AG
9186c60ad2 fixed rendering issues 2026-03-22 11:09:48 +01:00
ValueOn AG
c8b7517209 refactor: modules/services/ abgeloest durch serviceCenter + serviceHub
serviceCenter = DI-Container (Resolver, Registry, Context) fuer Service-Instanziierung
serviceHub = Consumer-facing Aggregation (DB-Interfaces, Runtime-State, lazy Service-Resolution via serviceCenter)

- modules/serviceHub/ erstellt: ServiceHub, PublicService, getInterface()
- 22 Consumer-Dateien migriert (routes, features, tests): imports von modules.services auf serviceHub bzw. serviceCenter umgestellt
- resolver.py: legacy fallback auf altes services/ entfernt
- modules/services/ komplett geloescht (83 Dateien inkl. dead code mainAiChat.py)
- pre-extraction: progress callback durch chunk-pipeline propagiert, operationType DATA_EXTRACT->DATA_ANALYSE fuer guenstigeres Modell
2026-03-14 11:51:45 +01:00
ValueOn AG
64b44473aa fixed data extraction and generation handling with parts 2026-01-02 00:05:54 +01:00
ValueOn AG
d009f93dba rbac roles and rules integration tests passed 2025-12-07 23:51:05 +01:00
ValueOn AG
9bd7821cf5 feat: refactored ai calls and pydantic models 2025-11-17 23:12:18 +01:00