Commit graph

44 commits

Author SHA1 Message Date
Ida
93aff13d26 node handover standartisiert, kein hardcoden mehr, inhalt extraktion node verbessert, output ports vereinheitlicht mit user im blick 2026-05-06 12:50:49 +02:00
ValueOn AG
3da6e24bec fixed component formgeneratortree and truastee workflows 2026-05-03 22:03:29 +02:00
Patrick Motsch
7942766931
Merge pull request #149 from valueonag/feat/demo-system-readieness
Feat/demo system readieness
2026-04-30 23:58:26 +02:00
ValueOn AG
880fa4d787 plana+c implemented 2026-04-29 21:27:08 +02:00
Ida
93cb6939dc feat: frontend consent integration 2026-04-29 14:39:40 +02:00
Ida
3add5c9a80 commit before rebase 2026-04-29 14:39:40 +02:00
Ida
6a5ff1ff7c feat(rag): P1 user-connection hooks + retrieval threshold fix
- connection.established/revoked callbacks from OAuth routes and
  connection management endpoints
- KnowledgeIngestionConsumer dispatches bootstrap job (established)
  and synchronous purge (revoked)
- FileContentIndex: add connectionId + sourceKind columns
- SharePoint bootstrap with @odata.nextLink pagination and eTag-based
  idempotency
- Outlook bootstrap treats messages as virtual documents with
  cleanEmailBody for HTML/quote/signature stripping
- fix(rag): lower buildAgentContext minScore thresholds from
  0.55/0.65/0.70 to 0.35 — previous values blocked all real matches
  from text-embedding-3-small
- 24 new unit tests covering purge, consumer dispatch, email cleaning
  and both bootstrap paths
2026-04-29 14:39:40 +02:00
Ida
dff3d41845 fix(rag): stable ingestion idempotency across re-extractions (AC4)
Re-indexing the same file always triggered a full embedding run —
ingestion.skipped.duplicate never fired. Two independent causes:

1. _computeIngestionHash included contentObjectId in its payload, but
   extractors generate fresh uuid4() per run, making the hash a
   per-run nonce. Now hashed over (contentType, data) in extractor
   order — stable across re-extractions, sensitive to content,
   ordering, and type changes.
2. _autoIndexFile upserted the fresh pre-scan FileContentIndex before
   requestIngestion's duplicate check, wiping structure._ingestion
   and status=indexed from the prior run. The pre-upsert now merges
   the existing _ingestion metadata and preserves the indexed status.

Verified end-to-end: second PATCH /scope on an already-indexed file
logs  and returns in ~2s
with zero embedding API calls.

Adds test_ingestion_hash_stability.py (5 cases).
2026-04-29 14:39:40 +02:00
Ida
a7f4055130 fix(rag): preserve per-page granularity + remove on-demand extraction fallbacks
The default MergeStrategy concatenates every extracted text part into a
single ContentPart, collapsing a 500-page PDF into one chunk with a
blurred average embedding — RAG retrieval was effectively broken.

- ExtractionOptions.mergeStrategy is now Optional[MergeStrategy]; passing
  None preserves per-part granularity. Default factory kept for
  backward compatibility.
- routeDataFiles._autoIndexFile, _workspaceTools.readFile, and
  _documentTools.describeImage explicitly pass mergeStrategy=None.
- Agent tools no longer carry redundant extraction + requestIngestion
  fallback paths: the unified ingestion lane owns all corpus writes,
  and readFile/describeImage are pure consumers of the knowledge store.
- Unit test asserts runExtraction(mergeStrategy=None) keeps every part.
2026-04-29 14:39:40 +02:00
Ida
078b4eaaaf removed unnecessary test files 2026-04-29 14:39:40 +02:00
Ida
9d82d3d353 P0: injection facade 2026-04-29 14:39:40 +02:00
ValueOn AG
b405cebdec kdrive fix 2026-04-29 00:35:21 +02:00
ValueOn AG
fb3a1f0a51 fixes ai agents parameter flow 2026-04-28 11:58:53 +02:00
ValueOn AG
d9fcea54ff trustee agent fix 2026-04-27 08:07:37 +02:00
ValueOn AG
30ea8bbefe fixed trustee 2026-04-26 23:59:09 +02:00
ValueOn AG
d505ffd9cd Graph and data class falignment strict 2026-04-26 22:53:44 +02:00
ValueOn AG
564a1200c6 datamodel sctirc fk logic in one place 2026-04-26 18:11:42 +02:00
ValueOn AG
24f0c3e2eb cleanup internal marked exports 2026-04-26 08:31:35 +02:00
ValueOn AG
794ba36f27 teamsbot 2026-04-25 01:13:01 +02:00
ValueOn AG
b6be8f391e fixes 2026-04-23 23:09:38 +02:00
ValueOn AG
b84ca37d99 pwg-demo 2026-04-20 00:31:05 +02:00
ValueOn AG
50107a91ba fixed proper splitting sysadmin/platformadmin and proper logic for mandate name(slug) and label(user) 2026-04-19 00:04:03 +02:00
ValueOn AG
e942770ffc feat db-clean-ui and unified content udm 2026-04-16 23:13:05 +02:00
ValueOn AG
be9e47caad phase 2 i18n clean 2026-04-10 12:33:27 +02:00
idittrich-valueon
d0f8444bac next version of visual workflow editor with ClickUp Connections 2026-03-25 09:38:06 +01:00
ValueOn AG
f796ae3807 fix: RBAC bootstrap anthropic for user, FeatureAccess response, workspace UI repair, user access overview, RBAC tests
Made-with: Cursor
2026-03-23 10:29:23 +01:00
ValueOn AG
9186c60ad2 fixed rendering issues 2026-03-22 11:09:48 +01:00
ValueOn AG
c8b7517209 refactor: modules/services/ abgeloest durch serviceCenter + serviceHub
serviceCenter = DI-Container (Resolver, Registry, Context) fuer Service-Instanziierung
serviceHub = Consumer-facing Aggregation (DB-Interfaces, Runtime-State, lazy Service-Resolution via serviceCenter)

- modules/serviceHub/ erstellt: ServiceHub, PublicService, getInterface()
- 22 Consumer-Dateien migriert (routes, features, tests): imports von modules.services auf serviceHub bzw. serviceCenter umgestellt
- resolver.py: legacy fallback auf altes services/ entfernt
- modules/services/ komplett geloescht (83 Dateien inkl. dead code mainAiChat.py)
- pre-extraction: progress callback durch chunk-pipeline propagiert, operationType DATA_EXTRACT->DATA_ANALYSE fuer guenstigeres Modell
2026-03-14 11:51:45 +01:00
47340e6949 fix:removed unnecessary files 2026-03-06 14:18:07 +01:00
53d2d9d873 first version of service center
implemented on chatbot
2026-03-06 14:03:18 +01:00
ValueOn AG
829711f755 fixed system and dynamic data rbac 2026-01-26 12:39:00 +01:00
ValueOn AG
e737bf5cdb gpdr compliancy implemented 2026-01-25 23:57:41 +01:00
ValueOn AG
280cafd54a refactored features phase II 2026-01-23 01:10:00 +01:00
ValueOn AG
bb9630d6c4 fixed imports 2026-01-22 21:11:25 +01:00
ValueOn AG
362080791a isolate features 2026-01-22 17:00:29 +01:00
ValueOn AG
77e1414744 module testing 2026-01-20 00:55:39 +01:00
ValueOn AG
ccc41e7023 harmonized module names 2026-01-19 09:18:37 +01:00
ValueOn AG
64b44473aa fixed data extraction and generation handling with parts 2026-01-02 00:05:54 +01:00
ValueOn AG
4b00e741b3 refactored service center 2025-12-15 21:55:26 +01:00
ValueOn AG
1195105b94 cleaned up dependencies and code architecture 2025-12-09 23:25:06 +01:00
ValueOn AG
d009f93dba rbac roles and rules integration tests passed 2025-12-07 23:51:05 +01:00
ValueOn AG
6e6cf7012b rbac module testing done 2025-12-07 22:00:55 +01:00
ValueOn AG
54246745a9 refactored uam to rbac 2025-12-07 13:48:39 +01:00
ValueOn AG
9bd7821cf5 feat: refactored ai calls and pydantic models 2025-11-17 23:12:18 +01:00