gateway/modules/serviceCenter/services
Ida dff3d41845 fix(rag): stable ingestion idempotency across re-extractions (AC4)
Re-indexing the same file always triggered a full embedding run —
ingestion.skipped.duplicate never fired. Two independent causes:

1. _computeIngestionHash included contentObjectId in its payload, but
   extractors generate fresh uuid4() per run, making the hash a
   per-run nonce. Now hashed over (contentType, data) in extractor
   order — stable across re-extractions, sensitive to content,
   ordering, and type changes.
2. _autoIndexFile upserted the fresh pre-scan FileContentIndex before
   requestIngestion's duplicate check, wiping structure._ingestion
   and status=indexed from the prior run. The pre-upsert now merges
   the existing _ingestion metadata and preserves the indexed status.
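The first fix can be illustrated with a minimal sketch. It is not the actual _computeIngestionHash. It assumes content objects are dicts with contentObjectId, contentType, and data keys, that data is JSON-serializable, and that SHA-256 is an acceptable digest. The names compute_ingestion_hash and extract are hypothetical.

```python
import hashlib
import json
import uuid


def compute_ingestion_hash(content_objects):
    """Hash only (contentType, data) pairs, in extractor order (hypothetical helper)."""
    hasher = hashlib.sha256()
    for obj in content_objects:
        # contentObjectId is deliberately left out: it is a fresh uuid4() per run.
        pair = json.dumps([obj["contentType"], obj["data"]], ensure_ascii=False)
        encoded = pair.encode("utf-8")
        # A length prefix keeps adjacent entries from running into each other.
        hasher.update(len(encoded).to_bytes(8, "big"))
        hasher.update(encoded)
    return hasher.hexdigest()


def extract(parts):
    """Stand-in extractor: assigns a new contentObjectId on every run."""
    return [
        {"contentObjectId": str(uuid.uuid4()), "contentType": t, "data": d}
        for t, d in parts
    ]


parts = [("text", "Hello"), ("table", "a,b\n1,2")]
first_run = compute_ingestion_hash(extract(parts))
second_run = compute_ingestion_hash(extract(parts))
reordered = compute_ingestion_hash(extract(list(reversed(parts))))
retyped = compute_ingestion_hash(extract([("heading", "Hello"), ("table", "a,b\n1,2")]))
```

Under these assumptions, first_run equals second_run even though every contentObjectId differs, while reordered and retyped produce different digests.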

Verified end-to-end: second PATCH /scope on an already-indexed file
logs  and returns in ~2s
with zero embedding API calls.
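The second fix and the verified behaviour can be sketched as follows. This is an illustration under stated assumptions, not the real _autoIndexFile or requestIngestion: the in-memory store, the "pending" status, the "hash" key inside _ingestion, and the "ingestion.completed" return value are all hypothetical.

```python
def merge_pre_scan(existing, fresh):
    """Carry _ingestion metadata and the indexed status over to a fresh pre-scan."""
    merged = dict(fresh)
    merged["structure"] = dict(fresh.get("structure", {}))
    if existing is None:
        return merged
    prior = existing.get("structure", {}).get("_ingestion")
    if prior is not None:
        merged["structure"]["_ingestion"] = prior
    if existing.get("status") == "indexed":
        merged["status"] = "indexed"
    return merged


def request_ingestion(store, file_id, content_hash, embed):
    """Skip embedding when the stored hash matches; otherwise embed and record it."""
    record = store[file_id]
    prior = record["structure"].get("_ingestion", {})
    if record.get("status") == "indexed" and prior.get("hash") == content_hash:
        return "ingestion.skipped.duplicate"
    embed()
    record["structure"]["_ingestion"] = {"hash": content_hash}  # assumed layout
    record["status"] = "indexed"
    return "ingestion.completed"  # hypothetical event name


def auto_index_file(store, file_id, fresh_index, content_hash, embed):
    """Pre-upsert with merge, then run the duplicate check."""
    store[file_id] = merge_pre_scan(store.get(file_id), fresh_index)
    return request_ingestion(store, file_id, content_hash, embed)


store = {}
embed_calls = []
fresh = {"status": "pending", "structure": {"pages": 3}}
first = auto_index_file(store, "f1", fresh, "abc", lambda: embed_calls.append(1))
second = auto_index_file(store, "f1", fresh, "abc", lambda: embed_calls.append(1))
```

Replacing merge_pre_scan with a plain overwrite of the stored record would reproduce the original bug: the duplicate check would never see the prior hash, and the second call would embed again.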

Adds test_ingestion_hash_stability.py (5 cases).
2026-04-29 14:39:40 +02:00
serviceAgent fix(rag): stable ingestion idempotency across re-extractions (AC4) 2026-04-29 14:39:40 +02:00
serviceAi teamsbot 2026-04-25 01:13:01 +02:00
serviceBackgroundJobs datamodel strict fk logic in one place 2026-04-26 18:11:42 +02:00
serviceBilling data source fixes 2026-04-21 00:50:36 +02:00
serviceChat wired infomaniac to ai adapters and tools 2026-04-29 01:52:47 +02:00
serviceClickup next version of visual workflow editor with ClickUp Connections 2026-03-25 09:38:06 +01:00
serviceExtraction cleanup internal marked exports 2026-04-26 08:31:35 +02:00
serviceGeneration Graph and data class alignment strict 2026-04-26 22:53:44 +02:00
serviceKnowledge fix(rag): stable ingestion idempotency across re-extractions (AC4) 2026-04-29 14:39:40 +02:00
serviceMessaging automation unification implemented 2026-04-07 00:49:08 +02:00
serviceSharepoint included sharepoint nodes 2026-03-22 19:46:50 +01:00
serviceSubscription datamodel strict fk logic in one place 2026-04-26 18:11:42 +02:00
serviceTicket first version of service center 2026-03-06 14:03:18 +01:00
serviceWeb refactor: modules/services/ replaced by serviceCenter + serviceHub 2026-03-14 11:51:45 +01:00
__init__.py first version of service center 2026-03-06 14:03:18 +01:00