familienarchiv

Author	SHA1	Message	Date
Marcel	bbfd234746	refactor(ocr): use stream .toList() instead of FQCN Collectors.toList() Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 21:47:36 +02:00
Marcel	92f3c04d54	fix(ocr): add partial unique index and align SenderModelServiceTest with suite style Add V42 partial unique index on ocr_training_runs(person_id) WHERE status='QUEUED' to enforce the per-person queued coalescing guarantee at the DB level. Also adds @ExtendWith(MockitoExtension.class) to SenderModelServiceTest for consistency with the rest of the service test suite, with lenient() on the shared txTemplate stub. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 21:25:18 +02:00
Marcel	0d5f3f38d0	perf(ocr): resolve person names in single batch query in getTrainingInfo Replace the per-run getById loop with a single getAllById call on distinct person IDs, eliminating the N+1 query when training history contains multiple sender model runs. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 21:21:12 +02:00
Marcel	4aa477555d	refactor(ocr): return TrainingInfoResponse directly from getTrainingInfo endpoint Remove the intermediate Map<String,Object> and return the typed record directly so OpenAPI codegen produces a concrete TypeScript type. Fixes lastRun serializing as {} (empty object) instead of null when no training run exists. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 21:18:27 +02:00
Marcel	e16dcdb7dc	docs(ocr): document tail-recursive queue drain design in promoteNextQueuedRun Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 20:54:53 +02:00
Marcel	f76a9cce1f	test(ocr): add failure path and DONE status assertions to SenderModelServiceTest Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 20:38:43 +02:00
Marcel	e2081b57e7	refactor(ocr): extract exportSenderData helper in triggerSenderTraining Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 20:24:38 +02:00
Marcel	2459408930	refactor(ocr): move person-name enrichment from OcrController into OcrTrainingService Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 20:18:21 +02:00
Marcel	09f4601d15	test(ocr): verify triggerSenderTraining upserts SenderModel with correct path and cer Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 20:13:21 +02:00
Marcel	1b34a36a77	fix(ocr): eliminate race window in runOrQueueSenderTraining by creating RUNNING row atomically Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 20:11:56 +02:00
Marcel	8d041a377d	fix(ocr): correct trainSenderModel URI from /train to /train-sender Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 20:08:18 +02:00
Marcel	18cf839fac	feat(ocr): wire SenderModelService into OcrAsyncRunner; stage missing foundational files OcrAsyncRunner now passes the per-sender model path to streamBlocks for HANDWRITING_KURRENT documents. processDocument replaced extractBlocks with streamBlocks + AtomicReference, removing the unchecked raw-array pattern. Also stages all previously uncommitted foundational files for this feature: SenderModel entity, SenderModelRepository, Flyway migrations V40/V41, updated OcrClient/RestClientOcrClient streaming API, TrainingDataExportService.exportForSender, TranscriptionService Kurrent hook, application.yaml OCR config, and frontend i18n/test additions. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 19:27:02 +02:00
Marcel	386dc83958	refactor(ocr): move sender training methods from OcrTrainingService to SenderModelService Eliminates cross-domain repository access: OcrTrainingService no longer holds SenderModelRepository. SenderModelService now owns the full sender training lifecycle (runOrQueueSenderTraining, triggerSenderTraining, promoteNextQueuedRun), removing the circular dependency risk. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 19:08:10 +02:00
Marcel	60c1ec7b5f	refactor(ocr): delete buildTrainingInfoMap() dead code The controller now builds the map inline (with personNames support). This method had zero callers. Fixes reviewer concerns from @felixbrandt and @mkeller. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 18:52:51 +02:00
Marcel	10a4a4d94b	fix(ocr): log debug instead of silently swallowing person name resolution errors Replaces catch(Exception ignored){} with log.debug() in getTrainingInfo(). Adds controller test documenting the graceful degradation behavior (response stays 200 when personService.getById() throws). Fixes reviewer concerns from @felixbrandt and @nullx. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 18:51:15 +02:00
Marcel	7a342a07cf	test: add unit tests for SenderModelService, runOrQueueSenderTraining, and updateBlock hook - SenderModelServiceTest: 6 tests covering activation threshold (99/100), retrain delta (149/150), runNow flag (queued vs triggered) - OcrTrainingServiceTest: 3 tests for runOrQueueSenderTraining — idle returns true, running saves QUEUED, duplicate QUEUED coalesces - TranscriptionServiceTest: 3 tests for updateBlock — sets source=MANUAL, triggers training for HANDWRITING_KURRENT with sender, skips when no sender Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 18:00:59 +02:00
Marcel	bd23a76330	test: fix broken tests after per-sender model integration - OcrAsyncRunnerTest: switch from extractBlocks/4-arg streamBlocks stubs to 5-arg streamBlocks (senderModelPath param) via doAnswer - TranscriptionServiceTest: stub documentService.getDocumentById in updateBlock tests so the new Kurrent training hook does not NPE - OcrControllerTest: add @MockitoBean PersonService (now injected into OcrController for personNames assembly in getTrainingInfo) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 17:56:51 +02:00
Marcel	ba36a88b65	feat(ocr): add Preprocessing NDJSON event to Java stream pipeline Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 14:21:00 +02:00
Marcel	d075bf390a	feat(tag-search): expand children and surface ancestor path in search results Modifies TagService.search() to enrich name-matches with tree relatives: root matches expand descendants, child matches prepend ancestors. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 11:27:41 +02:00
Marcel	4ec4062274	refactor(#248): simplify TagService.buildTree() to single-pass LinkedHashMap approach Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 07:45:40 +02:00
Marcel	e6497ebff4	fix(#248): add @Schema(REQUIRED) to TagTreeNodeDTO, improve mergeTags log, add comments Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 01:01:09 +02:00
Marcel	d7a46de1cc	refactor(#248): address PR review concerns — TagOperator enum, typed projection, bean validation - Replace stringly-typed "AND"/"OR" tagOperator with TagOperator enum (DocumentService, DocumentController) - Replace Object[] with TagCount projection interface in TagRepository.findDocumentCountsPerTag() - Use @NotNull + @Valid on MergeTagDTO.targetId; remove manual null check from TagController - Correct ALLOWED_TAG_COLORS to match actual frontend CSS tokens (sage/sienna/amber/slate/violet/rose/cobalt/moss/sand/coral) - Add TOCTOU comment to validateNoAncestorCycle() with mitigation explanation - Add test: deleteWithDescendants_skipsDocTagDeletion_whenDescendantIdsIsEmpty - Update TagServiceTest to use mock TagRepository.TagCount projection Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 00:24:04 +02:00
Marcel	a669f6368d	feat(#248): expose parentId in TagTreeNodeDTO OpenAPI schema and regenerate TypeScript types Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 22:33:12 +02:00
Marcel	5e5c249aba	feat(#248): add POST /api/tags/{id}/merge and DELETE /api/tags/{id}/subtree endpoints Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 22:27:41 +02:00
Marcel	609d242f5d	feat(#248): enrich TagTreeNodeDTO with parentId and populate documentCount via single aggregate query Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 22:24:50 +02:00
Marcel	c03c391879	test(#248): add deleteWithDescendants test coverage to TagServiceTest Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 22:20:19 +02:00
Marcel	f921284db6	feat(#248): add TagService.mergeTags() with validateNotSelf/validateNotDescendant/transferDocuments helpers Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 22:18:41 +02:00
Marcel	b9b572436a	feat(#248): add merge/delete/count native queries to TagRepository Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 22:15:14 +02:00
Marcel	a05d9c22ae	fix(#248): TagService.getById() throws DomainException(TAG_NOT_FOUND) instead of ResponseStatusException Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 22:13:45 +02:00
Marcel	de7c48117b	feat(#248): add TAG_NOT_FOUND, TAG_MERGE_SELF, TAG_MERGE_INVALID_TARGET error codes Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 22:10:52 +02:00
Marcel	06fd5ae2da	fix(#221): resolve inherited color on child tags in document responses Colors are stored only on root-level tags. DocumentService now calls TagService.resolveEffectiveColors() before returning search results and single-document responses, so child tags carry their parent's color when serialised to JSON. Parent tags are batch-loaded in a single query. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 19:28:21 +02:00
Marcel	e8e54cc282	feat(#221): change TagInput binding to Tag[], add color dots and hierarchy grouping Backend: - TagRepository: add findDescendantIdsByName() recursive CTE query - TagService: add expandTagNamesToDescendantIdSets() for document search Frontend: - TagInput: accept Tag[] (id, name, color, parentId) instead of string[] - Chips show color dot via var(--c-tag-{color}) when tag has color - Suggestions grouped hierarchically: children indented under their parents - Update DescriptionSection, edit/new pages, SearchFilterBar, +page.svelte Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 16:11:38 +02:00
Marcel	57dc72b51d	feat(#221): add AND/OR tag filtering with hierarchy expansion in document search - Replace hasTags(List<String>) spec with hasTags(List<Set<UUID>>, useOr) - AND mode: one EXISTS subquery per expanded tag ID set; empty set = disjunction - OR mode: union of all expanded sets into a single EXISTS subquery - DocumentService calls tagService.expandTagNamesToDescendantIdSets() before building spec - DocumentController exposes ?tagOp=AND|OR query param (default AND) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 15:44:18 +02:00
Marcel	3fba740469	feat(#221): tag entity hierarchy fields, service, repository, controller - Tag entity: add parentId (UUID FK) and color (String) fields - TagUpdateDTO and TagTreeNodeDTO records - ErrorCode: INVALID_TAG_COLOR, TAG_CYCLE_DETECTED - TagRepository: findAncestorIds() recursive CTE query - TagService: cycle detection, color validation, getTagTree() - TagController: use TagUpdateDTO, add GET /api/tags/tree endpoint Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 15:26:23 +02:00
Marcel	f9ac963b9f	feat(#221): add V39 migration for tag hierarchy and colors Adds parent_id FK (ON DELETE SET NULL), self-reference check constraint, parent_id index, and nullable color column to the tag table. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 15:15:17 +02:00
Marcel	da5c92fe39	fix(#240): remove readyCount from weekly stats DTO and SQL query The Lesefertig pulse was removed from the UI; drop the backend support for it too — removes the subquery from findWeeklyStats(), the projection getter, the DTO field, and updates all affected tests. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 13:19:53 +02:00
Marcel	23410aa4b8	fix(#240): rename V37→V38 (V37 was already applied); regenerate api.ts The original needsExpert V37 migration was applied to the dev DB before the feature was removed. Renaming our new indexes migration to V38 avoids the Flyway checksum conflict. Regenerated api.ts now reflects the @Schema(requiredMode=REQUIRED) annotations — DTO fields are non-optional. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 12:23:14 +02:00
Marcel	e041c75793	test(#240): add Testcontainers integration tests for native SQL queue queries 6 new tests covering findSegmentationQueue (excludes PLACEHOLDER, excludes annotated docs), findTranscriptionQueue (below-90%-reviewed docs, zero-block case), findReadyToReadQueue (>=90% reviewed), and findWeeklyStats (zeros on empty DB). Runs against real PostgreSQL 16 via Testcontainers. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 12:15:21 +02:00
Marcel	adea7d498f	fix(#240): add @Schema(requiredMode=REQUIRED) to both queue DTOs; add V37 indexes All non-null DTO fields are now marked required so the generated api.ts emits required (non-optional) types for callers. V37 migration adds created_at/updated_at indexes on document_annotations and transcription_blocks to avoid full table scans in the weekly stats correlated subqueries. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 12:09:09 +02:00
Marcel	4cf01a0f1d	test(#240): add TranscriptionQueueControllerTest Verifies 401/403/200 responses for all four endpoints. Matches the @WebMvcTest + @RequirePermission pattern used across the project. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 12:07:14 +02:00
Marcel	2e4d9a8375	refactor(#240): replace Object[] positional mapping with Spring Data projections Introduces TranscriptionQueueProjection and TranscriptionWeeklyStatsProjection interfaces so column reordering in native SQL can never silently produce wrong data. Removes the four type-coercion helpers (toUUID, toLocalDate, toInt, toLong) from TranscriptionQueueService. Covered by TranscriptionQueueServiceTest (6 tests). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 12:05:21 +02:00
Marcel	ff1606f63d	fix(#240): update test fixtures broken by rebase changes Two backend tests passed a 6-element enrichment row but the rebase added summary_snippet as column 7 — added null at index 6 to both fixtures. Two frontend page.server tests mocked only 4 dashboard API calls but the page now makes 8 (3 Mission Control queues + weekly-stats added on this branch) — added the 4 missing mock responses. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 11:50:49 +02:00
Marcel	8980d810d4	fix(#240): use annotationCount as denominator in queue thresholds The ready-to-read and transcription queue queries were dividing reviewed blocks by textedBlockCount instead of annotationCount. A document with 4/15 annotations typed — all 4 reviewed — scored 4/4 = 100 % and incorrectly appeared in the Lesefertig column. Both queries now compute the ratio as: reviewed / annotationCount so a document must have ≥ 90 % of all its drawn regions reviewed before it graduates to Lesefertig. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 11:00:18 +02:00
Marcel	ca0cf4903c	refactor(#240): remove needsExpert feature completely Drops the needsExpert / needs_expert flag end-to-end: DB migration (V37, never applied), Document entity field, PATCH endpoint, service method, DTO field, all three queue queries, ExpertBadge component, i18n key, generated API types, and test fixture. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 10:52:14 +02:00
Marcel	9404ec34ce	fix(#240): add missing V36 index migration and rename needs_expert to V37 V36 (add_index_transcription_blocks_document_id) was applied to the dev database during a previous local session but never committed to git. Flyway checksum mismatch prevented the backend from starting. - V36__add_index_transcription_blocks_document_id.sql: restored from the index that already exists in the database (idx_transcription_blocks_document_id) - V36__add_needs_expert_to_documents.sql → V37__add_needs_expert_to_documents.sql Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 10:42:18 +02:00
Marcel	2ea603a3bf	feat(#240): backend for Mission Control Strip — queue endpoints + expert flag Adds the server-side foundation for the dashboard transcription widget: - V36 migration: needs_expert BOOLEAN NOT NULL DEFAULT FALSE on documents - Document entity: needsExpert field (@Schema required) - DocumentRepository: 4 native queries — segmentation queue, transcription queue, ready-to-read queue (seeded weekly shuffle sort), weekly pulse stats - TranscriptionQueueService: maps Object[] rows to typed DTOs, handles PostgreSQL type variations (UUID/String, Date/LocalDate, Number/BigDecimal) - TranscriptionQueueController: GET /api/transcription/{segmentation-queue, transcription-queue, ready-to-read, weekly-stats} — all guarded by READ_ALL - DocumentService + DocumentController: PATCH /api/documents/{id}/needs-expert toggles the expert flag (WRITE_ALL required) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 10:41:55 +02:00
Marcel	d7b2357834	feat(search): surface summary snippet when summary matched the query Add a summary_snippet column to findEnrichmentData using ts_headline on documents.summary, only when the summary's tsvector matches the query. Expose it via SearchMatchData.summarySnippet / summaryOffsets and render a "Zusammenfassung" / "Summary" / "Resumen" labelled row in the document list — identical treatment to the transcription snippet row. Fixes the case where a document appeared in search results with no visible match explanation (e.g. searching "frucht" found a document whose summary mentioned "Früchte"). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 09:10:10 +02:00
Marcel	091f7e5d25	feat(search): partial-word matching via to_tsquery prefix queries Replace websearch_to_tsquery with a CROSS JOIN LATERAL subquery that appends :* to each lexeme so prefix matches work (e.g. "furchtb" finds "furchtbar"). websearch_to_tsquery still handles the safe tokenisation of user input (stop words, special chars, operators); regexp_replace then adds :* before to_tsquery re-parses the result. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 09:10:10 +02:00
Marcel	32f151ff31	feat(search): add snippetOffsets to SearchMatchData and use ts_headline for highlighted snippets - SearchMatchData gains a 6th field snippetOffsets: List<MatchOffset> so the frontend can render highlighted terms inside the transcription snippet without {#html}. - DocumentRepository.findEnrichmentData now calls ts_headline() with chr(1)/chr(2) sentinels instead of returning raw block text; parseHighlight() strips the sentinels and produces clean text + MatchOffset list in one pass. - DocumentService exposes ParsedHighlight and parseHighlight() as public so they can be called from cross-package integration tests. - All related tests updated to the new 6-argument SearchMatchData constructor and to call parseHighlight() for asserting the snippet clean text and offsets. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 09:10:10 +02:00
Marcel	162397d4eb	fix(search): make ParsedHighlight and parseHighlight public for cross-package test access Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 09:10:10 +02:00

295 Commits