familienarchiv

Author	SHA1	Message	Date
marcel	b33d0eb850	feat(lesereisen): implement lesereisen All checks were successful CI / Unit & Component Tests (push) Successful in 4m34s Details CI / OCR Service Tests (push) Successful in 27s Details CI / Backend Unit Tests (push) Successful in 5m1s Details CI / fail2ban Regex (push) Successful in 47s Details CI / Semgrep Security Scan (push) Successful in 23s Details CI / Compose Bucket Idempotency (push) Successful in 1m11s Details	2026-06-12 14:04:02 +02:00
marcel	d650b6c066	refactor(search): remove NLP/smart-search feature entirely (#772 ) All checks were successful CI / Unit & Component Tests (push) Successful in 3m23s Details CI / OCR Service Tests (push) Successful in 24s Details CI / Backend Unit Tests (push) Successful in 3m46s Details CI / fail2ban Regex (push) Successful in 46s Details CI / Semgrep Security Scan (push) Successful in 25s Details CI / Compose Bucket Idempotency (push) Successful in 1m8s Details ## Summary - Removes the NLP/smart-search feature completely — the feature was too unreliable and slow; users get better results with the regular search filters - Deletes the entire backend `search/` package (NlSearchController, NlQueryParserService, NlpClient, NlSearchRateLimiter — 14 classes + 6 test classes) - Deletes the `nlp-service/` Python microservice (FastAPI, rapidfuzz, DB-backed person matching) - Removes all frontend NL search components: SmartModeToggle, SmartSearchStatus, InterpretationChipRow, DisambiguationPicker, chip-types, theme-chip-removal - Strips smart-mode logic from SearchFilterBar and documents/+page.svelte - Removes `SMART_SEARCH_UNAVAILABLE` / `SMART_SEARCH_RATE_LIMITED` error codes from backend, frontend types, and all three i18n files (de/en/es) - Removes `nlp-service` container and `APP_NLP_BASE_URL` from both docker-compose files - Removes Ollama/NLP Prometheus scrape job and Grafana dashboard - Deletes ADRs 028 (×2), 034, 035 ## Test plan - [ ] Backend compiles: `cd backend && ./mvnw compile -q` → BUILD SUCCESS - [ ] Frontend server tests pass: `cd frontend && npm run test -- --project=server` - [ ] No NLP/smart-search references remain in source: `grep -r "SmartSearch\\|NlSearch\\|nlp-service\\|SMART_SEARCH" backend/src frontend/src` - [ ] `docker compose config` validates both compose files - [ ] Search page loads, filter bar works, no smart-mode toggle visible 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Marcel <marcel@familienarchiv> Reviewed-on: #772	2026-06-08 10:57:00 +02:00
Marcel	09b77e9b36	test(person): pin fetchPool dedup when one person matches two tokens (#763 review) All checks were successful CI / Unit & Component Tests (push) Successful in 3m20s Details CI / OCR Service Tests (push) Successful in 24s Details CI / Backend Unit Tests (push) Successful in 3m53s Details CI / fail2ban Regex (push) Successful in 44s Details CI / Semgrep Security Scan (push) Successful in 21s Details CI / Compose Bucket Idempotency (push) Successful in 1m5s Details Assert that when the same person id is returned by two different token fetches, the person appears exactly once in the result -- pinning fetchPool's putIfAbsent dedup so a future refactor can't silently double-classify a candidate. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 08:47:47 +02:00
Marcel	9d202b042b	test(person): close fetch-to-classify seam for alias matches on real Postgres (#763 review) AC#4 (maiden alias -> direct) and AC#5 (alias first name -> fetchable + classifiable) were each split across PersonRepositoryTest (the fetch) and PersonServiceTest (the classifier with stubs) -- nothing walked searchByName -> resolveByName end-to-end on real Postgres. Add two tests in the existing @DataJpaTest slice that build a real PersonService over the autowired repositories, persist a person with a MAIDEN_NAME alias and one with an alias firstName, and assert both classify as direct. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 08:47:47 +02:00
Marcel	f1bb9d3a69	feat(search): map direct/partial NameMatches into resolve buckets (#763 ) resolveNames now delegates to PersonService.resolveByName and maps by match strength: 1 direct → resolved (auto-select), ≥2 direct → ambiguous, 0 direct with partials → ambiguous suggestions, 0 candidates → folded into full-text. A single direct match no longer forces the picker when looser substring hits coexist. The MAX_CANDIDATES cap moved into PersonService (after classification); the MAX_NAME_LENGTH guard, resolved-cap overflow, and sender/receiver mapping are preserved. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 08:47:47 +02:00
Marcel	ca52145556	feat(person): add resolveByName for direct/partial name matching (#763 ) Token-set containment over all of a person's name components (firstName, lastName, alias, each PersonNameAlias first+last, title) decides direct vs partial. Orchestrates tokenize → cap(8) → fetch pool → classify → cap(10) after classification, with an empty-token guard and a PII-free debug log of the outcome bucket. MAX_TOKENS is a DoS control; the after-classify cap keeps a direct match that sorts past position 10 among partials. Read-only transaction keeps lazy nameAliases reachable during classification (ADR-022). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 08:47:47 +02:00
Marcel	9a26bf75b0	feat(person): match alias first names in searchByName (#763 ) The direct-match classifier accepts alias firstName tokens, so the fetch must surface candidates matchable only via an alias first name. Add a.firstName to the searchByName LIKE clause (reuses the bound :query — injection-proof). The person_name_aliases.first_name column already exists; no migration. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 08:47:47 +02:00
Marcel	9c616f9fb8	feat(person): add name-match tokenizer for direct matching (#763 ) Lowercase, split on whitespace/hyphen/apostrophe, drop empties. Applied symmetrically to query and candidate name components so "Anna-Maria" and "Anna Maria" tokenize alike. Foundation for resolveByName direct matching. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 08:47:47 +02:00
Marcel	b825076733	test(search): DataJpaTest for descendant-expansion via TagRepository Verifies the recursive CTE in findDescendantIdsByName expands a parent tag to include all child IDs, and that findByNameContainingIgnoreCase matches both parent and child names when the fragment appears in both. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-07 08:47:47 +02:00
Marcel	01df815bad	test(search): add 11 tag-resolution test cases to NlQueryParserServiceTest Covers multi-tag match, no-match FTS fallback, mixed resolution, personRole bypass, cap at 10, short-keyword skip, dedup, rawQuery suppression when all keywords resolve, flag independence, colour propagation via resolveEffectiveColors, and colour=null when depth constraint prevents resolution. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-07 08:47:47 +02:00
Marcel	dcd0e725a7	feat(search): implement keyword→tag resolution in NlQueryParserService Keywords that substring-match the tag taxonomy become OR-union tag filters; non-matching keywords stay as FTS text. Resolved tags surface in the NlQueryInterpretation as TagHint objects with effective colours. The rawQuery fallback is now guarded by hadStructuredMatch to prevent double-apply when all keywords resolve. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-07 08:47:47 +02:00
Marcel	5a09cd4cb4	feat(search): extend NlQueryInterpretation with resolvedTags + tagsApplied Positional record fields added; all 3 construction sites updated with neutral defaults; NlQueryParserService wired for TagService (4th constructor arg); NlQueryParserServiceTest and NlSearchControllerTest synced. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-07 08:47:47 +02:00
Marcel	0f0d89702d	feat(search): add TagService.findByNameContaining for NL tag resolution Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-07 08:47:47 +02:00
Marcel	79e4a3f9db	feat(search): add searchDocumentsByPersonId with Specification-based sender/receiver query Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-06 16:04:54 +02:00
Marcel	70e8a6e6ad	feat(search): implement NlSearchController with @WebMvcTest tests (7 cases) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-06 15:58:35 +02:00
Marcel	3af1095d13	feat(search): implement NlQueryParserService with Mockito tests (23 cases) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-06 15:54:45 +02:00
Marcel	8c835e957a	feat(search): implement RestClientOllamaClient with WireMock tests Switch to wiremock-jetty12 artifact and force ee10 Jetty deps to 12.1.8 to resolve compatibility with Spring Boot 4's Jetty 12.1.8 core. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-06 15:43:49 +02:00
Marcel	fe8fcba7a7	feat(search): add NlSearchRateLimiter with Bucket4j/Caffeine Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-06 15:39:06 +02:00
Marcel	e0fac783e8	feat(person): add findByDisplayNameContaining service method Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-06 15:30:30 +02:00
Marcel	ddf378aaac	fix(person): resolve ambiguous sender names to null on upload (#731 ) All checks were successful CI / Unit & Component Tests (pull_request) Successful in 3m18s Details CI / OCR Service Tests (pull_request) Successful in 25s Details CI / Backend Unit Tests (pull_request) Successful in 3m38s Details CI / fail2ban Regex (pull_request) Successful in 43s Details CI / Semgrep Security Scan (pull_request) Successful in 22s Details CI / Compose Bucket Idempotency (pull_request) Successful in 1m6s Details findByName resolved via Optional<Person> findByFirstNameIgnoreCaseAndLastNameIgnoreCase, which threw NonUniqueResultException once two people shared a first+last name case- insensitively (hans müller / Hans Müller) — a 500 on the routine upload path (DocumentService.storeDocument sender resolution). findByName now resolves exact-case → single case-insensitive match → else empty. The sender path deliberately diverges from the alias path: an ambiguous name leaves the sender UNSET rather than guessing the lowest id, because correct provenance beats a confidently-wrong pre-fill a reviewer won't re-check. The two new name queries use explicit HQL equality so a null first name binds as `= NULL` (no match) instead of the derived-query fold to `first_name IS NULL`, which would widen a last-name-only row in as a sender. Pins the opaque error path (IncorrectResultSizeDataAccessException stays INTERNAL_ERROR with no Hibernate/SQL/row-count leak) and extends ADR-032 with the Person section. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-06 13:03:04 +02:00
Marcel	20cfe41f21	fix(person): resolve case-colliding aliases without throwing (#731 ) findOrCreateByAlias resolved via Optional<Person> findByAliasIgnoreCase, which throws NonUniqueResultException once two aliases collide only by case (müller / Müller) — a generic 500 on the importer path. Mirror the #730 tag fix: resolve exact-case first, then the lowest-id case-insensitive sibling, then create-when-absent (institution/group and maiden-name alias preserved). The throwing Optional<…>IgnoreCase variant is deleted so it can't be reused. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-06 12:50:21 +02:00
Marcel	43601a3770	test(transcription): persist real persons for mention FK after V71 (#684 ) All checks were successful CI / Unit & Component Tests (push) Successful in 3m20s Details CI / OCR Service Tests (push) Successful in 23s Details CI / Backend Unit Tests (push) Successful in 3m39s Details CI / fail2ban Regex (push) Successful in 46s Details CI / Semgrep Security Scan (push) Successful in 22s Details CI / Compose Bucket Idempotency (push) Successful in 1m6s Details V71 gives transcription_block_mentioned_persons.person_id a real FK, so two TranscriptionBlockMentionsRepositoryTest cases that inserted mention rows with random (non-existent) person ids now violate fk_tbmp_person. Persist real Person rows and use their ids. Caught by CI's full suite. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-06 12:34:46 +02:00
Marcel	6603bc5333	test(person): address PR #736 review nits - AC-3 cascade test: assert an innocent bystander's mention row survives the delete, proving the cascade is scoped to the deleted person (Nora). - Fix integration-test comment: receivers is @ManyToMany(LAZY), not an EAGER @ElementCollection (Sara). - ADR-032: note the @ prefix is kept in the degraded path, stripped in live mentions (Leonie). - Add trailing newline to PersonRepository.java (Felix). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-06 12:34:46 +02:00
Marcel	6d267f2269	test(person): describe DB-cascade mechanism in delete service-path test (#684 ) The deletePerson service-path guard (AC-4) is unchanged behaviourally, but its comments described the removed reassignSenderToNull/deleteReceiverReferences chain. Update them to the V71 ON DELETE cascade mechanism. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-06 12:34:46 +02:00
Marcel	ff76a3784f	refactor(person): simplify mergePersons to lean on V71 cascade (#684 ) Drop the explicit deleteReceiverReferences call from mergePersons — the source's leftover receiver join rows now cascade-drop via V71's ON DELETE CASCADE on deleteById. Remove the now-unused deleteReceiverReferences repository method (and its repo test), and add clearAutomatically + flushAutomatically to the remaining merge native queries so the L1 cache cannot desync from the bulk updates. Rewrite the merge unit test with verifyNoMoreInteractions and add an end-to-end merge regression test (AC-7). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-06 12:34:46 +02:00
Marcel	534665459f	refactor(person): thin deletePerson to lean on V71 DB cascade (#684 ) Drop the application-layer sender/receiver detach from deletePerson — the V71 ON DELETE constraints now enforce it. Remove the now-unused reassignSenderToNull repository method and rewrite the unit test to assert only the existence check plus deleteById (verifyNoMoreInteractions). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-06 12:34:46 +02:00
Marcel	fd792f6d78	feat(person): enforce person-delete integrity at the DB layer (V71) (#684 ) Add ON DELETE behaviour to the two V1 FKs into persons (documents.sender_id -> SET NULL, document_receivers.person_id -> CASCADE) and a real FK with ON DELETE CASCADE on the transcription_block_mentioned_persons soft reference, cleaning up pre-existing orphan mention rows first. The cascade stays strictly at the join/reference layer and never reaches documents rows. Proven by new Postgres-backed PersonRepositoryTest cascade tests (AC-1/2/3/8 plus the cascade-boundary document-survival guard). Rewrites the now-stale V56 'no FK' comment. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-06 12:34:46 +02:00
Marcel	2710f2e233	test(tag): close review-flagged gaps in case-collision coverage (#730 ) Two adversarial gaps from PR #733 review: - Unit: exact-case must win even when its id is NOT the lowest, proving exact-case short-circuits before the lowest-id tie-break (a naive "lowest id across all CI matches" would pick the wrong row). - Integration: assert findAllByNameIgnoreCase folds the UPPERCASE "GLÜCKWÜNSCHE" — the exact string findOrCreate passes — so the umlaut proof matches the resolution path under test, not a lowercase probe. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-06 11:07:39 +02:00
Marcel	a58378e8f0	test(tag): pin case-colliding tag resolution on real Postgres (#730 ) All checks were successful CI / Unit & Component Tests (pull_request) Successful in 3m16s Details CI / OCR Service Tests (pull_request) Successful in 21s Details CI / Backend Unit Tests (pull_request) Successful in 3m35s Details CI / fail2ban Regex (pull_request) Successful in 44s Details CI / Semgrep Security Scan (pull_request) Successful in 21s Details CI / Compose Bucket Idempotency (pull_request) Successful in 1m6s Details Mocked TagServiceTest can't prove the two things that actually broke: that findAllByNameIgnoreCase folds umlauts the way Postgres LOWER() does, and that saving a document tagged with a case-colliding tag no longer throws NonUniqueResultException. Testcontainers postgres:16-alpine: - updateDocument on a doc tagged with the child "weihnachten" succeeds and keeps exactly the child tag (not the parent). - findOrCreate("GLÜCKWÜNSCHE") resolves the Glückwünsche/glückwünsche umlaut pair deterministically (lowest id) without throwing — the regression catcher a plain-ASCII pair would miss. - bulk-edit funnels through resolveTags → findOrCreate, guarding a future refactor that bypasses it. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-06 10:53:04 +02:00
Marcel	d000170f52	fix(tag): resolve case-colliding tag names without throwing (#730 ) findOrCreate used tagRepository.findByNameIgnoreCase, which returns Optional<Tag> and threw NonUniqueResultException whenever two tags collided case-insensitively (a canonical parent and its same-named lowercase child). Every document carrying such a tag became un-editable: any save re-resolves the whole tag set by name and blew up with a 500. Replace the throwing lookup with exact-case-first resolution: findByName (exact) → findAllByNameIgnoreCase (lowest-id, deterministic, never throws) → create. Delete findByNameIgnoreCase so the throwing call can't be reintroduced. Case collisions are valid tree nodes — no migration, no unique(lower(name)) constraint. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-06 10:49:02 +02:00
Marcel	f656f7c1ff	test(document): close review-flagged coverage gaps for auto-title sync (#726 ) - save-time: precision+raw carry-over when the DTO omits them (exercises the shared skip-null resolvers), and a RANGE label round-trip (Sara/Elicit) - factory: a bare Document with a null index builds "" rather than NPE-ing (Felix) - backfill matcher: negative near-misses — ASCII hyphen vs en dash, missing separator before trailing text, year-with-trailing-letters, index followed by text without a separator (Sara) - backfill integration: tighten the count assertion to exactly 1 on the clean test DB (Sara) Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-04 17:10:50 +02:00
Marcel	12db7b3596	test(document): integration-test title backfill against real Postgres (#726 ) Pins backfill behaviour on postgres:16-alpine (H2 unusable — title is NOT NULL): a stale auto-title is rewritten, the sweep is idempotent (second run touches nothing), prose is left alone, and the mechanical rename adds no document_versions rows. Permission (401/403) stays in the faster @WebMvcTest slice. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-04 16:32:07 +02:00
Marcel	26b45f1c78	feat(document): one-time backfill endpoint for stale auto-titles (#726 ) Adds POST /api/admin/backfill-titles (ADMIN-only, synchronous) which rebuilds every machine-generated title from the row's current state. A grammar heuristic (DocumentTitleBackfillMatcher) decides overwritability: index matched literally via startsWith (originalFilename is user-controlled — no regex injection / ReDoS, CWE-1333), date-label forms derived from the same Locale.GERMAN formatters as the factory so they cannot drift, prose left untouched, fail-closed on any surprise. Saves via the repository directly (no recordVersion — follows backfillFileHashes), so the mechanical rename never version-spams document_versions. Idempotent: a second run rewrites nothing. Emits one SLF4J-parameterized scanned/updated/skipped line. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-04 16:29:57 +02:00
Marcel	e6ce00035e	feat(document): regenerate auto-title on save when date/location change (#726 ) updateDocument now captures the machine title from the persisted state before any setter runs, and rebuilds it from the new state only when the submitted title still equals that machine value — an exact comparison that relies on the edit form round-tripping an untouched title verbatim. A hand-written or freshly-typed title is kept; a blank submission falls back to the rebuilt auto-title (title is always present); a file-replaced document no longer matches its import-time title and is treated as manual. projectedState mirrors the setter asymmetry exactly (date/location overwrite incl. null-clear; precision/end/raw skip-null from the entity). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-04 16:20:46 +02:00
Marcel	b1f77bcfb6	refactor(document): extract title composition into shared DocumentTitleFactory (#726 ) Move DocumentTitleFormatter from importing into the document package and introduce DocumentTitleFactory there as the single source of truth for the {index} – {dateLabel} – {location} formula. DocumentImporter now consumes the factory instead of owning the composition; the document package owns the rule, importing depends on it (not the reverse). No behavioral change — importer title assertions and the #666 fixture parity test stay green. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-04 16:15:00 +02:00
Marcel	4e68b81bf7	feat(document): remove conversation repository queries Delete findConversation and findSinglePersonCorrespondence (no remaining callers after the service methods were removed) and their integration test section. Drops the now-unused LocalDate import. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 10:26:54 +02:00
Marcel	985b31f71f	feat(document): remove conversation service methods Delete getConversationFiltered (the endpoint's only caller is gone) and the dead 2-arg getConversation(personA, personB) which had zero callers, along with both getConversationFiltered test blocks. The hasSender/ hasReceiver specifications stay — document search still uses them. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 10:26:54 +02:00
Marcel	50f554680c	refactor(document): drop the 5-minute Cache-Control TTL on /density (#709 ) Some checks failed CI / Unit & Component Tests (pull_request) Successful in 3m21s Details CI / OCR Service Tests (pull_request) Successful in 23s Details CI / fail2ban Regex (pull_request) Has been cancelled Details CI / Semgrep Security Scan (pull_request) Has been cancelled Details CI / Compose Bucket Idempotency (pull_request) Has been cancelled Details CI / Backend Unit Tests (pull_request) Has been cancelled Details CI / Unit & Component Tests (push) Successful in 3m21s Details CI / OCR Service Tests (push) Successful in 19s Details CI / Backend Unit Tests (push) Successful in 3m45s Details CI / fail2ban Regex (push) Successful in 45s Details CI / Semgrep Security Scan (push) Successful in 21s Details CI / Compose Bucket Idempotency (push) Successful in 1m6s Details The density chart is an interactive filter control; a 5-minute private browser cache let it show stale month counts after an edit/upload/re-tag. The in-memory aggregation is sub-200ms p95 over ~5k docs, so there is no load reason to cache. Removing the explicit header lets Spring Security's default no-store directive apply, so the response is always fresh. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 19:56:50 +02:00
Marcel	1dd162f1be	test(document): prove the DB rejects end-before-start; assert persisted end (#678 ) All checks were successful CI / Unit & Component Tests (pull_request) Successful in 3m20s Details CI / OCR Service Tests (pull_request) Successful in 22s Details CI / Backend Unit Tests (pull_request) Successful in 3m31s Details CI / fail2ban Regex (pull_request) Successful in 45s Details CI / Semgrep Security Scan (pull_request) Successful in 20s Details CI / Compose Bucket Idempotency (pull_request) Successful in 1m4s Details CI / Unit & Component Tests (push) Successful in 3m23s Details CI / OCR Service Tests (push) Successful in 23s Details CI / Backend Unit Tests (push) Successful in 3m37s Details CI / fail2ban Regex (push) Successful in 43s Details CI / Semgrep Security Scan (push) Successful in 22s Details CI / Compose Bucket Idempotency (push) Successful in 1m5s Details Addresses Sara's review concerns: - Add a negative Testcontainers test: saveAndFlush of a RANGE with end < start throws DataIntegrityViolationException, proving chk_meta_date_end_after_start actually fires (H2 wouldn't) and exercising the backstop's trigger end-to-end. Guards against silent app/DB drift if the service guard ever regresses. - Tighten updateDocument_acceptsRange_whenEndAfterStart to assert the persisted end value, not just that save was called. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 11:03:28 +02:00
Marcel	ff7cfd4b1a	fix(exception): log the violated constraint name at WARN (#678 ) Addresses Tobias's review concern: the generic DataIntegrityViolation backstop turned every integrity violation into a silent 400 with no constraint name, no stack, no Sentry — an unanticipated write bug would fail invisibly in production. Now extract the constraint NAME from the cause chain (schema metadata, safe for Loki) and log it parameterized at WARN, so the failure is debuggable. Still never pass `ex`/`getMessage()` (SQL + values, CWE-209) and still no Sentry — the response stays generic, so the response logic is not brittle. New test proves the WARN names the constraint but never carries the SQL. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 11:03:04 +02:00
Marcel	88600d54cd	test(document): prove Postgres accepts an equal-date RANGE (#678 ) All checks were successful CI / Unit & Component Tests (pull_request) Successful in 3m19s Details CI / OCR Service Tests (pull_request) Successful in 21s Details CI / Backend Unit Tests (pull_request) Successful in 3m43s Details CI / fail2ban Regex (pull_request) Successful in 43s Details CI / Semgrep Security Scan (pull_request) Successful in 20s Details CI / Compose Bucket Idempotency (pull_request) Successful in 1m4s Details Testcontainers integration test persisting a RANGE doc with end == start against real Postgres + Flyway, which (unlike H2) enforces the V69 chk_meta_date_end_after_start CHECK. Pins the app guard's isBefore semantics to the actual >= constraint, guarding against app/DB drift (AC2). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 09:29:37 +02:00
Marcel	3a4c2c6225	feat(exception): backstop DataIntegrityViolation as a clean 400 (#678 ) Add @ExceptionHandler(DataIntegrityViolationException) returning 400 VALIDATION_ERROR with a fixed constant message, so any integrity violation that slips past the upstream guards (a future constraint, or the import path) becomes a clean 400 instead of a 500 + Sentry alert (AC9). Deliberately generic — it does not inspect which constraint failed. Never echoes ex.getMessage() (constraint name + SQL, CWE-209), logs at WARN without passing the exception (would re-leak the SQL to Loki), and does not call Sentry.captureException. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 09:20:22 +02:00
Marcel	73f614bc3a	feat(document): reject end date without RANGE precision (#678 ) Add the second validateDateRange predicate mirroring chk_meta_date_end_only_for_range, so a direct API client that sets an end date without RANGE precision gets a clean 400 INVALID_DATE_RANGE instead of a 500 (AC6). Shares the code with the end-before-start branch. Also fix updateDocument_preservesStoredPrecision_whenDtoOmitsIt: its stored fixture (MONTH + end date) is a state the DB CHECK forbids, so the carried-over-state guard correctly rejects it. Switched to RANGE + end — the only DB-valid non-null-end combo — preserving the test's intent. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 09:17:52 +02:00
Marcel	6c5e5273bb	test(document): lock in accepted RANGE cases — equal/after/open/null-start (#678 ) Cover AC2 (end == start), AC3 (open-ended, end null) and AC4 (null start + end set, which must not reject or NPE), plus end-after-start. Guards the guard against future over-rejection that would diverge from the DB CHECK. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 09:13:59 +02:00
Marcel	a574d96351	feat(document): reject RANGE with end before start (#678 ) Add ErrorCode.INVALID_DATE_RANGE and a validateDateRange guard on DocumentService.updateDocument, run right after applyDatePrecision so it fires before any save (updateDocumentTags persists earlier in the method). Mirrors the V69 chk_meta_date_end_after_start CHECK: end >= start with a null start allowed, using isBefore so equal dates stay valid. Turns a user date typo into a clean 400 instead of a 500 + Sentry alert. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 09:12:54 +02:00
Marcel	3b594c0b0b	test(document): pin undated null->false coercion on /ids (#683 ) All checks were successful CI / Unit & Component Tests (pull_request) Successful in 3m16s Details CI / OCR Service Tests (pull_request) Successful in 20s Details CI / Backend Unit Tests (pull_request) Successful in 3m31s Details CI / fail2ban Regex (pull_request) Successful in 43s Details CI / Semgrep Security Scan (pull_request) Successful in 20s Details CI / Compose Bucket Idempotency (pull_request) Successful in 1m2s Details CI / Unit & Component Tests (push) Successful in 3m22s Details CI / OCR Service Tests (push) Successful in 21s Details CI / Backend Unit Tests (push) Successful in 3m25s Details CI / fail2ban Regex (push) Successful in 44s Details CI / Semgrep Security Scan (push) Successful in 20s Details CI / Compose Bucket Idempotency (push) Successful in 1m4s Details The /search path already pins the Boolean-undated->primitive coercion via search_withoutUndatedParam_forwardsFalseToService; add the symmetric pin for getDocumentIds so an absent param provably resolves to undated=false on the record (never NPE). Raised in the #702 review. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-31 15:55:14 +02:00
Marcel	4c2f036de0	test(document): collapse all-null SearchFilters literals to noFilters() (#683 ) Replace the ~29 repeated `new SearchFilters(null, null, null, null, null, null, null, null, null, false)` literals across the search test suites with a shared SearchFiltersFixtures.noFilters() factory (and noFilters() .withUndated(true) for the undated-only case). Tests that pin a specific field keep their explicit `new SearchFilters(...)` so intent stays visible. Pure test-ergonomics cleanup raised in the #702 review; no behaviour change. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-31 15:53:34 +02:00
Marcel	dcb57ffacd	refactor(document): thread SearchFilters through the search chain (#683 ) All checks were successful CI / Unit & Component Tests (pull_request) Successful in 3m21s Details CI / OCR Service Tests (pull_request) Successful in 20s Details CI / Backend Unit Tests (pull_request) Successful in 3m26s Details CI / fail2ban Regex (pull_request) Successful in 44s Details CI / Semgrep Security Scan (pull_request) Successful in 19s Details CI / Compose Bucket Idempotency (pull_request) Successful in 1m3s Details Replace the long positional filter lists on the document search chain with the SearchFilters record. searchDocuments now takes (SearchFilters, DocumentSort, String dir, Pageable) and findIdsForFilter takes a single SearchFilters; the four private helpers (buildSearchSpec, runSearch, countUndatedForFilter, isPureTextRelevance) no longer carry a positional 10-field filter list. The controller builds the record after its existing tagOp/undated coercions; the density path adapts its DensityFilters into a SearchFilters at the shared buildSearchSpec call. The forced-undated count path is preserved via filters.withUndated(true), so countUndatedForFilter still ignores the user's toggle (#668) while runSearch honours it. No behaviour change. Controller binding tests swap their positional any()/eq() matchers for ArgumentCaptor<SearchFilters>, asserting captured.undated()/.status()/ .sender() — strictly stronger than the previous any()-soup. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-31 15:20:13 +02:00
Marcel	6c4d10d12f	test(security): lock READ_ALL -> 403 on comment-write endpoints (#697 ) Round out the "read-only users can't write anything" boundary: a READ_ALL principal is forbidden from posting a block comment, replying, and editing a comment (the prior tests only used a no-authority principal for create). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-31 13:28:37 +02:00
Marcel	2cdb48f4a4	refactor(document): compute hasTranscription only on the detail path (#697 ) Move the hasTranscription existence query out of the shared getDocumentById into a dedicated getDocumentDetail used solely by GET /api/documents/{id}. The flag is only consumed by the detail page, so the extra EXISTS query no longer runs for the many internal getDocumentById callers (e.g. the Geschichte resolve loop and the dashboard resume path). Behaviour of the detail endpoint is unchanged. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-31 13:28:37 +02:00

1 2 3 4 5 ...

605 Commits