docs(c4): accuracy audit — split L3 diagrams, add 6 new sub-diagrams, fix all stale content #448

marcel · 2026-05-06T10:11:39+02:00

marcel commented

2026-05-06 10:11:39 +02:00

Summary

Split dense L3 backend and frontend monolith diagrams into 9 focused sub-diagrams (dagre handles ≤14 nodes without arrow spaghetti)
Fixed stale content: email-based user lookup, renamed /conversations → /briefwechsel, updated component descriptions throughout
Added 6 entirely new sub-diagrams for subsystems absent from the original docs: Transcription Pipeline (3b.2), Persons & Family Graph (3c.2), OCR Orchestration (3d), Supporting Domains (3e), People/Stories/Discovery frontend (3c), Admin & Help frontend (3d)

Diagram count: 15 Mermaid blocks total (was 4)

Diagrams added or changed

Diagram	Change
L1 System Context	Updated persona descriptions (transcribes)
L2 Containers	Updated frontend container description
Backend 3a — Security	Fixed email field, permitted endpoints
Backend 3b — Document & Import	Updated batch ops, FTS, presigned URLs, audit logging
Backend 3b.2 — Transcription Pipeline	New — TranscriptionBlockController, AnnotationController, CommentController, queue service
Backend 3c — Users, Groups & Auth	Rewritten — added InviteController, AuthController, password reset
Backend 3c.2 — Persons & Family Graph	New — RelationshipController, RelationshipInferenceService, transitive graph
Backend 3d — OCR Orchestration	New — OcrController, OcrAsyncRunner, RestClientOcrClient, Python OCR Service
Backend 3e — Supporting Domains	New — Audit, Dashboard, Notification/SSE, Geschichte, GlobalExceptionHandler
Frontend 3a — Middleware & Auth	Added register/forgot-password/reset-password routes
Frontend 3b — Document Workflows	Rewritten — added docNew, docBulkEdit, enrichPage
Frontend 3c — People, Stories & Discovery	New — briefwechsel, aktivitaeten, geschichten, stammbaum
Frontend 3d — Administration & Help	New — adminUsers, adminGroups, adminTags, adminOcr, adminSystem, hilfe

Test plan

Verify all 15 Mermaid blocks render without layout errors (open in Gitea preview or mermaid.live)
Confirm no component name in any diagram is missing from the codebase

## Summary - **Split** dense L3 backend and frontend monolith diagrams into 9 focused sub-diagrams (dagre handles ≤14 nodes without arrow spaghetti) - **Fixed** stale content: email-based user lookup, renamed `/conversations` → `/briefwechsel`, updated component descriptions throughout - **Added** 6 entirely new sub-diagrams for subsystems absent from the original docs: Transcription Pipeline (3b.2), Persons & Family Graph (3c.2), OCR Orchestration (3d), Supporting Domains (3e), People/Stories/Discovery frontend (3c), Admin & Help frontend (3d) **Diagram count:** 15 Mermaid blocks total (was 4) ## Diagrams added or changed | Diagram | Change | |---|---| | L1 System Context | Updated persona descriptions (transcribes) | | L2 Containers | Updated frontend container description | | Backend 3a — Security | Fixed email field, permitted endpoints | | Backend 3b — Document & Import | Updated batch ops, FTS, presigned URLs, audit logging | | Backend 3b.2 — Transcription Pipeline | **New** — TranscriptionBlockController, AnnotationController, CommentController, queue service | | Backend 3c — Users, Groups & Auth | Rewritten — added InviteController, AuthController, password reset | | Backend 3c.2 — Persons & Family Graph | **New** — RelationshipController, RelationshipInferenceService, transitive graph | | Backend 3d — OCR Orchestration | **New** — OcrController, OcrAsyncRunner, RestClientOcrClient, Python OCR Service | | Backend 3e — Supporting Domains | **New** — Audit, Dashboard, Notification/SSE, Geschichte, GlobalExceptionHandler | | Frontend 3a — Middleware & Auth | Added register/forgot-password/reset-password routes | | Frontend 3b — Document Workflows | Rewritten — added docNew, docBulkEdit, enrichPage | | Frontend 3c — People, Stories & Discovery | **New** — briefwechsel, aktivitaeten, geschichten, stammbaum | | Frontend 3d — Administration & Help | **New** — adminUsers, adminGroups, adminTags, adminOcr, adminSystem, hilfe | ## Test plan - [ ] Verify all 15 Mermaid blocks render without layout errors (open in Gitea preview or mermaid.live) - [ ] Confirm no component name in any diagram is missing from the codebase

marcel added 9 commits 2026-05-06 10:11:39 +02:00

docs(c4): split L3 monolith diagrams into five focused sub-diagrams aff15fd208

Backend L3 split into 3a (Security & Auth), 3b (Document/File/Import),
3c (People/Users/Groups). Frontend L3 split into 3a (Middleware/Auth/Layout)
and 3b (Pages & Shared Components). Each sub-diagram stays within dagre's
clean-layout range (5–10 components, 6–12 relationships).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs(c4): update L1 personas and L2 frontend container description d4eb0edfe1

docs(c4): fix 3a security — email field, permitted endpoints d51c5e9b80

docs(c4): update 3b document domain — descriptions, batch ops, FTS, presigned URLs 1e3423934c

docs(c4): add 3b.2 transcription pipeline — annotations, blocks, comments 04b2b33d7f

docs(c4): restructure 3c users/groups, add 3c.2 persons and family graph dbbb989aa6

docs(c4): add 3d OCR orchestration and 3e supporting domains f744e8c59d

docs(c4): update frontend 3a — hooks layers, add register/forgot/reset routes be40cc0f38

docs(c4): rewrite frontend 3b, add 3c people/stories/discovery, add 3d admin/help

CI / Unit & Component Tests (push) Failing after 3m26s

Details

CI / OCR Service Tests (push) Successful in 32s

Details

CI / Backend Unit Tests (push) Failing after 3m24s

Details

CI / Unit & Component Tests (pull_request) Failing after 3m30s

Details

CI / OCR Service Tests (pull_request) Successful in 40s

Details

CI / Backend Unit Tests (pull_request) Failing after 3m21s

Details

e1f66e2e65

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

marcel commented

2026-05-06 10:18:55 +02:00

🏗️ Markus Keller — Senior Application Architect

Verdict: ⚠️ Approved with concerns

The split from one 28-component monolith into 9 focused sub-diagrams is the right call — dagre ≤14 nodes produces readable layouts without arrow spaghetti. The cross-reference pattern for shared repos (placing personRepo/tagRepo outside the System_Boundary in 3b) is pragmatic and avoids duplicating those components across diagrams. The username→email fix on CustomUserDetailsService is a meaningful accuracy improvement.

Two structural concerns need addressing before this is a trustworthy reference document.

Blockers

1. GroupController and TagController bypass the service layer (diagram 3c)

Rel(groupCtrl, groupRepo, "Reads / writes groups", "")
Rel(tagCtrl, tagRepo, "Reads / writes tags", "")

The project's documented layering rule is: controllers never call repositories directly. These two relationships either:

Accurately document a violation in the codebase — in which case this should be flagged as tech debt, not silently normalized in the architecture docs, or
Are a diagram inaccuracy — there are GroupService/TagService components that the diagram is missing.

Given that tags have recursive CTE queries, merge/reparent logic, and subtree deletion (per the tagRepo description in the same diagram), it would be architecturally unusual to have no TagService. Please verify against the actual code and either add the service layer or note the violation explicitly.

2. Missing DashboardController in diagram 3e

The frontend diagram 3c shows:

Rel(aktivitaeten, backend, "GET /api/dashboard/activity, GET /api/notifications", "HTTP / JSON")

Diagram 3e has a dashSvc (DashboardService) but no controller to serve /api/dashboard/*. dashSvc has no incoming call from any controller shown — it's a floating component. Either add DashboardController as a component in 3e, or clarify which controller routes /api/dashboard/activity.

Suggestions

StatsController in 3e shows an incoming Rel(frontend, statsCtrl, ...) but no outgoing relationship — it doesn't delegate to anything. If it calls its own repository directly, show that. If it calls a service, add the Rel.
The minioConf → fileSvc relationship (Provides S3Client and S3Presigner beans) models DI as a runtime arrow. In C4 this is technically a deployment/infrastructure concern, not a runtime component relationship. It won't confuse readers here but is worth noting as a deviation from strict C4 semantics.

## 🏗️ Markus Keller — Senior Application Architect **Verdict: ⚠️ Approved with concerns** The split from one 28-component monolith into 9 focused sub-diagrams is the right call — dagre ≤14 nodes produces readable layouts without arrow spaghetti. The cross-reference pattern for shared repos (placing `personRepo`/`tagRepo` outside the `System_Boundary` in 3b) is pragmatic and avoids duplicating those components across diagrams. The username→email fix on `CustomUserDetailsService` is a meaningful accuracy improvement. Two structural concerns need addressing before this is a trustworthy reference document. ### Blockers **1. `GroupController` and `TagController` bypass the service layer (diagram 3c)** ``` Rel(groupCtrl, groupRepo, "Reads / writes groups", "") Rel(tagCtrl, tagRepo, "Reads / writes tags", "") ``` The project's documented layering rule is: *controllers never call repositories directly*. These two relationships either: - **Accurately document a violation** in the codebase — in which case this should be flagged as tech debt, not silently normalized in the architecture docs, or - **Are a diagram inaccuracy** — there are `GroupService`/`TagService` components that the diagram is missing. Given that tags have recursive CTE queries, merge/reparent logic, and subtree deletion (per the `tagRepo` description in the same diagram), it would be architecturally unusual to have no `TagService`. Please verify against the actual code and either add the service layer or note the violation explicitly. **2. Missing `DashboardController` in diagram 3e** The frontend diagram 3c shows: ``` Rel(aktivitaeten, backend, "GET /api/dashboard/activity, GET /api/notifications", "HTTP / JSON") ``` Diagram 3e has a `dashSvc` (DashboardService) but no controller to serve `/api/dashboard/*`. `dashSvc` has no incoming call from any controller shown — it's a floating component. Either add `DashboardController` as a component in 3e, or clarify which controller routes `/api/dashboard/activity`. ### Suggestions - `StatsController` in 3e shows an incoming `Rel(frontend, statsCtrl, ...)` but no outgoing relationship — it doesn't delegate to anything. If it calls its own repository directly, show that. If it calls a service, add the `Rel`. - The `minioConf → fileSvc` relationship (`Provides S3Client and S3Presigner beans`) models DI as a runtime arrow. In C4 this is technically a deployment/infrastructure concern, not a runtime component relationship. It won't confuse readers here but is worth noting as a deviation from strict C4 semantics.

marcel commented

2026-05-06 10:19:06 +02:00

👨‍💻 Felix Brandt — Senior Fullstack Developer

Verdict: ⚠️ Approved with concerns

Good accuracy work. The username→email fix on CustomUserDetailsService catches a real stale fact — the CLAUDE.md and AppUserRepository description both confirm the lookup is by email. The /briefwechsel rename is correct. Adding the full auth flow (register/forgot/reset) and new domains (Stammbaum, Geschichten, Aktivitaeten) makes this the first complete frontend diagram the project has had.

Two findings need resolution before this serves as a reliable reference:

Blockers

1. GroupController and TagController calling repositories directly (diagram 3c)

Rel(groupCtrl, groupRepo, "Reads / writes groups", "")
Rel(tagCtrl, tagRepo, "Reads / writes tags", "")

Per CLAUDE.md: "controllers never call repositories directly; services never reach into another domain's repository." If these relationships are accurate to the code, this is an architectural violation that should be visible as such — not silently drawn as a normal pattern. If there are GroupService/TagService classes being omitted, the diagram needs them. Please cross-check the actual GroupController.java and TagController.java and either add the intermediate service or annotate the relationship as a known deviation.

2. Missing DashboardController (diagram 3e)

dashSvc (DashboardService) assembles the activity feed and is documented correctly. But nothing calls it from outside — it's unreachable in the diagram. The frontend diagram 3c calls GET /api/dashboard/activity, so a controller must exist. Add it to 3e or explain where that endpoint lives.

Suggestions

StatsController in 3e has no outgoing Rel. What does it call to get "total persons, total documents"? If it calls DocumentRepository and PersonRepository directly (which would also be a layering concern), that should be visible. If there's a StatsService, add it.
The description of TranscriptionQueueService says "mission-control enrichment workflow" — this is slightly informal for a component description. Consider aligning with the actual method/endpoint names in the codebase.
In diagram 3b, the comment pattern for cross-diagram repos is helpful: "See diagram 3c.2. Used here by DocumentService to resolve sender / receiver persons." This is exactly the right way to handle cross-boundary references without duplicating the full component.

## 👨‍💻 Felix Brandt — Senior Fullstack Developer **Verdict: ⚠️ Approved with concerns** Good accuracy work. The username→email fix on `CustomUserDetailsService` catches a real stale fact — the CLAUDE.md and `AppUserRepository` description both confirm the lookup is by email. The `/briefwechsel` rename is correct. Adding the full auth flow (register/forgot/reset) and new domains (Stammbaum, Geschichten, Aktivitaeten) makes this the first complete frontend diagram the project has had. Two findings need resolution before this serves as a reliable reference: ### Blockers **1. `GroupController` and `TagController` calling repositories directly (diagram 3c)** ``` Rel(groupCtrl, groupRepo, "Reads / writes groups", "") Rel(tagCtrl, tagRepo, "Reads / writes tags", "") ``` Per CLAUDE.md: *"controllers never call repositories directly; services never reach into another domain's repository."* If these relationships are accurate to the code, this is an architectural violation that should be visible as such — not silently drawn as a normal pattern. If there are `GroupService`/`TagService` classes being omitted, the diagram needs them. Please cross-check the actual `GroupController.java` and `TagController.java` and either add the intermediate service or annotate the relationship as a known deviation. **2. Missing `DashboardController` (diagram 3e)** `dashSvc` (DashboardService) assembles the activity feed and is documented correctly. But nothing calls it from outside — it's unreachable in the diagram. The frontend diagram 3c calls `GET /api/dashboard/activity`, so a controller must exist. Add it to 3e or explain where that endpoint lives. ### Suggestions - `StatsController` in 3e has no outgoing `Rel`. What does it call to get "total persons, total documents"? If it calls `DocumentRepository` and `PersonRepository` directly (which would also be a layering concern), that should be visible. If there's a `StatsService`, add it. - The description of `TranscriptionQueueService` says "mission-control enrichment workflow" — this is slightly informal for a component description. Consider aligning with the actual method/endpoint names in the codebase. - In diagram 3b, the comment pattern for cross-diagram repos is helpful: `"See diagram 3c.2. Used here by DocumentService to resolve sender / receiver persons."` This is exactly the right way to handle cross-boundary references without duplicating the full component.

marcel commented

2026-05-06 10:19:14 +02:00

🚀 Tobias Wendt — DevOps & Platform Engineer

Verdict: ✅ Approved

Pure documentation change — no Compose file, no CI pipeline, no Dockerfile, no infrastructure config touched. Nothing to flag from a DevOps perspective.

What I checked:

No changes to docker-compose.yml, .gitea/workflows/, or any infrastructure file
The OCR Orchestration diagram (3d) correctly models the Python OCR service as a separate container (Container(ocrPy, "OCR Service", "Python FastAPI")) — this matches how we actually run it in Docker Compose with a separate service boundary
The ocrPy → minio presigned URL flow is accurately documented — the Python service fetches the PDF from MinIO via a presigned URL generated by the Spring backend, which is exactly the pattern we use to avoid routing large files through the JVM
The auditSvc @Async component has a note about "transaction-aware logging to prevent deadlocks on concurrent saves" — this kind of why-it-exists context is exactly what belongs in architecture docs

The test plan item "verify all 15 Mermaid blocks render without layout errors" is manual. If this project adds CI at some point, mermaid-js/mermaid-cli can lint .md files for syntax errors. Worth a future issue but not a blocker for a docs PR.

## 🚀 Tobias Wendt — DevOps & Platform Engineer **Verdict: ✅ Approved** Pure documentation change — no Compose file, no CI pipeline, no Dockerfile, no infrastructure config touched. Nothing to flag from a DevOps perspective. What I checked: - No changes to `docker-compose.yml`, `.gitea/workflows/`, or any infrastructure file - The OCR Orchestration diagram (3d) correctly models the Python OCR service as a separate container (`Container(ocrPy, "OCR Service", "Python FastAPI")`) — this matches how we actually run it in Docker Compose with a separate service boundary - The `ocrPy → minio` presigned URL flow is accurately documented — the Python service fetches the PDF from MinIO via a presigned URL generated by the Spring backend, which is exactly the pattern we use to avoid routing large files through the JVM - The `auditSvc @Async` component has a note about "transaction-aware logging to prevent deadlocks on concurrent saves" — this kind of why-it-exists context is exactly what belongs in architecture docs The test plan item "verify all 15 Mermaid blocks render without layout errors" is manual. If this project adds CI at some point, `mermaid-js/mermaid-cli` can lint `.md` files for syntax errors. Worth a future issue but not a blocker for a docs PR.

marcel commented

2026-05-06 10:19:26 +02:00

📋 Elicit — Requirements Engineer

Verdict: ⚠️ Approved with concerns

From a requirements and traceability perspective, this PR substantially improves the architecture documentation. The expanded L1 persona descriptions now correctly reflect that both Administrator and Family Member personas are transcription participants — not just readers. The route inventory in the frontend diagrams (3b, 3c, 3d) creates a useful traceability artifact linking features to backend endpoints.

Blockers

1. /api/dashboard/activity endpoint is traceable from the frontend but has no backend component

The frontend diagram 3c clearly maps the aktivitaeten page to GET /api/dashboard/activity. Tracing that endpoint into diagram 3e hits a dead end — DashboardService exists but is not reachable from any documented controller. For a requirements reviewer, this is a broken trace: the user's need (view activity feed) leads to a frontend page, which calls an endpoint, which connects to nothing in the backend diagrams.

This creates a gap in requirements traceability. If the endpoint exists in production, it should be documented in 3e.

Suggestions

The helpPage component is documented as "Static transcription style guide for Kurrent and Sütterlin character recognition. No backend calls." — the "No backend calls" annotation is a useful pattern that clarifies scope. Consider applying it consistently wherever routes are intentionally client-only.
The enrichPage description ("Guided enrichment workflow. Loader: GET /api/documents/{id}. Progressively saves annotations and transcription blocks.") accurately captures the workflow but the term "progressively saves" is slightly ambiguous — does it autosave on each annotation or only on explicit action? This is a minor documentation imprecision, not a diagram error.
The /hilfe/transkription route is documented in 3d. The CLAUDE.md route structure lists it as hilfe/transkription/ — confirm the trailing slash convention is consistent.
The invite-based registration flow is now fully documented across backend 3c (InviteController) and frontend 3a (registerPage). This is well-traced and gives a complete picture of the user registration journey.

## 📋 Elicit — Requirements Engineer **Verdict: ⚠️ Approved with concerns** From a requirements and traceability perspective, this PR substantially improves the architecture documentation. The expanded L1 persona descriptions now correctly reflect that both Administrator and Family Member personas are transcription participants — not just readers. The route inventory in the frontend diagrams (3b, 3c, 3d) creates a useful traceability artifact linking features to backend endpoints. ### Blockers **1. `/api/dashboard/activity` endpoint is traceable from the frontend but has no backend component** The frontend diagram 3c clearly maps the `aktivitaeten` page to `GET /api/dashboard/activity`. Tracing that endpoint into diagram 3e hits a dead end — `DashboardService` exists but is not reachable from any documented controller. For a requirements reviewer, this is a broken trace: the user's need (view activity feed) leads to a frontend page, which calls an endpoint, which connects to nothing in the backend diagrams. This creates a gap in requirements traceability. If the endpoint exists in production, it should be documented in 3e. ### Suggestions - The `helpPage` component is documented as "Static transcription style guide for Kurrent and Sütterlin character recognition. No backend calls." — the "No backend calls" annotation is a useful pattern that clarifies scope. Consider applying it consistently wherever routes are intentionally client-only. - The `enrichPage` description ("Guided enrichment workflow. Loader: GET /api/documents/{id}. Progressively saves annotations and transcription blocks.") accurately captures the workflow but the term "progressively saves" is slightly ambiguous — does it autosave on each annotation or only on explicit action? This is a minor documentation imprecision, not a diagram error. - The `/hilfe/transkription` route is documented in 3d. The CLAUDE.md route structure lists it as `hilfe/transkription/` — confirm the trailing slash convention is consistent. - The invite-based registration flow is now fully documented across backend 3c (`InviteController`) and frontend 3a (`registerPage`). This is well-traced and gives a complete picture of the user registration journey.

marcel commented

2026-05-06 10:19:39 +02:00

🔐 Nora "NullX" Steiner — Application Security Engineer

Verdict: ✅ Approved

This is a documentation-only PR so there are no code-level vulnerabilities to audit. What I reviewed is whether the security controls are accurately and completely documented, and whether any descriptions would mislead a future developer into missing a security requirement.

What's documented well

Anti-enumeration in forgot-password — the frontend 3a description explicitly notes: "Always responds with success to prevent email enumeration." This is exactly the kind of security rationale that belongs in architecture docs. A future developer implementing a similar endpoint will see this pattern and understand why.
CSRF reasoning — diagram 3a backend shows SecurityConfig with the correct note about CSRF being disabled. The companion description in the component doesn't explain why it's safe (Basic Auth from cookies, browsers block cross-origin custom headers), but this was true in the old diagram too and is out of scope for this PR.
Rate-limiting on invite endpoint — InviteController is correctly described as "Rate-limited via WebConfig interceptor." This documents an important control that a reviewer would otherwise miss.
Permission requirements on write endpoints — the diagrams consistently note required permissions on controllers (ADMIN, ADMIN_USER, BLOG_WRITE, WRITE_ALL). This is good: security requirements are visible at the component level.
InviteController placement — correctly placed in the auth flow (3c backend), separate from the user management controllers, which reflects the access control boundary.

Suggestion (non-blocking)

The AuthController description mentions /register, /forgot-password, /reset-password. These endpoints are permitAll() in SecurityConfig. Consider adding a one-line note to the secFilter component in 3a: "Permits: /api/auth/register, /api/auth/forgot-password, /api/auth/reset-password." Currently the only way to know which endpoints bypass auth is to cross-reference diagram 3a (security) with diagram 3c (auth). Consolidating this in the security diagram makes the attack surface immediately visible.

No blockers from a security documentation standpoint.

## 🔐 Nora "NullX" Steiner — Application Security Engineer **Verdict: ✅ Approved** This is a documentation-only PR so there are no code-level vulnerabilities to audit. What I reviewed is whether the security controls are accurately and completely documented, and whether any descriptions would mislead a future developer into missing a security requirement. ### What's documented well - **Anti-enumeration in forgot-password** — the frontend 3a description explicitly notes: *"Always responds with success to prevent email enumeration."* This is exactly the kind of security rationale that belongs in architecture docs. A future developer implementing a similar endpoint will see this pattern and understand why. - **CSRF reasoning** — diagram 3a backend shows `SecurityConfig` with the correct note about CSRF being disabled. The companion description in the component doesn't explain *why* it's safe (Basic Auth from cookies, browsers block cross-origin custom headers), but this was true in the old diagram too and is out of scope for this PR. - **Rate-limiting on invite endpoint** — `InviteController` is correctly described as "Rate-limited via WebConfig interceptor." This documents an important control that a reviewer would otherwise miss. - **Permission requirements on write endpoints** — the diagrams consistently note required permissions on controllers (ADMIN, ADMIN_USER, BLOG_WRITE, WRITE_ALL). This is good: security requirements are visible at the component level. - **InviteController placement** — correctly placed in the auth flow (3c backend), separate from the user management controllers, which reflects the access control boundary. ### Suggestion (non-blocking) The `AuthController` description mentions `/register`, `/forgot-password`, `/reset-password`. These endpoints are `permitAll()` in `SecurityConfig`. Consider adding a one-line note to the `secFilter` component in 3a: *"Permits: /api/auth/register, /api/auth/forgot-password, /api/auth/reset-password."* Currently the only way to know which endpoints bypass auth is to cross-reference diagram 3a (security) with diagram 3c (auth). Consolidating this in the security diagram makes the attack surface immediately visible. No blockers from a security documentation standpoint.

marcel commented

2026-05-06 10:19:48 +02:00

🧪 Sara Holt — Senior QA Engineer

Verdict: ⚠️ Approved with concerns

Documentation PR — no production code changed, so the test pyramid analysis focuses on whether the documentation itself is verifiable. The PR's test plan is a good start but has a gap that could allow stale diagrams to go undetected in the future.

Blockers

1. Test plan item 2 is manual with no tooling path

"Confirm no component name in any diagram is missing from the codebase"

This is currently a manual check. It will drift immediately — the next time a controller is renamed, nobody will update the diagram. For a documentation PR that explicitly aims for accuracy, there should be at least a note on how this would be verified systematically. A lightweight approach: a CI grep that checks a sample of component names from the diagrams against find backend/src -name "*.java". Even if not automated now, documenting the verification method makes future drift visible.

Suggestions

The test plan item 1 ("verify all 15 Mermaid blocks render without layout errors") is manual. The Mermaid CLI (@mermaid-js/mermaid-cli) can parse and render .md files in CI — a single mmdc --check run would catch syntax errors at PR time. This doesn't need to block this PR but deserves a follow-up issue.
The diagram split (9 backend sub-diagrams + 4 frontend sub-diagrams) makes each diagram individually testable — you can verify one sub-domain at a time rather than the entire system at once. This is an improvement from a QA perspective. The original monolithic diagram would have required reviewing all 28+ components simultaneously.
The PR description says "15 Mermaid blocks total (was 4)". The test plan references 15 blocks — good alignment between claim and test scope.

## 🧪 Sara Holt — Senior QA Engineer **Verdict: ⚠️ Approved with concerns** Documentation PR — no production code changed, so the test pyramid analysis focuses on whether the documentation itself is verifiable. The PR's test plan is a good start but has a gap that could allow stale diagrams to go undetected in the future. ### Blockers **1. Test plan item 2 is manual with no tooling path** > "Confirm no component name in any diagram is missing from the codebase" This is currently a manual check. It will drift immediately — the next time a controller is renamed, nobody will update the diagram. For a documentation PR that explicitly aims for accuracy, there should be at least a note on *how* this would be verified systematically. A lightweight approach: a CI grep that checks a sample of component names from the diagrams against `find backend/src -name "*.java"`. Even if not automated now, documenting the verification method makes future drift visible. ### Suggestions - The test plan item 1 ("verify all 15 Mermaid blocks render without layout errors") is manual. The Mermaid CLI (`@mermaid-js/mermaid-cli`) can parse and render `.md` files in CI — a single `mmdc --check` run would catch syntax errors at PR time. This doesn't need to block this PR but deserves a follow-up issue. - The diagram split (9 backend sub-diagrams + 4 frontend sub-diagrams) makes each diagram individually testable — you can verify one sub-domain at a time rather than the entire system at once. This is an improvement from a QA perspective. The original monolithic diagram would have required reviewing all 28+ components simultaneously. - The PR description says "15 Mermaid blocks total (was 4)". The test plan references 15 blocks — good alignment between claim and test scope.

marcel commented

2026-05-06 10:19:58 +02:00

🎨 Leonie Voss — UX Designer & Accessibility Strategist

Verdict: ✅ Approved

This is a pure architecture documentation PR — no Svelte components, no Tailwind, no UI changes. Nothing to flag from a UX or accessibility perspective.

What I did check from a documentation usability standpoint:

Readability improvement is real. The old L3 backend was a single 28-component diagram — un-navigable in any rendered view. The split into sub-diagrams by domain (security, documents, transcription, persons, OCR, supporting) maps to how a developer actually navigates the codebase. Someone working on the transcription pipeline now has a focused diagram (3b.2) rather than a needle-in-a-haystack experience.
The frontend diagrams (3a–3d) now match what users actually see. The Stammbaum, Geschichten, Briefwechsel, and Aktivitaeten features were invisible in the old docs. A new developer onboarding to the frontend now has a complete picture of the route structure and what each page calls.
The dual-audience of the app is correctly reflected in L1. Both Administrator and Family Member are now described as transcription participants, which matches the actual user workflow for this archive application.

No design or accessibility concerns for this PR. The documentation improvement is clear and well-structured.

## 🎨 Leonie Voss — UX Designer & Accessibility Strategist **Verdict: ✅ Approved** This is a pure architecture documentation PR — no Svelte components, no Tailwind, no UI changes. Nothing to flag from a UX or accessibility perspective. What I did check from a documentation usability standpoint: - **Readability improvement is real.** The old L3 backend was a single 28-component diagram — un-navigable in any rendered view. The split into sub-diagrams by domain (security, documents, transcription, persons, OCR, supporting) maps to how a developer actually navigates the codebase. Someone working on the transcription pipeline now has a focused diagram (3b.2) rather than a needle-in-a-haystack experience. - **The frontend diagrams (3a–3d) now match what users actually see.** The Stammbaum, Geschichten, Briefwechsel, and Aktivitaeten features were invisible in the old docs. A new developer onboarding to the frontend now has a complete picture of the route structure and what each page calls. - **The dual-audience of the app is correctly reflected in L1.** Both Administrator and Family Member are now described as transcription participants, which matches the actual user workflow for this archive application. No design or accessibility concerns for this PR. The documentation improvement is clear and well-structured.

marcel commented

2026-05-06 10:30:18 +02:00

Resolution — Blocker fixes (commit `42c6e58d`)

All three blockers addressed. Every finding was a diagram inaccuracy — the actual backend code follows the layering rules correctly throughout.

Blocker 1 + 2 — Diagram 3c: `GroupController` and `TagController` service layer

Verified against the source:

GroupController.java injects UserService and delegates all group operations to it (no direct repo call)
TagController.java injects TagService and delegates all tag operations to it (no direct repo call)

Fixed: Rel(groupCtrl, groupRepo, ...) → Rel(groupCtrl, userSvc, "Delegates to", ""). Added missing TagService component inside the boundary. Changed Rel(tagCtrl, tagRepo, ...) → Rel(tagCtrl, tagSvc, ...) + Rel(tagSvc, tagRepo, ...).

Blocker 3 — Diagram 3e: missing `DashboardController` and `StatsController` outgoing rel

Verified against the source:

DashboardController.java exists, mapped to /api/dashboard, serves /resume, /pulse, and /activity. Delegates to DashboardService.
StatsController.java delegates to StatsService, which queries aggregate counts.

Fixed: Added DashboardController component with incoming Rel from frontend and outgoing delegation to dashSvc. Added StatsService component with statsCtrl → statsSvc → db chain.

Sara's concern — verification method for component name accuracy

The concern is valid. A lightweight approach that doesn't require CI infrastructure:

# Extract component names from the diagrams
grep -oP 'Component\(\K[^,]+' docs/architecture/c4-diagrams.md | sort -u

# Cross-check against Java class names in the backend
find backend/src/main/java -name "*.java" -printf "%f\n" | sed 's/\.java$//' | sort -u

Diffing these two lists catches any renamed or removed class that the diagram still references. I'll open a follow-up issue to track adding this as a CI lint step.

Node counts after additions: diagram 3c = 11 nodes, diagram 3e = 14 nodes — both within the ≤14 dagre budget noted in the PR description.

## Resolution — Blocker fixes (commit 42c6e58d) All three blockers addressed. Every finding was a **diagram inaccuracy** — the actual backend code follows the layering rules correctly throughout. ### Blocker 1 + 2 — Diagram 3c: `GroupController` and `TagController` service layer Verified against the source: - `GroupController.java` injects `UserService` and delegates all group operations to it (no direct repo call) - `TagController.java` injects `TagService` and delegates all tag operations to it (no direct repo call) **Fixed:** `Rel(groupCtrl, groupRepo, ...)` → `Rel(groupCtrl, userSvc, "Delegates to", "")`. Added missing `TagService` component inside the boundary. Changed `Rel(tagCtrl, tagRepo, ...)` → `Rel(tagCtrl, tagSvc, ...) + Rel(tagSvc, tagRepo, ...)`. ### Blocker 3 — Diagram 3e: missing `DashboardController` and `StatsController` outgoing rel Verified against the source: - `DashboardController.java` exists, mapped to `/api/dashboard`, serves `/resume`, `/pulse`, and `/activity`. Delegates to `DashboardService`. - `StatsController.java` delegates to `StatsService`, which queries aggregate counts. **Fixed:** Added `DashboardController` component with incoming `Rel` from frontend and outgoing delegation to `dashSvc`. Added `StatsService` component with `statsCtrl → statsSvc → db` chain. --- ### Sara's concern — verification method for component name accuracy The concern is valid. A lightweight approach that doesn't require CI infrastructure: ```bash # Extract component names from the diagrams grep -oP 'Component\(\K[^,]+' docs/architecture/c4-diagrams.md | sort -u # Cross-check against Java class names in the backend find backend/src/main/java -name "*.java" -printf "%f\n" | sed 's/\.java$//' | sort -u ``` Diffing these two lists catches any renamed or removed class that the diagram still references. I'll open a follow-up issue to track adding this as a CI lint step. --- Node counts after additions: diagram 3c = 11 nodes, diagram 3e = 14 nodes — both within the ≤14 dagre budget noted in the PR description.

marcel commented

2026-05-06 10:44:40 +02:00

🏗️ Markus Keller — Senior Application Architect

Verdict: ⚠️ Approved with concerns

Splitting 4 overloaded Mermaid blocks into 15 focused sub-diagrams was the right call — dagre stops making spaghetti at this granularity, and each diagram now covers exactly one bounded context. The new sub-diagrams (3b.2 Transcription Pipeline, 3c.2 Persons & Family Graph, 3d OCR Orchestration, 3e Supporting Domains) fill gaps that mattered.

That said, there are two accuracy concerns that a future developer could use as a blueprint to introduce real layering violations.

Blockers

Diagram 3b: docSvc is drawn accessing personRepo and tagRepo directly

Rel(docSvc, personRepo, "Resolves sender / receivers", "")
Rel(docSvc, tagRepo, "Finds or creates tags", "")

The CLAUDE.md layering rule is explicit: "services never reach into another domain's repository — always call the other domain's service instead." If this diagram is accurate, it documents a real violation. If it's a simplification, it will mislead anyone who reads the architecture docs as implementation guidance.

Recommendation: Either draw docSvc → personSvc and docSvc → tagSvc (the correct path), or add a note on the component that this is a documented exception with a justification. Don't let the diagram silently model the wrong pattern.

Diagram 3c: groupCtrl and tagCtrl are drawn calling their repositories directly

Rel(groupCtrl, groupRepo, "Reads / writes groups", "")
Rel(tagCtrl, tagRepo, "Reads / writes tags", "")

Controllers calling repositories is prohibited by the same layering rule. If no GroupService / TagService exists, that's an existing gap worth noting. If they do exist, they're missing from the diagram.

Suggestions

Diagram 3e shows dashSvc with an incoming Rel(dashSvc, auditQuery, ...) but no frontend → dashSvc relationship. If the DashboardService is used by an API endpoint, that entry-point is missing. If it's only called internally, clarify that.
The OCR diagram (3d) is the most complex at 13 nodes — still within dagre's comfort zone, but worth watching if new components are added.

Overall the content is a significant improvement. The layering inaccuracies are the only things worth resolving before this becomes the canonical reference.

## 🏗️ Markus Keller — Senior Application Architect **Verdict: ⚠️ Approved with concerns** Splitting 4 overloaded Mermaid blocks into 15 focused sub-diagrams was the right call — dagre stops making spaghetti at this granularity, and each diagram now covers exactly one bounded context. The new sub-diagrams (3b.2 Transcription Pipeline, 3c.2 Persons & Family Graph, 3d OCR Orchestration, 3e Supporting Domains) fill gaps that mattered. That said, there are two accuracy concerns that a future developer could use as a blueprint to introduce real layering violations. ### Blockers **Diagram 3b: `docSvc` is drawn accessing `personRepo` and `tagRepo` directly** ``` Rel(docSvc, personRepo, "Resolves sender / receivers", "") Rel(docSvc, tagRepo, "Finds or creates tags", "") ``` The CLAUDE.md layering rule is explicit: *"services never reach into another domain's repository — always call the other domain's service instead."* If this diagram is accurate, it documents a real violation. If it's a simplification, it will mislead anyone who reads the architecture docs as implementation guidance. **Recommendation**: Either draw `docSvc → personSvc` and `docSvc → tagSvc` (the correct path), or add a note on the component that this is a documented exception with a justification. Don't let the diagram silently model the wrong pattern. **Diagram 3c: `groupCtrl` and `tagCtrl` are drawn calling their repositories directly** ``` Rel(groupCtrl, groupRepo, "Reads / writes groups", "") Rel(tagCtrl, tagRepo, "Reads / writes tags", "") ``` Controllers calling repositories is prohibited by the same layering rule. If no `GroupService` / `TagService` exists, that's an existing gap worth noting. If they do exist, they're missing from the diagram. ### Suggestions - **Diagram 3e** shows `dashSvc` with an incoming `Rel(dashSvc, auditQuery, ...)` but no frontend → dashSvc relationship. If the DashboardService is used by an API endpoint, that entry-point is missing. If it's only called internally, clarify that. - The OCR diagram (3d) is the most complex at 13 nodes — still within dagre's comfort zone, but worth watching if new components are added. Overall the content is a significant improvement. The layering inaccuracies are the only things worth resolving before this becomes the canonical reference.

marcel commented

2026-05-06 10:45:01 +02:00

👨‍💻 Felix Brandt — Senior Fullstack Developer

Verdict: ⚠️ Approved with concerns

Docs-only PR, so TDD doesn't apply directly — but I read architecture docs the same way I read code: they should be accurate enough to implement from. These diagrams will be read by whoever touches these domains next, and two patterns caught my eye.

Blockers

The groupCtrl → groupRepo and tagCtrl → tagRepo direct relationships in diagram 3c

Rel(groupCtrl, groupRepo, "Reads / writes groups", "")
Rel(tagCtrl, tagRepo, "Reads / writes tags", "")

If this is accurate, then GroupController and TagController are hitting repositories directly — that puts business logic in the wrong layer and makes it untestable without a web context. If it's a simplification, the diagram is actively misleading. Either fix the diagram or open a follow-up issue to introduce GroupService / TagService.

The docSvc → personRepo and docSvc → tagRepo relationships in diagram 3b

Same concern as Markus raised. The correct shape is docSvc → personSvc → personRepo. If the real code bypasses PersonService, that's a testability problem — PersonService.getById() would carry validation logic that DocumentService silently skips.

Suggestions

Diagram 3b.2: TranscriptionService and AnnotationService both have direct Rel(..., db, ...) arrows that bypass their repositories. For consistency with how every other service is drawn, these should route through named repository components. If the services call the DB directly via JdbcTemplate (not JPA), that's worth noting explicitly.
The enrichPage component in diagram 3b calls GET/POST /api/transcription and POST /api/documents/{id}/annotations — the transcription endpoint path is just /api/transcription, but in diagram 3b.2 the controller is described as /api/transcription. Looks consistent, just confirming the path is stable.
Naming: transcriptionQueueSvc is a great name in the diagram. Just make sure the actual class name matches — having diagram names diverge from class names is how docs go stale again.

The split from monolith diagrams to focused per-domain views is excellent. Fix the layering inaccuracies and this becomes genuinely useful documentation.

## 👨‍💻 Felix Brandt — Senior Fullstack Developer **Verdict: ⚠️ Approved with concerns** Docs-only PR, so TDD doesn't apply directly — but I read architecture docs the same way I read code: they should be accurate enough to implement from. These diagrams will be read by whoever touches these domains next, and two patterns caught my eye. ### Blockers **The `groupCtrl → groupRepo` and `tagCtrl → tagRepo` direct relationships in diagram 3c** ``` Rel(groupCtrl, groupRepo, "Reads / writes groups", "") Rel(tagCtrl, tagRepo, "Reads / writes tags", "") ``` If this is accurate, then `GroupController` and `TagController` are hitting repositories directly — that puts business logic in the wrong layer and makes it untestable without a web context. If it's a simplification, the diagram is actively misleading. Either fix the diagram or open a follow-up issue to introduce `GroupService` / `TagService`. **The `docSvc → personRepo` and `docSvc → tagRepo` relationships in diagram 3b** Same concern as Markus raised. The correct shape is `docSvc → personSvc → personRepo`. If the real code bypasses `PersonService`, that's a testability problem — `PersonService.getById()` would carry validation logic that `DocumentService` silently skips. ### Suggestions - **Diagram 3b.2**: `TranscriptionService` and `AnnotationService` both have direct `Rel(..., db, ...)` arrows that bypass their repositories. For consistency with how every other service is drawn, these should route through named repository components. If the services call the DB directly via JdbcTemplate (not JPA), that's worth noting explicitly. - The `enrichPage` component in diagram 3b calls `GET/POST /api/transcription` and `POST /api/documents/{id}/annotations` — the transcription endpoint path is just `/api/transcription`, but in diagram 3b.2 the controller is described as `/api/transcription`. Looks consistent, just confirming the path is stable. - Naming: `transcriptionQueueSvc` is a great name in the diagram. Just make sure the actual class name matches — having diagram names diverge from class names is how docs go stale again. The split from monolith diagrams to focused per-domain views is excellent. Fix the layering inaccuracies and this becomes genuinely useful documentation.

marcel commented

2026-05-06 10:45:02 +02:00

🚀 Tobias Wendt — DevOps & Platform Engineer

Verdict: ✅ Approved

Docs-only change touching docs/architecture/c4-diagrams.md. No infrastructure files, no Compose changes, no CI modifications.

What I checked

No new services or containers introduced — the OCR Python service and MinIO were already in the stack; their inclusion in diagram 3d is documentation of existing reality, not a proposal.
The diagram correctly shows MinIO in diagram 3d with Rel(ocrPy, minio, "Fetches PDF via presigned URL", "HTTP / S3 presigned") — that's the correct pattern for keeping the OCR Python service from needing application credentials.
No hardcoded endpoints, secrets, or credentials visible in the diagram content.

Nothing here touches my domain. Merge when the layering concerns from Markus and Felix are resolved.

## 🚀 Tobias Wendt — DevOps & Platform Engineer **Verdict: ✅ Approved** Docs-only change touching `docs/architecture/c4-diagrams.md`. No infrastructure files, no Compose changes, no CI modifications. ### What I checked - No new services or containers introduced — the OCR Python service and MinIO were already in the stack; their inclusion in diagram 3d is documentation of existing reality, not a proposal. - The diagram correctly shows MinIO in diagram 3d with `Rel(ocrPy, minio, "Fetches PDF via presigned URL", "HTTP / S3 presigned")` — that's the correct pattern for keeping the OCR Python service from needing application credentials. - No hardcoded endpoints, secrets, or credentials visible in the diagram content. Nothing here touches my domain. Merge when the layering concerns from Markus and Felix are resolved.

marcel commented

2026-05-06 10:45:07 +02:00

📋 Elicit — Requirements Engineer

Verdict: ✅ Approved

Reviewed from a requirements completeness and traceability standpoint.

What I checked

Coverage: The 15 diagrams now cover all major subsystems described in CLAUDE.md and the PR description table. The pre-PR docs were missing the Transcription Pipeline, Persons/Family Graph, OCR Orchestration, and Supporting Domains — all four are now documented. This closes a significant documentation gap.

Accuracy of actor descriptions: L1 was updated to reflect that both admins and family members transcribe documents. This is consistent with the product framing I understand from the codebase (device split: transcribers on laptop/tablet, readers on phones).

Route coverage: The frontend diagrams now cover the routes listed in CLAUDE.md's Route Structure section. Key additions — /briefwechsel (was still /conversations in the old diagram), /aktivitaeten, /geschichten, /stammbaum, /enrich/[id], the full admin panel hierarchy, and the auth flows (register, forgot-password, reset-password).

Suggestions (non-blocking)

The hilfe component in diagram 3d is described as having "No backend calls" — this is a useful detail, but it also means it's a purely static page. Worth confirming in the diagram description that it's intentionally static (no future dynamic content planned), or leave a note in the issue tracker if that's a known future need.
The test plan ("Verify all 15 Mermaid blocks render") is sufficient for a docs-only PR. For a more thorough acceptance test, the second criterion ("Confirm no component name in any diagram is missing from the codebase") would benefit from a code grep or naming convention check, since RelationshipInferenceService, OcrAsyncRunner, etc. are referenced — worth confirming those are actual class names.

No requirements gaps found.

## 📋 Elicit — Requirements Engineer **Verdict: ✅ Approved** Reviewed from a requirements completeness and traceability standpoint. ### What I checked **Coverage**: The 15 diagrams now cover all major subsystems described in CLAUDE.md and the PR description table. The pre-PR docs were missing the Transcription Pipeline, Persons/Family Graph, OCR Orchestration, and Supporting Domains — all four are now documented. This closes a significant documentation gap. **Accuracy of actor descriptions**: L1 was updated to reflect that both admins and family members transcribe documents. This is consistent with the product framing I understand from the codebase (device split: transcribers on laptop/tablet, readers on phones). **Route coverage**: The frontend diagrams now cover the routes listed in CLAUDE.md's Route Structure section. Key additions — `/briefwechsel` (was still `/conversations` in the old diagram), `/aktivitaeten`, `/geschichten`, `/stammbaum`, `/enrich/[id]`, the full admin panel hierarchy, and the auth flows (register, forgot-password, reset-password). ### Suggestions (non-blocking) - The `hilfe` component in diagram 3d is described as having "No backend calls" — this is a useful detail, but it also means it's a purely static page. Worth confirming in the diagram description that it's intentionally static (no future dynamic content planned), or leave a note in the issue tracker if that's a known future need. - The test plan ("Verify all 15 Mermaid blocks render") is sufficient for a docs-only PR. For a more thorough acceptance test, the second criterion ("Confirm no component name in any diagram is missing from the codebase") would benefit from a code grep or naming convention check, since `RelationshipInferenceService`, `OcrAsyncRunner`, etc. are referenced — worth confirming those are actual class names. No requirements gaps found.

marcel commented

2026-05-06 10:45:18 +02:00

🔐 Nora "NullX" Steiner — Application Security Engineer

Verdict: ✅ Approved

Docs-only PR. My review is focused on whether the security architecture is accurately documented and whether the diagrams reveal any security gaps worth tracking.

What I checked

Diagram 3a (Security & Authentication) is the most security-relevant diagram. Key observations:

CustomUserDetailsService description was corrected from "Loads AppUser by username" to "Loads AppUser by email" — this is an important accuracy fix. The old description could have misled a developer looking for the authentication lookup to the wrong field.
SecurityConfig correctly documents CSRF-disabled with the permitted endpoints listed (password-reset, invite, register). The original diagram had no detail here — this is a meaningful improvement.
PermissionAspect description accurately reflects that it guards @RequirePermission annotations via AOP.

Diagram 3d (OCR) correctly shows that the Python OCR service fetches PDFs via presigned URLs, not by receiving the PDF directly over HTTP. This is the correct architecture to avoid sending large files through the application backend.

Suggestions (non-blocking)

Diagram 3a: The secFilter description says it "Permits password-reset, invite, and register endpoints without authentication." It would be worth noting (even as a diagram comment) whether the rate-limiting interceptor from WebConfig is also active on these endpoints — the invite controller description mentions it, but diagram 3a doesn't show it. Not a blocker for docs, but that's the kind of detail that gets forgotten when the security picture is reviewed.
Diagram 3b.2 (Transcription): The CommentController description mentions "@mention support." Any @mention parsing involves user-controlled input being resolved to user identities — worth confirming the mention parser sanitizes content before storing. Not a diagram problem, just a note for future audit scope.

No security architecture inaccuracies found in this PR.

## 🔐 Nora "NullX" Steiner — Application Security Engineer **Verdict: ✅ Approved** Docs-only PR. My review is focused on whether the security architecture is accurately documented and whether the diagrams reveal any security gaps worth tracking. ### What I checked **Diagram 3a (Security & Authentication)** is the most security-relevant diagram. Key observations: - `CustomUserDetailsService` description was corrected from "Loads AppUser by **username**" to "Loads AppUser by **email**" — this is an important accuracy fix. The old description could have misled a developer looking for the authentication lookup to the wrong field. - `SecurityConfig` correctly documents CSRF-disabled with the permitted endpoints listed (password-reset, invite, register). The original diagram had no detail here — this is a meaningful improvement. - `PermissionAspect` description accurately reflects that it guards `@RequirePermission` annotations via AOP. **Diagram 3d (OCR)** correctly shows that the Python OCR service fetches PDFs via presigned URLs, not by receiving the PDF directly over HTTP. This is the correct architecture to avoid sending large files through the application backend. ### Suggestions (non-blocking) - **Diagram 3a**: The `secFilter` description says it "Permits password-reset, invite, and register endpoints without authentication." It would be worth noting (even as a diagram comment) whether the rate-limiting interceptor from `WebConfig` is also active on these endpoints — the invite controller description mentions it, but diagram 3a doesn't show it. Not a blocker for docs, but that's the kind of detail that gets forgotten when the security picture is reviewed. - **Diagram 3b.2 (Transcription)**: The `CommentController` description mentions "@mention support." Any `@mention` parsing involves user-controlled input being resolved to user identities — worth confirming the mention parser sanitizes content before storing. Not a diagram problem, just a note for future audit scope. No security architecture inaccuracies found in this PR.

marcel commented

2026-05-06 10:45:28 +02:00

🧪 Sara Holt — QA Engineer & Test Strategist

Verdict: ✅ Approved

Docs-only PR — no test coverage changes. My review checks whether the diagrams accurately reflect what is (or should be) testable, and whether the documented architecture creates any test coverage blind spots.

What I checked

Test plan adequacy: The PR's test plan is:

Verify all 15 Mermaid blocks render without layout errors
Confirm no component name in any diagram is missing from the codebase

Point 1 is straightforward and appropriate. Point 2 is the more valuable one — the old diagrams had stale component names that no longer existed in the codebase. Running a grep pass for each named Component(...) before merge would validate this PR's accuracy claim concretely.

Documentation of async / testability-sensitive patterns: The diagrams correctly distinguish @Async components:

MassImportService — Spring Service — @Async
OcrAsyncRunner — Spring Component — @Async
AuditService — Spring Service — @Async

These are exactly the components where @Transactional test rollback doesn't work cleanly, and where Awaitility is needed in integration tests. The diagrams now make these patterns visible.

Suggestions (non-blocking)

The TranscriptionQueueService description says it "Exposes segmentation, transcription, and review queue projections." If this service is the entry point for the enrichment workflow's queue, there should be integration tests covering queue state transitions — worth checking if those exist or if this is undocumented.
SseEmitterRegistry is described as an "In-memory ConcurrentHashMap." In-memory SSE registries are not testable with @Transactional rollback and require explicit cleanup between tests. Worth flagging for any future test that hits notification flows.

No blockers for this PR — it's documentation only and accurate on the testability-relevant patterns.

## 🧪 Sara Holt — QA Engineer & Test Strategist **Verdict: ✅ Approved** Docs-only PR — no test coverage changes. My review checks whether the diagrams accurately reflect what is (or should be) testable, and whether the documented architecture creates any test coverage blind spots. ### What I checked **Test plan adequacy**: The PR's test plan is: 1. Verify all 15 Mermaid blocks render without layout errors 2. Confirm no component name in any diagram is missing from the codebase Point 1 is straightforward and appropriate. Point 2 is the more valuable one — the old diagrams had stale component names that no longer existed in the codebase. Running a grep pass for each named `Component(...)` before merge would validate this PR's accuracy claim concretely. **Documentation of async / testability-sensitive patterns**: The diagrams correctly distinguish `@Async` components: - `MassImportService` — `Spring Service — @Async` - `OcrAsyncRunner` — `Spring Component — @Async` - `AuditService` — `Spring Service — @Async` These are exactly the components where `@Transactional` test rollback doesn't work cleanly, and where `Awaitility` is needed in integration tests. The diagrams now make these patterns visible. ### Suggestions (non-blocking) - The `TranscriptionQueueService` description says it "Exposes segmentation, transcription, and review queue projections." If this service is the entry point for the enrichment workflow's queue, there should be integration tests covering queue state transitions — worth checking if those exist or if this is undocumented. - `SseEmitterRegistry` is described as an "In-memory ConcurrentHashMap." In-memory SSE registries are not testable with `@Transactional` rollback and require explicit cleanup between tests. Worth flagging for any future test that hits notification flows. No blockers for this PR — it's documentation only and accurate on the testability-relevant patterns.

marcel commented

2026-05-06 10:45:35 +02:00

🎨 Leonie Voss — UX Designer & Accessibility Strategist

Verdict: ✅ Approved

Docs-only PR touching architecture documentation. No UI/UX or accessibility changes — nothing in my design domain is affected.

What I noted

The frontend diagram split (3a–3d) mirrors the visual structure of the app well. The four-way decomposition — Middleware & Auth / Document Workflows / People & Discovery / Admin & Help — maps closely to the navigation zones a user actually experiences.

One observation that touches the design/architecture boundary: diagram 3c documents /stammbaum as using a "D3-based Stammbaum" visualisation. The family tree is one of the most interaction-heavy pages in the app, and D3 with complex graph layouts is the hardest surface to make accessible and mobile-responsive. This diagram is the first formal documentation of that component existing — if it's not yet covered by accessibility tests in E2E, that's worth a follow-up issue. Not a blocker for this docs PR.

No concerns with the documentation itself.

## 🎨 Leonie Voss — UX Designer & Accessibility Strategist **Verdict: ✅ Approved** Docs-only PR touching architecture documentation. No UI/UX or accessibility changes — nothing in my design domain is affected. ### What I noted The frontend diagram split (3a–3d) mirrors the visual structure of the app well. The four-way decomposition — Middleware & Auth / Document Workflows / People & Discovery / Admin & Help — maps closely to the navigation zones a user actually experiences. One observation that touches the design/architecture boundary: **diagram 3c** documents `/stammbaum` as using a "D3-based Stammbaum" visualisation. The family tree is one of the most interaction-heavy pages in the app, and D3 with complex graph layouts is the hardest surface to make accessible and mobile-responsive. This diagram is the first formal documentation of that component existing — if it's not yet covered by accessibility tests in E2E, that's worth a follow-up issue. Not a blocker for this docs PR. No concerns with the documentation itself.

marcel referenced this issue from a commit

2026-05-06 11:04:10 +02:00

docs(c4): fix diagram 3c service layer and add missing 3e components

marcel added 3 commits 2026-05-06 11:04:10 +02:00

docs(c4): fix diagram 3c service layer and add missing 3e components 42c6e58d36

- diagram 3c: GroupController delegates to UserService (not groupRepo directly)
- diagram 3c: add TagService; TagController delegates to TagService (not tagRepo)
- diagram 3e: add DashboardController serving /api/dashboard/resume|pulse|activity
- diagram 3e: add StatsService; StatsController delegates to StatsService

Addresses blocker feedback from Markus, Felix, and Elicit in PR #448 review.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs: remove accidentally committed spec file 18a7ee1fa5

Spec file was pre-staged from a prior session and bundled into the previous commit. Specs belong in Gitea issues, not committed to the repo.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs(c4): fix service layer relationships in diagrams 3b and 3b.2

CI / Unit & Component Tests (push) Failing after 3m33s

Details

CI / OCR Service Tests (push) Successful in 35s

Details

CI / Backend Unit Tests (push) Failing after 3m22s

Details

CI / Unit & Component Tests (pull_request) Failing after 3m28s

Details

CI / OCR Service Tests (pull_request) Successful in 31s

Details

CI / Backend Unit Tests (pull_request) Failing after 3m16s

Details

83c3d85b00

Diagram 3b: DocumentService calls PersonService and TagService, not
their repositories directly. Replace personRepo/tagRepo cross-ref
stubs with personSvc/tagSvc to accurately reflect the layering rule.

Diagram 3b.2: TranscriptionService, AnnotationService, and
CommentService each use a JPA repository, not JDBC directly. Add
TranscriptionBlockRepository, AnnotationRepository, and
CommentRepository components and route the service→repo→db chain.
TranscriptionQueueService delegates to DocumentService and
AuditLogQueryService (no repo of its own); replace the incorrect
→db arrow with cross-diagram stubs.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

marcel commented

2026-05-06 11:04:31 +02:00

Resolution — Round 2 blocker fixes (commit `83c3d85b`)

All round 2 concerns addressed. Both findings were diagram inaccuracies — the actual backend code follows the layering rules correctly throughout.

Blocker — Diagram 3b: `docSvc → personRepo/tagRepo` direct relationships

Verified against source (DocumentService.java):

Line 71: private final PersonService personService;
Line 73: private final TagService tagService;

DocumentService never touches PersonRepository or TagRepository directly — it goes through the service layer as the layering rule requires.

Fixed: Replaced the personRepo/tagRepo cross-ref stubs outside the System_Boundary with personSvc/tagSvc stubs. Changed:

Rel(docSvc, personRepo, ...) → Rel(docSvc, personSvc, "Resolves sender / receivers", "")
Rel(docSvc, tagRepo, ...) → Rel(docSvc, tagSvc, "Finds or creates tags", "")
Removed personRepo → db and tagRepo → db stubs (those details live in diagrams 3c.2 and 3c respectively)

Suggestion (Felix) — Diagram 3b.2: services routing directly to `db`

Verified against source:

TranscriptionService → TranscriptionBlockRepository, TranscriptionBlockVersionRepository
AnnotationService → AnnotationRepository
CommentService → CommentRepository
TranscriptionQueueService → DocumentService + AuditLogQueryService (no repository of its own — pure cross-domain delegation)

The direct → db arrows were all inaccurate. These services use JPA repositories (or delegate to other services), never JDBC directly.

Fixed:

Added TranscriptionBlockRepository, AnnotationRepository, CommentRepository components inside the System_Boundary
Routed transcriptionSvc → blockRepo → db, annotationSvc → annotationRepo → db, commentSvc → commentRepo → db
For transcriptionQueueSvc: replaced the incorrect → db arrow with cross-diagram stubs DocumentService (see 3b) and AuditLogQueryService (see 3e) and correct delegation relationships
Updated transcriptionQueueSvc component description to reflect actual behavior

Node counts after additions: diagram 3b.2 = 14 nodes (3 controllers + 4 services + 3 repos + 2 cross-ref stubs + frontend + db) — at the ≤14 dagre budget.

## Resolution — Round 2 blocker fixes (commit 83c3d85b) All round 2 concerns addressed. Both findings were **diagram inaccuracies** — the actual backend code follows the layering rules correctly throughout. --- ### Blocker — Diagram 3b: `docSvc → personRepo/tagRepo` direct relationships Verified against source (`DocumentService.java`): - Line 71: `private final PersonService personService;` - Line 73: `private final TagService tagService;` `DocumentService` never touches `PersonRepository` or `TagRepository` directly — it goes through the service layer as the layering rule requires. **Fixed:** Replaced the `personRepo`/`tagRepo` cross-ref stubs outside the `System_Boundary` with `personSvc`/`tagSvc` stubs. Changed: - `Rel(docSvc, personRepo, ...)` → `Rel(docSvc, personSvc, "Resolves sender / receivers", "")` - `Rel(docSvc, tagRepo, ...)` → `Rel(docSvc, tagSvc, "Finds or creates tags", "")` - Removed `personRepo → db` and `tagRepo → db` stubs (those details live in diagrams 3c.2 and 3c respectively) --- ### Suggestion (Felix) — Diagram 3b.2: services routing directly to `db` Verified against source: - `TranscriptionService` → `TranscriptionBlockRepository`, `TranscriptionBlockVersionRepository` - `AnnotationService` → `AnnotationRepository` - `CommentService` → `CommentRepository` - `TranscriptionQueueService` → `DocumentService` + `AuditLogQueryService` (no repository of its own — pure cross-domain delegation) The direct `→ db` arrows were all inaccurate. These services use JPA repositories (or delegate to other services), never JDBC directly. **Fixed:** - Added `TranscriptionBlockRepository`, `AnnotationRepository`, `CommentRepository` components inside the `System_Boundary` - Routed `transcriptionSvc → blockRepo → db`, `annotationSvc → annotationRepo → db`, `commentSvc → commentRepo → db` - For `transcriptionQueueSvc`: replaced the incorrect `→ db` arrow with cross-diagram stubs `DocumentService` (see 3b) and `AuditLogQueryService` (see 3e) and correct delegation relationships - Updated `transcriptionQueueSvc` component description to reflect actual behavior --- Node counts after additions: diagram 3b.2 = 14 nodes (3 controllers + 4 services + 3 repos + 2 cross-ref stubs + frontend + db) — at the ≤14 dagre budget.

marcel commented

2026-05-06 11:18:18 +02:00

🏗️ Markus Keller — Senior Application Architect

Verdict: ⚠️ Approved with concerns

The split into 15 focused diagrams is the right call — dagre struggles beyond ~14 nodes and the original monolith diagrams were unreadable. The cross-diagram reference pattern (external stubs with "See diagram X" annotations) is a clean way to show inter-domain dependencies without duplicating entire sub-diagrams.

Blockers

Diagram 3d — OcrAsyncRunner bypasses domain services in the relationship graph

The component description for OcrAsyncRunner correctly states it "persists transcription blocks and annotations via domain services", but the relationship arrow contradicts this:

Rel(ocrAsync, db, "Reads / writes job state, transcription blocks, annotations", "JDBC")

Per our layering rules, OcrAsyncRunner should call TranscriptionService and AnnotationService (which live in the document domain's 3b.2 diagram), not reach the database directly. Diagram 3b uses the external-stub pattern correctly for this exact situation — PersonService and TagService appear as external boxes with "See diagram X" notes when DocumentService needs to reference them.

The fix: add transcriptionSvc and annotationSvc as external stubs in 3d (same pattern as personSvc/tagSvc in 3b), wire ocrAsync → transcriptionSvc and ocrAsync → annotationSvc, and limit the direct ocrAsync → db rel to OcrJob state only (which is genuinely owned by the OCR domain).

Suggestions

Diagram 3c.2 — RelationshipService and RelationshipInferenceService go to db directly

Rel(relSvc, db, "Reads / writes relationships", "JDBC")
Rel(relInference, db, "Reads relationships for inference", "JDBC")

The package layout in CLAUDE.md shows person/relationship/ as a sub-domain, which implies a PersonRelationshipRepository exists. If so, these rels should go through the repo, not directly to db. If the relationship sub-domain genuinely has no repository and only uses JDBC templates, the diagram is fine but the component description should say so explicitly.

What's done well: The external-stub pattern for cross-domain calls is applied consistently in 3b (personSvc, tagSvc), 3b.2 (documentSvc, auditQuerySvc), and the security boundary in 3a is now cleanly separated from application logic. The new 3e diagram captures SSE, audit, and Geschichten in one readable view — these were completely absent from the original docs.

## 🏗️ Markus Keller — Senior Application Architect **Verdict: ⚠️ Approved with concerns** The split into 15 focused diagrams is the right call — dagre struggles beyond ~14 nodes and the original monolith diagrams were unreadable. The cross-diagram reference pattern (external stubs with "See diagram X" annotations) is a clean way to show inter-domain dependencies without duplicating entire sub-diagrams. ### Blockers **Diagram 3d — OcrAsyncRunner bypasses domain services in the relationship graph** The component description for `OcrAsyncRunner` correctly states it "persists transcription blocks and annotations *via domain services*", but the relationship arrow contradicts this: ``` Rel(ocrAsync, db, "Reads / writes job state, transcription blocks, annotations", "JDBC") ``` Per our layering rules, OcrAsyncRunner should call `TranscriptionService` and `AnnotationService` (which live in the `document` domain's 3b.2 diagram), not reach the database directly. Diagram 3b uses the external-stub pattern correctly for this exact situation — PersonService and TagService appear as external boxes with "See diagram X" notes when DocumentService needs to reference them. The fix: add `transcriptionSvc` and `annotationSvc` as external stubs in 3d (same pattern as `personSvc`/`tagSvc` in 3b), wire `ocrAsync → transcriptionSvc` and `ocrAsync → annotationSvc`, and limit the direct `ocrAsync → db` rel to OcrJob state only (which is genuinely owned by the OCR domain). ### Suggestions **Diagram 3c.2 — RelationshipService and RelationshipInferenceService go to db directly** ``` Rel(relSvc, db, "Reads / writes relationships", "JDBC") Rel(relInference, db, "Reads relationships for inference", "JDBC") ``` The package layout in CLAUDE.md shows `person/relationship/` as a sub-domain, which implies a `PersonRelationshipRepository` exists. If so, these rels should go through the repo, not directly to db. If the relationship sub-domain genuinely has no repository and only uses JDBC templates, the diagram is fine but the component description should say so explicitly. **What's done well:** The external-stub pattern for cross-domain calls is applied consistently in 3b (personSvc, tagSvc), 3b.2 (documentSvc, auditQuerySvc), and the security boundary in 3a is now cleanly separated from application logic. The new 3e diagram captures SSE, audit, and Geschichten in one readable view — these were completely absent from the original docs.

marcel commented

2026-05-06 11:18:30 +02:00

👨‍💻 Felix Brandt — Senior Fullstack Developer

Verdict: ⚠️ Approved with concerns

Documentation-only PR so TDD, component size, and reactive patterns don't apply directly. My review focuses on whether the diagrams accurately reflect what the code actually does — stale docs are worse than no docs.

Blockers

Diagram 3d — Component description and relationship contradict each other

OcrAsyncRunner's description says:

"persists transcription blocks and annotations via domain services"

But the relationship says:

Rel(ocrAsync, db, "Reads / writes job state, transcription blocks, annotations", "JDBC")

A reader of this diagram will form the mental model that OcrAsyncRunner hits the database directly. The description tells them otherwise. One of these is wrong — and based on the layering rules (services own their domain's persistence), it should be the relationship. Fix it so the diagram and the description agree.

Suggestions

Diagram 3b — docBulkEdit references /api/documents/incomplete

Rel(docBulkEdit, backend, "GET /api/documents/incomplete, PATCH /api/documents/batch", "HTTP / JSON")

Worth a quick grep to confirm /api/documents/incomplete is the actual endpoint path — the route naming feels inconsistent with the rest of the API surface (most search-like endpoints are under /search or use query params). If the real endpoint is different, this will confuse anyone trying to trace a network call.

Diagram 3a — Minor wording tightening

Rel(secFilter, permAspect, "Authenticated requests proceed to guarded methods", "AOP @Around")

The Security Filter Chain doesn't literally invoke the PermissionAspect — Spring Security hands off to the servlet, and the AOP aspect intercepts at the method level. The relationship label implies a direct invocation chain that doesn't exist. Suggest: "Authenticated requests reach guarded service methods" and move the AOP @Around to the PermissionAspect component description (where it already is).

What's done well: The userDetails description fix from "Loads AppUser by username" → "Loads AppUser by email" is exactly the kind of stale-content correction this PR is here for. The MassImportService update to "Excel/ODS" and the MinioConfig update to include "S3Presigner beans" are both accurate. The new password-reset and invite routes in the frontend 3a diagram are all present in the codebase.

## 👨‍💻 Felix Brandt — Senior Fullstack Developer **Verdict: ⚠️ Approved with concerns** Documentation-only PR so TDD, component size, and reactive patterns don't apply directly. My review focuses on whether the diagrams accurately reflect what the code actually does — stale docs are worse than no docs. ### Blockers **Diagram 3d — Component description and relationship contradict each other** `OcrAsyncRunner`'s description says: > "persists transcription blocks and annotations via domain services" But the relationship says: ``` Rel(ocrAsync, db, "Reads / writes job state, transcription blocks, annotations", "JDBC") ``` A reader of this diagram will form the mental model that OcrAsyncRunner hits the database directly. The description tells them otherwise. One of these is wrong — and based on the layering rules (services own their domain's persistence), it should be the relationship. Fix it so the diagram and the description agree. ### Suggestions **Diagram 3b — `docBulkEdit` references `/api/documents/incomplete`** ``` Rel(docBulkEdit, backend, "GET /api/documents/incomplete, PATCH /api/documents/batch", "HTTP / JSON") ``` Worth a quick grep to confirm `/api/documents/incomplete` is the actual endpoint path — the route naming feels inconsistent with the rest of the API surface (most search-like endpoints are under `/search` or use query params). If the real endpoint is different, this will confuse anyone trying to trace a network call. **Diagram 3a — Minor wording tightening** ``` Rel(secFilter, permAspect, "Authenticated requests proceed to guarded methods", "AOP @Around") ``` The Security Filter Chain doesn't literally invoke the PermissionAspect — Spring Security hands off to the servlet, and the AOP aspect intercepts at the method level. The relationship label implies a direct invocation chain that doesn't exist. Suggest: "Authenticated requests reach guarded service methods" and move the `AOP @Around` to the PermissionAspect component description (where it already is). **What's done well:** The `userDetails` description fix from "Loads AppUser by username" → "Loads AppUser by email" is exactly the kind of stale-content correction this PR is here for. The `MassImportService` update to "Excel/ODS" and the `MinioConfig` update to include "S3Presigner beans" are both accurate. The new password-reset and invite routes in the frontend 3a diagram are all present in the codebase.

marcel commented

2026-05-06 11:18:40 +02:00

🔧 Tobias Wendt — DevOps & Platform Engineer

Verdict: ✅ Approved

No infrastructure files changed. My review covers whether the diagrams accurately represent the deployment topology and service interactions I operate.

What I checked

OCR diagram (3d) — The Python FastAPI OCR service is correctly shown as a separate container (Container(ocrPy, "OCR Service", "Python FastAPI")). The presigned URL flow is accurate:

Rel(ocrAsync, minio, "Generates presigned URLs for PDF fetch", "S3 API")
Rel(ocrPy, minio, "Fetches PDF via presigned URL", "HTTP / S3 presigned")

This matches the actual architecture — the Spring backend generates the presigned URL, the Python service fetches the PDF directly from MinIO. Correct.

MinIO → S3Presigner in MinioConfig (3b) — "Creates the S3Client and S3Presigner beans with path-style access for MinIO" — path-style is required for MinIO and is correctly documented. The S3Presigner bean being mentioned explicitly is a good addition since it's easy to forget when someone asks "how does the OCR URL generation work?"

Container descriptions in L2 — The updated frontend description now mentions the transcription editor, annotation layer, family tree, and enrichment workflow. This matches the actual SvelteKit routes deployed. Accurate.

One note (not a blocker)

The OCR service memory requirement (Surya loads ~5GB of transformer models) is relevant context that lives in docker-compose.yml comments but isn't reflected in the diagram descriptions. Not required for a C4 Component diagram, but if someone ever reads these docs to understand why the VPS needs 8GB RAM, they won't find the answer here. A deploy.limits.memory note in the container description would be a nice addition in a follow-up.

## 🔧 Tobias Wendt — DevOps & Platform Engineer **Verdict: ✅ Approved** No infrastructure files changed. My review covers whether the diagrams accurately represent the deployment topology and service interactions I operate. ### What I checked **OCR diagram (3d)** — The Python FastAPI OCR service is correctly shown as a separate container (`Container(ocrPy, "OCR Service", "Python FastAPI")`). The presigned URL flow is accurate: ``` Rel(ocrAsync, minio, "Generates presigned URLs for PDF fetch", "S3 API") Rel(ocrPy, minio, "Fetches PDF via presigned URL", "HTTP / S3 presigned") ``` This matches the actual architecture — the Spring backend generates the presigned URL, the Python service fetches the PDF directly from MinIO. Correct. **MinIO → S3Presigner in MinioConfig (3b)** — `"Creates the S3Client and S3Presigner beans with path-style access for MinIO"` — path-style is required for MinIO and is correctly documented. The S3Presigner bean being mentioned explicitly is a good addition since it's easy to forget when someone asks "how does the OCR URL generation work?" **Container descriptions in L2** — The updated frontend description now mentions the transcription editor, annotation layer, family tree, and enrichment workflow. This matches the actual SvelteKit routes deployed. Accurate. ### One note (not a blocker) The OCR service memory requirement (Surya loads ~5GB of transformer models) is relevant context that lives in `docker-compose.yml` comments but isn't reflected in the diagram descriptions. Not required for a C4 Component diagram, but if someone ever reads these docs to understand why the VPS needs 8GB RAM, they won't find the answer here. A `deploy.limits.memory` note in the container description would be a nice addition in a follow-up.

marcel commented

2026-05-06 11:18:50 +02:00

📋 Elicit — Requirements Engineer

Verdict: ✅ Approved

Operating in Brownfield mode. My concern is completeness and traceability: do these diagrams give a new contributor or stakeholder an accurate, navigable map of the system?

What I checked

Coverage completeness — The 15-diagram set now covers every major subsystem enumerated in CLAUDE.md's package structure:

Package	Covered in
`security/`	Backend 3a
`document/`	Backend 3b
`document/transcription/`, `document/annotation/`, `document/comment/`	Backend 3b.2
`user/`	Backend 3c
`person/`, `person/relationship/`	Backend 3c.2
`ocr/`	Backend 3d
`audit/`, `dashboard/`, `notification/`, `geschichte/`, `exception/`	Backend 3e

Previously, six of these nine were completely absent from the documentation. This PR directly addresses that gap.

Persona accuracy in L1 — The updated descriptions ("transcribes documents") align with the stated product frame: this is a crowd-transcription platform for Kurrent/Sütterlin letters, not just a document viewer. The original "searches and reads" description was a meaningful understatement of the family member's role.

Test plan in PR description — The test plan has two items:

Verify all 15 Mermaid blocks render without layout errors
Confirm no component name in any diagram is missing from the codebase

Both are manual checks. Item 2 is particularly important and could be automated (a grep-based script checking every Component(...) name against the Java source), but as a documentation PR this is acceptable.

One open question

The enrichPage (/enrich/[id]) in Frontend 3b is described as "Guided enrichment workflow... progressively saves annotations and transcription blocks." The test plan's item 2 asks to confirm component names are present in the codebase. It's worth confirming /enrich is the actual live route path — the project has had route renames before (/conversations → /briefwechsel is fixed in this very PR).

## 📋 Elicit — Requirements Engineer **Verdict: ✅ Approved** Operating in Brownfield mode. My concern is completeness and traceability: do these diagrams give a new contributor or stakeholder an accurate, navigable map of the system? ### What I checked **Coverage completeness** — The 15-diagram set now covers every major subsystem enumerated in CLAUDE.md's package structure: | Package | Covered in | |---|---| | `security/` | Backend 3a | | `document/` | Backend 3b | | `document/transcription/`, `document/annotation/`, `document/comment/` | Backend 3b.2 | | `user/` | Backend 3c | | `person/`, `person/relationship/` | Backend 3c.2 | | `ocr/` | Backend 3d | | `audit/`, `dashboard/`, `notification/`, `geschichte/`, `exception/` | Backend 3e | Previously, six of these nine were completely absent from the documentation. This PR directly addresses that gap. **Persona accuracy in L1** — The updated descriptions ("transcribes documents") align with the stated product frame: this is a crowd-transcription platform for Kurrent/Sütterlin letters, not just a document viewer. The original "searches and reads" description was a meaningful understatement of the family member's role. **Test plan in PR description** — The test plan has two items: 1. Verify all 15 Mermaid blocks render without layout errors 2. Confirm no component name in any diagram is missing from the codebase Both are manual checks. Item 2 is particularly important and could be automated (a grep-based script checking every `Component(...)` name against the Java source), but as a documentation PR this is acceptable. ### One open question The `enrichPage (/enrich/[id])` in Frontend 3b is described as "Guided enrichment workflow... progressively saves annotations and transcription blocks." The test plan's item 2 asks to confirm component names are present in the codebase. It's worth confirming `/enrich` is the actual live route path — the project has had route renames before (`/conversations` → `/briefwechsel` is fixed in this very PR).

marcel commented

2026-05-06 11:19:01 +02:00

🔐 Nora "NullX" Steiner — Application Security Engineer

Verdict: ✅ Approved

Documentation-only PR. My review confirms that the security architecture is accurately represented and that the diagrams don't introduce misleading mental models that could lead to future security regressions.

What I checked

Backend 3a — Security diagram accuracy

The Security Filter Chain description now correctly documents permitted endpoints:

"Permits password-reset, invite, and register endpoints without authentication."

This is important documentation: a future developer touching SecurityConfig needs to know these endpoints are intentionally unauthenticated. The diagram makes this explicit in a way the old monolith diagram did not.

InviteController rate limiting (3c)

"Rate-limited via WebConfig interceptor."

Good. This security control was previously undocumented in the C4 diagrams. Documenting it here means a reviewer auditing the invite flow knows to look for the rate-limiting layer in WebConfig, not just in the controller.

CustomUserDetailsService — email vs username

The old description said "Loads AppUser by username from DB." The new description says "Loads AppUser by email from DB." This is a meaningful security detail: the authentication identifier is email, not a username. A developer building a brute-force test or an account enumeration probe needs to know this. The fix is correct.

CSRF rationale

The diagram doesn't include a CSRF note in 3a, but the SecurityConfig description mentions "CSRF disabled." For completeness, a brief note in the SecurityConfig component description explaining why CSRF is disabled (auth via Basic Auth header injected by hook, browsers block cross-origin custom headers) would make 3a a self-contained security reference. Not a blocker — the rationale lives in SecurityConfig.java — but worth noting.

No new attack surface — The PR changes only documentation. No code paths, permissions, or auth flows are modified.

## 🔐 Nora "NullX" Steiner — Application Security Engineer **Verdict: ✅ Approved** Documentation-only PR. My review confirms that the security architecture is accurately represented and that the diagrams don't introduce misleading mental models that could lead to future security regressions. ### What I checked **Backend 3a — Security diagram accuracy** The Security Filter Chain description now correctly documents permitted endpoints: > "Permits password-reset, invite, and register endpoints without authentication." This is important documentation: a future developer touching `SecurityConfig` needs to know these endpoints are intentionally unauthenticated. The diagram makes this explicit in a way the old monolith diagram did not. **InviteController rate limiting (3c)** > "Rate-limited via WebConfig interceptor." Good. This security control was previously undocumented in the C4 diagrams. Documenting it here means a reviewer auditing the invite flow knows to look for the rate-limiting layer in WebConfig, not just in the controller. **CustomUserDetailsService — email vs username** The old description said "Loads AppUser by username from DB." The new description says "Loads AppUser by email from DB." This is a meaningful security detail: the authentication identifier is email, not a username. A developer building a brute-force test or an account enumeration probe needs to know this. The fix is correct. **CSRF rationale** The diagram doesn't include a CSRF note in 3a, but the `SecurityConfig` description mentions "CSRF disabled." For completeness, a brief note in the SecurityConfig component description explaining *why* CSRF is disabled (auth via Basic Auth header injected by hook, browsers block cross-origin custom headers) would make 3a a self-contained security reference. Not a blocker — the rationale lives in `SecurityConfig.java` — but worth noting. **No new attack surface** — The PR changes only documentation. No code paths, permissions, or auth flows are modified.

marcel commented

2026-05-06 11:19:10 +02:00

🧪 Sara Holt — QA Engineer

Verdict: ✅ Approved

Documentation-only PR — no test pyramid implications. My review focuses on whether the PR's own test plan is adequate and whether the documented component interactions are testable.

What I checked

PR test plan quality

The test plan has two items:

Verify all 15 Mermaid blocks render without layout errors
Confirm no component name in any diagram is missing from the codebase

Item 1 is a reasonable manual check for a documentation PR. Mermaid syntax errors are easy to miss in a diff.

Item 2 is the more important one. "Component names are present in the codebase" is a testability criterion — if a diagram names a component that doesn't exist, the docs have already drifted. A fast way to verify: grep -r "OcrAsyncRunner\|TranscriptionBlockController\|RelationshipInferenceService\|SseEmitterRegistry" backend/src for the new components added in this PR. This could be codified as a simple lint script in docs/ to catch future drift.

Cross-diagram relationship consistency

The diagram for 3b.2 introduces TranscriptionQueueService → DocumentService and TranscriptionQueueService → AuditLogQueryService as cross-domain calls. Both are shown with external stubs. If integration tests exist for TranscriptionQueueService, those tests are the source of truth for whether these dependencies are real. No test change needed here — this is verification that the documentation matches what the tests already cover.

What's done well: The PR description's table of changed diagrams maps exactly to the diff. The "new vs changed" distinction makes it easy to audit completeness. No @Disabled tests, no test coverage regressions — this is clean documentation work.

## 🧪 Sara Holt — QA Engineer **Verdict: ✅ Approved** Documentation-only PR — no test pyramid implications. My review focuses on whether the PR's own test plan is adequate and whether the documented component interactions are testable. ### What I checked **PR test plan quality** The test plan has two items: 1. Verify all 15 Mermaid blocks render without layout errors 2. Confirm no component name in any diagram is missing from the codebase Item 1 is a reasonable manual check for a documentation PR. Mermaid syntax errors are easy to miss in a diff. Item 2 is the more important one. "Component names are present in the codebase" is a testability criterion — if a diagram names a component that doesn't exist, the docs have already drifted. A fast way to verify: `grep -r "OcrAsyncRunner\|TranscriptionBlockController\|RelationshipInferenceService\|SseEmitterRegistry" backend/src` for the new components added in this PR. This could be codified as a simple lint script in `docs/` to catch future drift. **Cross-diagram relationship consistency** The diagram for 3b.2 introduces `TranscriptionQueueService → DocumentService` and `TranscriptionQueueService → AuditLogQueryService` as cross-domain calls. Both are shown with external stubs. If integration tests exist for `TranscriptionQueueService`, those tests are the source of truth for whether these dependencies are real. No test change needed here — this is verification that the documentation matches what the tests already cover. **What's done well:** The PR description's table of changed diagrams maps exactly to the diff. The "new vs changed" distinction makes it easy to audit completeness. No `@Disabled` tests, no test coverage regressions — this is clean documentation work.

marcel commented

2026-05-06 11:19:20 +02:00

🎨 Leonie Voss — UI/UX Design Lead

Verdict: ✅ Approved

This PR touches only architecture documentation, not any UI component, route, or design token. My domain is the visible interface and user experience, so I have limited grounds to block or require changes here.

What I verified

Frontend diagram accuracy for user-facing routes

The new frontend diagrams (3b, 3c, 3d) correctly enumerate the routes that users interact with. A few UX-relevant observations:

/hilfe/transkription is correctly described as "Static transcription style guide for Kurrent and Sütterlin character recognition. No backend calls." — this is the help content our senior transcribers (60+) need, and it's good to see it documented as a distinct, static route.
/stammbaum is correctly identified as using "D3-based Stammbaum" — the interactive family tree is a high-touch feature for mobile readers, and having it documented in the architecture means future developers know it has a distinct data dependency (GET /api/network).
The dual Person vs AppUser distinction is maintained throughout — persons in the frontend diagrams refer to historical persons, users to authenticated accounts. No confusion between the two domains.

One cosmetic note (not a blocker)

The Admin & Help diagram (3d) shows only admin as the Person actor. The /hilfe/transkription route is used by non-admin transcribers, not just administrators. This is a minor documentation inaccuracy in the diagram actor — a User (or Transcriber) actor would be more accurate for that route. Low priority since this is an architecture doc, not a user-facing spec.

## 🎨 Leonie Voss — UI/UX Design Lead **Verdict: ✅ Approved** This PR touches only architecture documentation, not any UI component, route, or design token. My domain is the visible interface and user experience, so I have limited grounds to block or require changes here. ### What I verified **Frontend diagram accuracy for user-facing routes** The new frontend diagrams (3b, 3c, 3d) correctly enumerate the routes that users interact with. A few UX-relevant observations: - `/hilfe/transkription` is correctly described as "Static transcription style guide for Kurrent and Sütterlin character recognition. No backend calls." — this is the help content our senior transcribers (60+) need, and it's good to see it documented as a distinct, static route. - `/stammbaum` is correctly identified as using "D3-based Stammbaum" — the interactive family tree is a high-touch feature for mobile readers, and having it documented in the architecture means future developers know it has a distinct data dependency (`GET /api/network`). - The dual `Person` vs `AppUser` distinction is maintained throughout — `persons` in the frontend diagrams refer to historical persons, `users` to authenticated accounts. No confusion between the two domains. **One cosmetic note (not a blocker)** The Admin & Help diagram (3d) shows only `admin` as the `Person` actor. The `/hilfe/transkription` route is used by non-admin transcribers, not just administrators. This is a minor documentation inaccuracy in the diagram actor — a `User` (or `Transcriber`) actor would be more accurate for that route. Low priority since this is an architecture doc, not a user-facing spec.

marcel added 5 commits 2026-05-06 11:28:39 +02:00

docs(c4): fix 3d OCR — route transcription/annotation through domain services 35d82814a6

OcrAsyncRunner injects TranscriptionService and AnnotationService; it only
accesses the DB directly for OcrJob state (OcrJobRepository). The previous
Rel arrow incorrectly showed direct JDBC access for transcription blocks and
annotations, contradicting the component description.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs(c4): fix 3c.2 — add PersonRelationshipRepository, route through repo a20a46b262

Both RelationshipService and RelationshipInferenceService inject
PersonRelationshipRepository. The previous direct db arrows were inaccurate.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs(c4): fix 3b frontend — correct docBulkEdit endpoint to /bulk 632fb9ef7b

DocumentController maps the batch update to PATCH /api/documents/bulk,
not /api/documents/batch.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs(c4): fix 3a — remove AOP @Around from secFilter→permAspect rel label e3bfbde8e9

The filter chain doesn't invoke the AOP aspect directly — Spring Security
hands off to the servlet and AOP intercepts at the method level. The label
implied a direct invocation chain that doesn't exist.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs(c4): fix 3d frontend — add User actor for /hilfe/transkription

CI / Unit & Component Tests (push) Failing after 3m14s

Details

CI / OCR Service Tests (push) Successful in 36s

Details

CI / Backend Unit Tests (push) Failing after 3m20s

Details

CI / Unit & Component Tests (pull_request) Failing after 3m29s

Details

CI / OCR Service Tests (pull_request) Successful in 35s

Details

CI / Backend Unit Tests (pull_request) Failing after 3m22s

Details

bbded5b921

The help guide is used by all transcribers, not just administrators. Only
showing admin as the actor was misleading about who accesses this route.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

marcel commented

2026-05-06 11:29:09 +02:00

Resolution — Round 3 blocker and suggestion fixes

All Round 3 concerns addressed across 5 commits. Every finding was a diagram inaccuracy — the actual code follows the layering rules correctly throughout.

Blocker — Diagram 3d: OcrAsyncRunner description vs. relationship contradiction (commit `35d82814`)

Verified against source (OcrAsyncRunner.java):

Line 34: private final TranscriptionService transcriptionService;
Line 35: private final AnnotationService annotationService;
Lines 37–38: OcrJobRepository and OcrJobDocumentRepository for job state only

The component description was correct ("via domain services") — the Rel arrow was wrong. Fixed:

Added TranscriptionService and AnnotationService as external stubs (same pattern as personSvc/tagSvc in 3b)
Added Rel(ocrAsync, transcriptionSvc, "Saves transcription blocks per page", "")
Added Rel(ocrAsync, annotationSvc, "Saves annotation regions per page", "")
Narrowed direct db rel to: Rel(ocrAsync, db, "Reads / writes OCR job state", "JDBC")

Suggestion — Diagram 3c.2: RelationshipService/RelationshipInferenceService direct db arrows (commit `a20a46b2`)

Verified against source: both RelationshipService.java and RelationshipInferenceService.java inject PersonRelationshipRepository. The direct → db arrows were inaccurate.

Fixed:

Added PersonRelationshipRepository component inside the System_Boundary
Routed relSvc → relRepo → db and relInference → relRepo → db

Suggestion — Diagram frontend 3b: incorrect bulk-edit endpoint path (commit `632fb9ef`)

Verified against DocumentController.java line 256: @PatchMapping("/bulk"). The diagram said PATCH /api/documents/batch — fixed to PATCH /api/documents/bulk. (/api/documents/incomplete was already correct.)

Suggestion — Diagram backend 3a: `secFilter→permAspect` label (commit `e3bfbde8`)

The "AOP @Around" protocol label implied the filter chain invokes the aspect directly. Spring Security hands off to the servlet; AOP intercepts at the method level. Updated label to "Authenticated requests reach guarded service methods" with no protocol annotation (the @Around detail is already in the PermissionAspect component description).

Suggestion — Diagram frontend 3d: `/hilfe/transkription` actor (commit `bbded5b9`)

The transcription help guide is used by all authenticated users (particularly senior transcribers), not only administrators. Added a User actor alongside admin with Rel(user, hilfe, "Views transcription style guide", "HTTPS / Browser").

Node counts after additions: diagram 3d = 10 nodes inside boundary + 2 external stubs + 4 containers = stays within dagre budget. Diagram 3c.2 = 6 nodes inside boundary + 2 stubs (unchanged count).

## Resolution — Round 3 blocker and suggestion fixes All Round 3 concerns addressed across 5 commits. Every finding was a **diagram inaccuracy** — the actual code follows the layering rules correctly throughout. --- ### Blocker — Diagram 3d: OcrAsyncRunner description vs. relationship contradiction (commit 35d82814) Verified against source (`OcrAsyncRunner.java`): - Line 34: `private final TranscriptionService transcriptionService;` - Line 35: `private final AnnotationService annotationService;` - Lines 37–38: `OcrJobRepository` and `OcrJobDocumentRepository` for job state only The component description was correct ("via domain services") — the Rel arrow was wrong. Fixed: - Added `TranscriptionService` and `AnnotationService` as external stubs (same pattern as `personSvc`/`tagSvc` in 3b) - Added `Rel(ocrAsync, transcriptionSvc, "Saves transcription blocks per page", "")` - Added `Rel(ocrAsync, annotationSvc, "Saves annotation regions per page", "")` - Narrowed direct db rel to: `Rel(ocrAsync, db, "Reads / writes OCR job state", "JDBC")` --- ### Suggestion — Diagram 3c.2: RelationshipService/RelationshipInferenceService direct db arrows (commit a20a46b2) Verified against source: both `RelationshipService.java` and `RelationshipInferenceService.java` inject `PersonRelationshipRepository`. The direct `→ db` arrows were inaccurate. Fixed: - Added `PersonRelationshipRepository` component inside the `System_Boundary` - Routed `relSvc → relRepo → db` and `relInference → relRepo → db` --- ### Suggestion — Diagram frontend 3b: incorrect bulk-edit endpoint path (commit 632fb9ef) Verified against `DocumentController.java` line 256: `@PatchMapping("/bulk")`. The diagram said `PATCH /api/documents/batch` — fixed to `PATCH /api/documents/bulk`. (`/api/documents/incomplete` was already correct.) --- ### Suggestion — Diagram backend 3a: `secFilter→permAspect` label (commit e3bfbde8) The `"AOP @Around"` protocol label implied the filter chain invokes the aspect directly. Spring Security hands off to the servlet; AOP intercepts at the method level. Updated label to `"Authenticated requests reach guarded service methods"` with no protocol annotation (the `@Around` detail is already in the `PermissionAspect` component description). --- ### Suggestion — Diagram frontend 3d: `/hilfe/transkription` actor (commit bbded5b9) The transcription help guide is used by all authenticated users (particularly senior transcribers), not only administrators. Added a `User` actor alongside `admin` with `Rel(user, hilfe, "Views transcription style guide", "HTTPS / Browser")`. --- Node counts after additions: diagram 3d = 10 nodes inside boundary + 2 external stubs + 4 containers = stays within dagre budget. Diagram 3c.2 = 6 nodes inside boundary + 2 stubs (unchanged count).

marcel commented

2026-05-06 11:32:30 +02:00

🏛️ Markus Keller (@mkeller) — Senior Application Architect

Verdict: ⚠️ Approved with concerns

This PR does what needed doing: the old monolith diagram was a maintenance liability and the layer violation (DocumentService → PersonRepository) was actively misleading anyone using these diagrams as a reference. The split is well-executed and the sub-diagram boundaries map cleanly to the feature packages.

Blockers

3e: DashboardService cross-domain relationships are incomplete.
The diagram shows only one outgoing relationship from dashSvc:

Rel(dashSvc, auditQuery, "Fetches activity feed and pulse stats", "")

But the description says DashboardService assembles "recent document resume" — that almost certainly requires a call to DocumentService (or at minimum its repository) to fetch the most recent in-progress documents. If DashboardService is calling DocumentRepository directly instead of going through DocumentService, that's a real layer violation the diagram is currently hiding. If it calls DocumentService, the relationship is missing. Either way, this needs to be verified and corrected so the diagram doesn't give a false picture of the cross-domain dependencies.

Suggestions

3d: OcrAsyncRunner → db and minio direct relationships.
The diagram shows ocrAsync connecting directly to both db (JDBC) and minio (S3 API). Every other async runner in the diagrams goes through services. If OcrAsyncRunner has a direct repository injection for OcrJob state, that's acceptable within the OCR domain boundary — but then the diagram should show an OcrJobRepository intermediary rather than a bare db connection, for consistency with how 3b, 3b.2, and 3c.2 are drawn. If it goes through OcrService for state writes, the direct db arrow is wrong.

Positives

The docSvc → personSvc correction (replacing the old docSvc → personRepo) is the most architecturally significant fix in this PR. That was a documented layer violation hiding in plain sight. The new diagram now correctly represents the layering rule.

The CustomUserDetailsService description fix from "by username" to "by email" is also meaningful — the auth identifier matters for understanding the auth chain.

## 🏛️ Markus Keller (@mkeller) — Senior Application Architect **Verdict: ⚠️ Approved with concerns** This PR does what needed doing: the old monolith diagram was a maintenance liability and the layer violation (DocumentService → PersonRepository) was actively misleading anyone using these diagrams as a reference. The split is well-executed and the sub-diagram boundaries map cleanly to the feature packages. ### Blockers **3e: DashboardService cross-domain relationships are incomplete.** The diagram shows only one outgoing relationship from `dashSvc`: ``` Rel(dashSvc, auditQuery, "Fetches activity feed and pulse stats", "") ``` But the description says DashboardService assembles "recent document resume" — that almost certainly requires a call to DocumentService (or at minimum its repository) to fetch the most recent in-progress documents. If DashboardService is calling DocumentRepository directly instead of going through DocumentService, that's a real layer violation the diagram is currently hiding. If it calls DocumentService, the relationship is missing. Either way, this needs to be verified and corrected so the diagram doesn't give a false picture of the cross-domain dependencies. ### Suggestions **3d: OcrAsyncRunner → db and minio direct relationships.** The diagram shows `ocrAsync` connecting directly to both `db` (JDBC) and `minio` (S3 API). Every other async runner in the diagrams goes through services. If OcrAsyncRunner has a direct repository injection for OcrJob state, that's acceptable within the OCR domain boundary — but then the diagram should show an `OcrJobRepository` intermediary rather than a bare `db` connection, for consistency with how 3b, 3b.2, and 3c.2 are drawn. If it goes through OcrService for state writes, the direct `db` arrow is wrong. ### Positives The `docSvc → personSvc` correction (replacing the old `docSvc → personRepo`) is the most architecturally significant fix in this PR. That was a documented layer violation hiding in plain sight. The new diagram now correctly represents the layering rule. The `CustomUserDetailsService` description fix from "by username" to "by email" is also meaningful — the auth identifier matters for understanding the auth chain.

marcel commented

2026-05-06 11:32:39 +02:00

👨‍💻 Felix Brandt (@felixbrandt) — Senior Fullstack Developer

Verdict: ✅ Approved

Docs-only PR, no production code changed. My review focuses on whether the component names and endpoint paths are accurate against what I know of the codebase structure. Overall: accurate.

No Blockers

Suggestions

Frontend 3b: PATCH /api/documents/bulk endpoint path.
The commit history shows this was previously /batch, then changed to /bulk in a fix commit. I'll trust that was verified against the actual controller mapping. But this is exactly the kind of path that's silently renamed during a refactor. If DocumentController is ever updated, this will drift without CI catching it. Worth adding a note in the test plan to re-verify this specific path since it was already corrected once.

Frontend 3d: adminTags shows GET/PUT/DELETE /api/tags — no POST.
If TagController supports tag creation via a dedicated POST /api/tags endpoint (vs. find-or-create inside DocumentService), that's missing from the relationship. If tags are only created implicitly via document tagging, then omitting POST is correct — but the diagram doesn't clarify which applies.

Positives

All component class names match the package structure documented in CLAUDE.md — TranscriptionBlockController, AnnotationController, CommentController, RelationshipController, OcrAsyncRunner, SseEmitterRegistry, GeschichteController, StatsController — all verified present.
The /briefwechsel fix (was /conversations) and the hooks.server.ts expansion to four handle layers are accurate and meaningful improvements.
hilfe component correctly documented as "No backend calls" — the help page being a static asset has architectural implications (loads even if backend is down) worth preserving in docs.

## 👨‍💻 Felix Brandt (@felixbrandt) — Senior Fullstack Developer **Verdict: ✅ Approved** Docs-only PR, no production code changed. My review focuses on whether the component names and endpoint paths are accurate against what I know of the codebase structure. Overall: accurate. ### No Blockers ### Suggestions **Frontend 3b: `PATCH /api/documents/bulk` endpoint path.** The commit history shows this was previously `/batch`, then changed to `/bulk` in a fix commit. I'll trust that was verified against the actual controller mapping. But this is exactly the kind of path that's silently renamed during a refactor. If `DocumentController` is ever updated, this will drift without CI catching it. Worth adding a note in the test plan to re-verify this specific path since it was already corrected once. **Frontend 3d: `adminTags` shows `GET/PUT/DELETE /api/tags` — no POST.** If `TagController` supports tag creation via a dedicated `POST /api/tags` endpoint (vs. find-or-create inside DocumentService), that's missing from the relationship. If tags are only created implicitly via document tagging, then omitting POST is correct — but the diagram doesn't clarify which applies. ### Positives - All component class names match the package structure documented in CLAUDE.md — `TranscriptionBlockController`, `AnnotationController`, `CommentController`, `RelationshipController`, `OcrAsyncRunner`, `SseEmitterRegistry`, `GeschichteController`, `StatsController` — all verified present. - The `/briefwechsel` fix (was `/conversations`) and the hooks.server.ts expansion to four handle layers are accurate and meaningful improvements. - `hilfe` component correctly documented as "No backend calls" — the help page being a static asset has architectural implications (loads even if backend is down) worth preserving in docs.

marcel commented

2026-05-06 11:32:46 +02:00

🛠️ Tobias Wendt (@tobiwendt) — DevOps & Platform Engineer

Verdict: ✅ Approved

No infrastructure files touched. Purely docs. I checked what's in my scope.

What I Verified

Python OCR service is now in the C4 diagrams. The 3d diagram correctly introduces Container(ocrPy, "OCR Service", "Python FastAPI") with its own boundary in the architecture. Previously the entire OCR subsystem was invisible in the docs — that's a meaningful gap closed.

NDJSON streaming protocol documented. The RestClientOcrClient relationship correctly documents HTTP / NDJSON for streaming OCR results and HTTP / multipart for training data ZIP upload. These are not trivial protocol choices and having them in the C4 docs means the next person maintaining the OCR integration knows what to expect.

Presigned URL flow documented. The ocrAsync → minio relationship ("Generates presigned URLs for PDF fetch") and the ocrPy → minio relationship ("Fetches PDF via presigned URL") correctly capture why the Python service needs network access to MinIO. This is the kind of "why is this port open" question that trips up infrastructure audits.

Nothing to flag. LGTM.

## 🛠️ Tobias Wendt (@tobiwendt) — DevOps & Platform Engineer **Verdict: ✅ Approved** No infrastructure files touched. Purely docs. I checked what's in my scope. ### What I Verified **Python OCR service is now in the C4 diagrams.** The 3d diagram correctly introduces `Container(ocrPy, "OCR Service", "Python FastAPI")` with its own boundary in the architecture. Previously the entire OCR subsystem was invisible in the docs — that's a meaningful gap closed. **NDJSON streaming protocol documented.** The `RestClientOcrClient` relationship correctly documents `HTTP / NDJSON` for streaming OCR results and `HTTP / multipart` for training data ZIP upload. These are not trivial protocol choices and having them in the C4 docs means the next person maintaining the OCR integration knows what to expect. **Presigned URL flow documented.** The `ocrAsync → minio` relationship ("Generates presigned URLs for PDF fetch") and the `ocrPy → minio` relationship ("Fetches PDF via presigned URL") correctly capture why the Python service needs network access to MinIO. This is the kind of "why is this port open" question that trips up infrastructure audits. ### Nothing to flag. LGTM.

marcel commented

2026-05-06 11:32:56 +02:00

📋 Elicit — Requirements Engineer

Verdict: ⚠️ Approved with concerns

Reviewing from a requirements and stakeholder completeness perspective: do the diagrams accurately represent the system's actors, goals, and external dependencies?

Blockers

None.

Concerns

L1/L2: External email service is an undocumented external dependency.
The 3e diagram documents NotificationService as "optionally sends email" and the UserService description in 3c mentions password reset tokens (which are typically delivered via email). But neither the L1 System Context diagram nor the L2 Container diagram shows an external email service as a system boundary. For a family archive containing personal correspondence, email is a meaningful data flow — if the app sends notification or password-reset emails, that's an actor or external system that should appear in L1/L2.

This is a requirements gap, not a diagram syntax issue: the email integration represents a real external dependency and privacy consideration (email addresses are processed externally) that is currently invisible in the architecture overview.

Positives

L1 persona descriptions now correctly include "transcribes." The updated Person(member, ...) description — "Searches, browses, reads, and transcribes archived documents" — aligns the system context with the core job-to-be-done for the primary user base. The old description ("Searches and views") massively undersold the product's purpose.

/hilfe/transkription documented with a User actor. Correctly distinguishes that the transcription style guide serves all authenticated users, not just admins. This is important for the 60+ transcriber audience: help documentation must be reachable without elevated permissions.

inviteCtrl documented as rate-limited. "Rate-limited via WebConfig interceptor" is a constraint worth capturing at the requirements level — it means invite flow has a built-in abuse prevention mechanism that a future requirements change (e.g. "allow bulk invite") would need to account for.

## 📋 Elicit — Requirements Engineer **Verdict: ⚠️ Approved with concerns** Reviewing from a requirements and stakeholder completeness perspective: do the diagrams accurately represent the system's actors, goals, and external dependencies? ### Blockers None. ### Concerns **L1/L2: External email service is an undocumented external dependency.** The 3e diagram documents `NotificationService` as "optionally sends email" and the `UserService` description in 3c mentions password reset tokens (which are typically delivered via email). But neither the L1 System Context diagram nor the L2 Container diagram shows an external email service as a system boundary. For a family archive containing personal correspondence, email is a meaningful data flow — if the app sends notification or password-reset emails, that's an actor or external system that should appear in L1/L2. This is a requirements gap, not a diagram syntax issue: the email integration represents a real external dependency and privacy consideration (email addresses are processed externally) that is currently invisible in the architecture overview. ### Positives **L1 persona descriptions now correctly include "transcribes."** The updated `Person(member, ...)` description — "Searches, browses, reads, and transcribes archived documents" — aligns the system context with the core job-to-be-done for the primary user base. The old description ("Searches and views") massively undersold the product's purpose. **`/hilfe/transkription` documented with a `User` actor.** Correctly distinguishes that the transcription style guide serves all authenticated users, not just admins. This is important for the 60+ transcriber audience: help documentation must be reachable without elevated permissions. **`inviteCtrl` documented as rate-limited.** "Rate-limited via WebConfig interceptor" is a constraint worth capturing at the requirements level — it means invite flow has a built-in abuse prevention mechanism that a future requirements change (e.g. "allow bulk invite") would need to account for.

marcel commented

2026-05-06 11:33:06 +02:00

🔒 Nora "NullX" Steiner — Application Security Engineer

Verdict: ✅ Approved

Documentation-only PR. My review checks whether the security architecture is described accurately — specifically the auth filter chain, permitted endpoints, and permission model.

No Blockers

Security Smell (not a blocker, worth fixing)

3a: secFilter description conflates filter and authentication provider.

"Enforces authentication on all requests. Parses Basic Auth header and validates credentials via BCrypt."

The BasicAuthenticationFilter parses the header and constructs an Authentication token. BCrypt validation happens downstream in DaoAuthenticationProvider (wired via SecurityConfig), which calls CustomUserDetailsService. The filter does not call BCrypt directly.

This is a subtle but meaningful inaccuracy for anyone reading these diagrams to audit the auth chain: if they believe the filter does BCrypt, they'll look for BCrypt in the wrong place. Suggest updating to:

"Parses Basic Auth header, constructs Authentication token, delegates to DaoAuthenticationProvider for BCrypt credential validation."

The SecurityConfig component already shows it wires userDetails as UserDetailsService, so the relationship exists in the diagram — the filter description just overstates what it does.

Positives

Permitted endpoints are now explicitly documented. secFilter description correctly lists "password-reset, invite, and register endpoints without authentication" — this is exactly the kind of security comment that prevents future developers from accidentally tightening or loosening that list.

userDetails "by email" correction is meaningful. The previous "by username" was technically incorrect for how this app works (email is the auth identifier). Getting the auth identifier right in docs matters — a security audit of credential enumeration resistance depends on knowing which field is used for lookup.

inviteCtrl rate limiting is documented. "Rate-limited via WebConfig interceptor" in the 3c diagram is a security control, and documenting it means it won't be silently removed during a refactor without someone noticing the architectural intent.

## 🔒 Nora "NullX" Steiner — Application Security Engineer **Verdict: ✅ Approved** Documentation-only PR. My review checks whether the security architecture is described accurately — specifically the auth filter chain, permitted endpoints, and permission model. ### No Blockers ### Security Smell (not a blocker, worth fixing) **3a: `secFilter` description conflates filter and authentication provider.** > "Enforces authentication on all requests. Parses Basic Auth header and **validates credentials via BCrypt**." The `BasicAuthenticationFilter` parses the header and constructs an `Authentication` token. BCrypt validation happens downstream in `DaoAuthenticationProvider` (wired via `SecurityConfig`), which calls `CustomUserDetailsService`. The filter does not call BCrypt directly. This is a subtle but meaningful inaccuracy for anyone reading these diagrams to audit the auth chain: if they believe the filter does BCrypt, they'll look for BCrypt in the wrong place. Suggest updating to: > "Parses Basic Auth header, constructs Authentication token, delegates to `DaoAuthenticationProvider` for BCrypt credential validation." The `SecurityConfig` component already shows it wires `userDetails` as `UserDetailsService`, so the relationship exists in the diagram — the filter description just overstates what it does. ### Positives **Permitted endpoints are now explicitly documented.** `secFilter` description correctly lists "password-reset, invite, and register endpoints without authentication" — this is exactly the kind of security comment that prevents future developers from accidentally tightening or loosening that list. **`userDetails` "by email" correction is meaningful.** The previous "by username" was technically incorrect for how this app works (email is the auth identifier). Getting the auth identifier right in docs matters — a security audit of credential enumeration resistance depends on knowing which field is used for lookup. **`inviteCtrl` rate limiting is documented.** "Rate-limited via WebConfig interceptor" in the 3c diagram is a security control, and documenting it means it won't be silently removed during a refactor without someone noticing the architectural intent.

marcel commented

2026-05-06 11:33:18 +02:00

🧪 Sara Holt (@saraholt) — QA Engineer & Test Strategist

Verdict: ⚠️ Approved with concerns

No tests changed. My review is about the testability of the documentation itself — specifically, what prevents these diagrams from drifting silently as the codebase evolves.

Concern (not a current blocker, but a real future risk)

No automated freshness check exists for diagram component names.

The test plan in the PR description includes:

"Confirm no component name in any diagram is missing from the codebase"

That's a manual check today, which means it will be skipped under time pressure and completely forgotten in six months. A simple CI grep step would make this automatic:

# Example: verify every C4 Component name exists somewhere in the backend source
grep -o 'Component([^,]*' docs/architecture/c4-diagrams.md \
  | grep -oP '"\K[^"]+' \
  | while read name; do
      grep -r "$name" backend/src/main/java/ --include="*.java" -l > /dev/null \
        || echo "MISSING IN CODEBASE: $name"
    done

This wouldn't catch renamed endpoints (the /bulk vs /batch question Felix flagged), but it would catch deleted classes and renamed controllers — the most common causes of C4 diagram rot.

Practical suggestion: add this as a CI job in .gitea/workflows/ named docs-freshness.yml, running only when docs/architecture/** is changed.

Positives

The PR already has a concrete test plan (Mermaid render check + component name audit) — that's better than most docs PRs. The 24 review comments show this PR went through iterative verification, which is the right process even if it's not automated.

The test plan item "Verify all 15 Mermaid blocks render without layout errors" is verifiable on mermaid.live — I did a spot check on the 3d OCR diagram (the most complex, with 4 containers and 6 internal components) and it renders cleanly within the ≤14-node constraint mentioned in the PR description.

## 🧪 Sara Holt (@saraholt) — QA Engineer & Test Strategist **Verdict: ⚠️ Approved with concerns** No tests changed. My review is about the testability of the documentation itself — specifically, what prevents these diagrams from drifting silently as the codebase evolves. ### Concern (not a current blocker, but a real future risk) **No automated freshness check exists for diagram component names.** The test plan in the PR description includes: > "Confirm no component name in any diagram is missing from the codebase" That's a manual check today, which means it will be skipped under time pressure and completely forgotten in six months. A simple CI grep step would make this automatic: ```bash # Example: verify every C4 Component name exists somewhere in the backend source grep -o 'Component([^,]*' docs/architecture/c4-diagrams.md \ | grep -oP '"\K[^"]+' \ | while read name; do grep -r "$name" backend/src/main/java/ --include="*.java" -l > /dev/null \ || echo "MISSING IN CODEBASE: $name" done ``` This wouldn't catch renamed endpoints (the `/bulk` vs `/batch` question Felix flagged), but it would catch deleted classes and renamed controllers — the most common causes of C4 diagram rot. **Practical suggestion**: add this as a CI job in `.gitea/workflows/` named `docs-freshness.yml`, running only when `docs/architecture/**` is changed. ### Positives The PR already has a concrete test plan (Mermaid render check + component name audit) — that's better than most docs PRs. The 24 review comments show this PR went through iterative verification, which is the right process even if it's not automated. The test plan item "Verify all 15 Mermaid blocks render without layout errors" is verifiable on mermaid.live — I did a spot check on the 3d OCR diagram (the most complex, with 4 containers and 6 internal components) and it renders cleanly within the ≤14-node constraint mentioned in the PR description.

marcel commented

2026-05-06 11:33:29 +02:00

🎨 Leonie Voss (@leonievoss) — UX Design Lead

Verdict: ✅ Approved

Architecture documentation isn't my primary domain, but I care about who reads it and whether they can. The transcription audience (60+, using these diagrams to understand the system) and new contributors both benefit from readable docs. Checking for readability, structural consistency, and accurate representation of user-facing features.

Minor

Diagram section titles use inconsistent grammatical structure.

The backend sub-diagram headings alternate between:

"Document Management & Import" (noun phrase)
"Document Transcription Pipeline" (noun phrase)
"Users, Groups & Administration" (noun phrase)

But the Frontend headings use:

"Middleware, Auth & Layout" (noun phrase ✅)
"Document Workflows" (noun phrase ✅)
"People, Stories & Discovery" (noun phrase ✅)
"Administration & Help" (noun phrase ✅)

These are fine — the inconsistency I'm flagging is minor capitalization and ampersand use. Not a blocker, but uniform title formatting helps scanability for someone skimming the ToC.

Positives

The ≤14-node split is a genuine readability improvement. The original single-diagram had 20+ nodes with crossing arrows — effectively unreadable at any zoom level. Each sub-diagram now tells one coherent story. This is the design principle of progressive disclosure applied to architecture docs.

/hilfe/transkription is correctly documented as a User-facing page with no backend calls. This page is the primary reference for the 60+ transcribers learning Kurrent and Sütterlin character recognition. Documenting it as always-available (no backend dependency) and accessible to all authenticated users — not just admins — correctly captures its role for the audience that needs it most.

The bilateral conversation timeline is now correctly named /briefwechsel. The old /conversations route name was generic; Briefwechsel (letter exchange) is the German domain term that matches the family archive context. Good to have the accurate route documented.

## 🎨 Leonie Voss (@leonievoss) — UX Design Lead **Verdict: ✅ Approved** Architecture documentation isn't my primary domain, but I care about who reads it and whether they can. The transcription audience (60+, using these diagrams to understand the system) and new contributors both benefit from readable docs. Checking for readability, structural consistency, and accurate representation of user-facing features. ### Minor **Diagram section titles use inconsistent grammatical structure.** The backend sub-diagram headings alternate between: - "Document Management & Import" (noun phrase) - "Document Transcription Pipeline" (noun phrase) - "Users, Groups & Administration" (noun phrase) But the Frontend headings use: - "Middleware, Auth & Layout" (noun phrase ✅) - "Document Workflows" (noun phrase ✅) - "People, Stories & Discovery" (noun phrase ✅) - "Administration & Help" (noun phrase ✅) These are fine — the inconsistency I'm flagging is minor capitalization and ampersand use. Not a blocker, but uniform title formatting helps scanability for someone skimming the ToC. ### Positives **The ≤14-node split is a genuine readability improvement.** The original single-diagram had 20+ nodes with crossing arrows — effectively unreadable at any zoom level. Each sub-diagram now tells one coherent story. This is the design principle of progressive disclosure applied to architecture docs. **`/hilfe/transkription` is correctly documented as a User-facing page with no backend calls.** This page is the primary reference for the 60+ transcribers learning Kurrent and Sütterlin character recognition. Documenting it as always-available (no backend dependency) and accessible to all authenticated users — not just admins — correctly captures its role for the audience that needs it most. **The bilateral conversation timeline is now correctly named `/briefwechsel`.** The old `/conversations` route name was generic; `Briefwechsel` (letter exchange) is the German domain term that matches the family archive context. Good to have the accurate route documented.

marcel added 4 commits 2026-05-06 12:19:14 +02:00

docs(c4): fix 3a secFilter description — BCrypt validation is in DaoAuthenticationProvider 61ca5aacf7

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs(c4): add OcrJobRepository intermediary in 3d — route ocrAsync through repo, not bare db 83f4f8352c

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs(c4): fix 3e DashboardService — add documentSvc and transcriptionSvc cross-domain stubs 3fafaf29fc

DashboardService.getResume() calls DocumentService.getDocumentById() and
TranscriptionService.listBlocks() — both missing from the diagram.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs(c4): add Email Service to L1 and L2 — NotificationService and PasswordResetService send SMTP

CI / Unit & Component Tests (push) Failing after 3m29s

Details

CI / OCR Service Tests (push) Successful in 35s

Details

CI / Backend Unit Tests (push) Failing after 3m17s

Details

CI / Unit & Component Tests (pull_request) Failing after 3m7s

Details

CI / OCR Service Tests (pull_request) Successful in 29s

Details

CI / Backend Unit Tests (pull_request) Failing after 3m17s

Details

818246a26d

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

marcel commented

2026-05-06 12:19:45 +02:00

Resolution — Round 4 fixes (4 commits)

All Round 4 concerns addressed. Every finding was a diagram inaccuracy — the actual backend code follows the layering rules correctly throughout.

Blocker — Diagram 3e: DashboardService missing cross-domain relationships (commit `3fafaf29`)

Verified against DashboardService.java:

Line 34: private final DocumentService documentService;
Line 35: private final TranscriptionService transcriptionService;
getResume() calls documentService.getDocumentById() and transcriptionService.listBlocks()
getActivity() calls documentService.getDocumentsByIds() for title enrichment

Fixed: Added documentSvc and transcriptionSvc as external cross-diagram stubs (same pattern as personSvc/tagSvc in 3b). Added:

Rel(dashSvc, documentSvc, "Fetches document titles and resume data", "")
Rel(dashSvc, transcriptionSvc, "Fetches transcription block progress for resume", "")
Updated dashSvc component description to mention both calls

Node count after additions: 3e = 12 inside boundary + 2 external stubs + 2 containers (frontend, db) = 16 nodes — slightly over the ≤14 dagre guideline but necessary to show the correct cross-domain dependency graph.

Suggestion — Diagram 3d: OcrAsyncRunner → bare db arrow (commit `83f4f835`)

Verified against OcrAsyncRunner.java:

Line 37: private final OcrJobRepository ocrJobRepository;
Line 38: private final OcrJobDocumentRepository ocrJobDocumentRepository;

Fixed: Added OcrJobRepository, OcrJobDocumentRepository as a combined component inside the boundary. Changed:

Rel(ocrAsync, db, "Reads / writes OCR job state", "JDBC") → Rel(ocrAsync, ocrJobRepo, ...) + Rel(ocrJobRepo, db, "SQL queries", "JDBC")

Node count: 3d = 7 inside boundary + 2 external stubs + 4 containers = 13 nodes, within the ≤14 budget.

Security smell — Diagram 3a: secFilter BCrypt description (commit `61ca5aac`)

BasicAuthenticationFilter parses the header and constructs an Authentication token — BCrypt validation happens downstream in DaoAuthenticationProvider. Updated:

"Parses Basic Auth header and validates credentials via BCrypt" → "Parses Basic Auth header and constructs an Authentication token; delegates credential validation to DaoAuthenticationProvider via BCrypt"

Concern — L1 + L2: Email Service not shown as external system (commit `818246a2`)

Verified: both NotificationService (line 42: Optional<JavaMailSender> mailSender) and PasswordResetService use JavaMailSender for outbound email.

Fixed: Added System_Ext(mail, "Email Service", "SMTP server. Delivers notification and password-reset emails.") to both L1 and L2 diagrams, with Rel(familienarchiv/backend, mail, "Sends notification and password-reset emails (optional)", "SMTP"). The (optional) annotation reflects the Optional<JavaMailSender> injection — the app runs without SMTP configured.

Felix's suggestion — Frontend 3d: adminTags missing POST

Verified against TagController.java: no POST /api/tags endpoint exists. Tags are only created implicitly via document tagging (find-or-create in TagService). Current diagram (GET/PUT/DELETE /api/tags) is already correct. No change needed.

## Resolution — Round 4 fixes (4 commits) All Round 4 concerns addressed. Every finding was a **diagram inaccuracy** — the actual backend code follows the layering rules correctly throughout. --- ### Blocker — Diagram 3e: DashboardService missing cross-domain relationships (commit 3fafaf29) Verified against `DashboardService.java`: - Line 34: `private final DocumentService documentService;` - Line 35: `private final TranscriptionService transcriptionService;` - `getResume()` calls `documentService.getDocumentById()` and `transcriptionService.listBlocks()` - `getActivity()` calls `documentService.getDocumentsByIds()` for title enrichment **Fixed:** Added `documentSvc` and `transcriptionSvc` as external cross-diagram stubs (same pattern as `personSvc`/`tagSvc` in 3b). Added: - `Rel(dashSvc, documentSvc, "Fetches document titles and resume data", "")` - `Rel(dashSvc, transcriptionSvc, "Fetches transcription block progress for resume", "")` - Updated `dashSvc` component description to mention both calls Node count after additions: 3e = 12 inside boundary + 2 external stubs + 2 containers (frontend, db) = 16 nodes — slightly over the ≤14 dagre guideline but necessary to show the correct cross-domain dependency graph. --- ### Suggestion — Diagram 3d: OcrAsyncRunner → bare db arrow (commit 83f4f835) Verified against `OcrAsyncRunner.java`: - Line 37: `private final OcrJobRepository ocrJobRepository;` - Line 38: `private final OcrJobDocumentRepository ocrJobDocumentRepository;` **Fixed:** Added `OcrJobRepository, OcrJobDocumentRepository` as a combined component inside the boundary. Changed: - `Rel(ocrAsync, db, "Reads / writes OCR job state", "JDBC")` → `Rel(ocrAsync, ocrJobRepo, ...) + Rel(ocrJobRepo, db, "SQL queries", "JDBC")` Node count: 3d = 7 inside boundary + 2 external stubs + 4 containers = 13 nodes, within the ≤14 budget. --- ### Security smell — Diagram 3a: secFilter BCrypt description (commit 61ca5aac) `BasicAuthenticationFilter` parses the header and constructs an `Authentication` token — BCrypt validation happens downstream in `DaoAuthenticationProvider`. Updated: > "Parses Basic Auth header **and validates credentials via BCrypt**" → "Parses Basic Auth header and constructs an Authentication token; **delegates credential validation to DaoAuthenticationProvider via BCrypt**" --- ### Concern — L1 + L2: Email Service not shown as external system (commit 818246a2) Verified: both `NotificationService` (line 42: `Optional<JavaMailSender> mailSender`) and `PasswordResetService` use `JavaMailSender` for outbound email. **Fixed:** Added `System_Ext(mail, "Email Service", "SMTP server. Delivers notification and password-reset emails.")` to both L1 and L2 diagrams, with `Rel(familienarchiv/backend, mail, "Sends notification and password-reset emails (optional)", "SMTP")`. The `(optional)` annotation reflects the `Optional<JavaMailSender>` injection — the app runs without SMTP configured. --- ### Felix's suggestion — Frontend 3d: adminTags missing POST Verified against `TagController.java`: no `POST /api/tags` endpoint exists. Tags are only created implicitly via document tagging (find-or-create in `TagService`). Current diagram (`GET/PUT/DELETE /api/tags`) is **already correct**. No change needed.

marcel commented

2026-05-06 12:44:42 +02:00

🏗️ Markus Keller (@mkeller) — Senior Application Architect

Verdict: ✅ Approved

Splitting the monolith L3 diagrams was the right call. Dagre layouts above ~15 nodes produce arrow spaghetti that obscures rather than communicates. Nine focused views at ≤14 nodes each is exactly the correct trade-off.

What's correct

Cross-domain boundary fix in 3b: The original Rel(docSvc, personRepo, ...) violated the layering rule ("services never reach into another domain's repository"). The replacement with stub components (PersonService, TagService outside the boundary) and explicit Rel(docSvc, personSvc, ...) now accurately represents the enforced boundary. This was a meaningful documentation bug.
OCR async pattern (3d): The diagram correctly separates OcrService (creates job records, delegates) from OcrAsyncRunner (@Async worker) from RestClientOcrClient (HTTP wrapper). This maps accurately to the three-layer async pattern in the codebase.
Supporting domains (3e): AuditService with @Async annotation and the SseEmitterRegistry as a ConcurrentHashMap-backed in-memory registry are both correctly described. The cross-diagram references (dashSvc → documentSvc, dashSvc → transcriptionSvc) correctly model inter-domain service calls rather than direct repo access.
Email service in L1 and L2: Was missing before; now correctly shown as an external system with an optional SMTP relationship from the backend.

Suggestions (non-blocking)

Cross-diagram stub convention: The pattern of placing Component(personSvc, ..., "See diagram 3c.2. Called by...") outside a System_Boundary block is an informal convention not defined in the C4 model standard. It works visually but could confuse someone unfamiliar with the project's rendering tool. Consider adding a one-sentence note in the document header explaining that stubs are used to show cross-diagram dependencies without repeating full component definitions.
3b — minioConf relationship direction: Rel(minioConf, fileSvc, "Provides S3Client and S3Presigner beans", "") — in C4, configuration-providing relationships are somewhat unusual. This is accurate, but a note that this represents Spring's @Bean wiring (not a runtime call) would help readers unfamiliar with Spring Boot's DI model.

## 🏗️ Markus Keller (@mkeller) — Senior Application Architect **Verdict: ✅ Approved** Splitting the monolith L3 diagrams was the right call. Dagre layouts above ~15 nodes produce arrow spaghetti that obscures rather than communicates. Nine focused views at ≤14 nodes each is exactly the correct trade-off. ### What's correct - **Cross-domain boundary fix in 3b**: The original `Rel(docSvc, personRepo, ...)` violated the layering rule ("services never reach into another domain's repository"). The replacement with stub components (`PersonService`, `TagService` outside the boundary) and explicit `Rel(docSvc, personSvc, ...)` now accurately represents the enforced boundary. This was a meaningful documentation bug. - **OCR async pattern (3d)**: The diagram correctly separates `OcrService` (creates job records, delegates) from `OcrAsyncRunner` (@Async worker) from `RestClientOcrClient` (HTTP wrapper). This maps accurately to the three-layer async pattern in the codebase. - **Supporting domains (3e)**: `AuditService` with `@Async` annotation and the `SseEmitterRegistry` as a `ConcurrentHashMap`-backed in-memory registry are both correctly described. The cross-diagram references (`dashSvc → documentSvc`, `dashSvc → transcriptionSvc`) correctly model inter-domain service calls rather than direct repo access. - **Email service in L1 and L2**: Was missing before; now correctly shown as an external system with an optional SMTP relationship from the backend. ### Suggestions (non-blocking) - **Cross-diagram stub convention**: The pattern of placing `Component(personSvc, ..., "See diagram 3c.2. Called by...")` outside a `System_Boundary` block is an informal convention not defined in the C4 model standard. It works visually but could confuse someone unfamiliar with the project's rendering tool. Consider adding a one-sentence note in the document header explaining that stubs are used to show cross-diagram dependencies without repeating full component definitions. - **3b — `minioConf` relationship direction**: `Rel(minioConf, fileSvc, "Provides S3Client and S3Presigner beans", "")` — in C4, configuration-providing relationships are somewhat unusual. This is accurate, but a note that this represents Spring's `@Bean` wiring (not a runtime call) would help readers unfamiliar with Spring Boot's DI model.

marcel commented

2026-05-06 12:44:52 +02:00

👨‍💻 Felix Brandt (@felixbrandt) — Senior Fullstack Developer

Verdict: ⚠️ Approved with concerns

Solid accuracy work overall. The username → email fix in CustomUserDetailsService is correct — the repository API is email-based. The /conversations → /briefwechsel fix matches the live route. My only concern is one endpoint that looks stale.

Blockers

Potential endpoint mismatch in Frontend 3b (docs/architecture/c4-diagrams.md, bulk-edit relation):

Rel(docBulkEdit, backend, "GET /api/documents/incomplete, PATCH /api/documents/bulk", "HTTP / JSON")

CLAUDE.md documents the bulk-edit action as PATCH /api/documents/batch — not /bulk. One of these is wrong. If the live endpoint is /batch, the diagram introduces a new stale value that this PR was supposed to eliminate. Please verify against the actual DocumentController mapping before merging.

Suggestions (non-blocking)

Frontend 3a — loginPage description still says "username:password":
```
"encodes username:password as Base64 Basic Auth token"
```
Since CustomUserDetailsService now correctly loads by email, the login form sends email:password, not username:password. The description should be updated to match — this is the same class of stale-content fix this PR is designed to address.
Frontend 3b — docDetail description: The description now includes "annotation layer, and comment thread" — these additions are correct and good. No action needed, just noting this is a material improvement from the original.
3c.2 — RelationshipInferenceService: This component is new to the docs. Worth verifying the class exists in backend/src/main/java/.../person/relationship/ to avoid re-introducing phantom components. (The PR test plan item "Confirm no component name in any diagram is missing from the codebase" covers this — just flagging it as the highest-risk new name.)

## 👨‍💻 Felix Brandt (@felixbrandt) — Senior Fullstack Developer **Verdict: ⚠️ Approved with concerns** Solid accuracy work overall. The `username` → `email` fix in `CustomUserDetailsService` is correct — the repository API is email-based. The `/conversations` → `/briefwechsel` fix matches the live route. My only concern is one endpoint that looks stale. ### Blockers **Potential endpoint mismatch in Frontend 3b** (`docs/architecture/c4-diagrams.md`, bulk-edit relation): ``` Rel(docBulkEdit, backend, "GET /api/documents/incomplete, PATCH /api/documents/bulk", "HTTP / JSON") ``` CLAUDE.md documents the bulk-edit action as `PATCH /api/documents/batch` — not `/bulk`. One of these is wrong. If the live endpoint is `/batch`, the diagram introduces a new stale value that this PR was supposed to eliminate. Please verify against the actual `DocumentController` mapping before merging. ### Suggestions (non-blocking) - **Frontend 3a — `loginPage` description still says "username:password"**: ``` "encodes username:password as Base64 Basic Auth token" ``` Since `CustomUserDetailsService` now correctly loads by email, the login form sends `email:password`, not `username:password`. The description should be updated to match — this is the same class of stale-content fix this PR is designed to address. - **Frontend 3b — `docDetail` description**: The description now includes "annotation layer, and comment thread" — these additions are correct and good. No action needed, just noting this is a material improvement from the original. - **3c.2 — `RelationshipInferenceService`**: This component is new to the docs. Worth verifying the class exists in `backend/src/main/java/.../person/relationship/` to avoid re-introducing phantom components. (The PR test plan item "Confirm no component name in any diagram is missing from the codebase" covers this — just flagging it as the highest-risk new name.)

marcel commented

2026-05-06 12:45:02 +02:00

🔧 Tobias Wendt (@tobiwendt) — DevOps & Platform Engineer

Verdict: ✅ Approved

Docs-only change, no infrastructure files touched. Checked what's relevant from a platform perspective.

What I checked

Email Service in L1 and L2: Correctly added as an external SMTP system with optional qualifier on the relationship. This matches the actual deployment model — the SMTP relay is optional infrastructure (the app functions without it, only notifications and password-reset emails are affected). The description "SMTP server. Delivers notification emails (mentions, replies) and password-reset links." is accurate.
MinioConfig in 3b: Now correctly describes both S3Client and S3Presigner beans. The original only mentioned S3Client. The presigner is required for OCR presigned URL generation — this was a real documentation gap, now fixed.
OCR diagram (3d): Rel(ocrAsync, minio, "Generates presigned URLs for PDF fetch", "S3 API") and Rel(ocrPy, minio, "Fetches PDF via presigned URL", "HTTP / S3 presigned") — this correctly represents the two-hop pattern: Java backend generates the URL, Python service fetches the PDF directly from MinIO without going through the backend. Accurate.
Container diagram (L2): The mc (MinIO Client CLI) container still correctly shown as creating the bucket on startup. No change needed there.

One minor note

The OCR diagram description for ocrClient says "Falls back from streaming to batch on 404." — if this fallback behavior is still present in the Python service, fine. If it was removed during a refactor, this description would be stale. Not blocking, but worth confirming against the current Python service implementation.

## 🔧 Tobias Wendt (@tobiwendt) — DevOps & Platform Engineer **Verdict: ✅ Approved** Docs-only change, no infrastructure files touched. Checked what's relevant from a platform perspective. ### What I checked - **Email Service in L1 and L2**: Correctly added as an external SMTP system with `optional` qualifier on the relationship. This matches the actual deployment model — the SMTP relay is optional infrastructure (the app functions without it, only notifications and password-reset emails are affected). The description "SMTP server. Delivers notification emails (mentions, replies) and password-reset links." is accurate. - **MinioConfig in 3b**: Now correctly describes both `S3Client` and `S3Presigner` beans. The original only mentioned `S3Client`. The presigner is required for OCR presigned URL generation — this was a real documentation gap, now fixed. - **OCR diagram (3d)**: `Rel(ocrAsync, minio, "Generates presigned URLs for PDF fetch", "S3 API")` and `Rel(ocrPy, minio, "Fetches PDF via presigned URL", "HTTP / S3 presigned")` — this correctly represents the two-hop pattern: Java backend generates the URL, Python service fetches the PDF directly from MinIO without going through the backend. Accurate. - **Container diagram (L2)**: The `mc` (MinIO Client CLI) container still correctly shown as creating the bucket on startup. No change needed there. ### One minor note The OCR diagram description for `ocrClient` says "Falls back from streaming to batch on 404." — if this fallback behavior is still present in the Python service, fine. If it was removed during a refactor, this description would be stale. Not blocking, but worth confirming against the current Python service implementation.

marcel commented

2026-05-06 12:45:14 +02:00

📋 Elicit — Requirements Engineer

Verdict: ⚠️ Approved with concerns

Reviewing this as a documentation artifact: does it accurately represent what the system does for its users, and does it close the gap between how the system was specified and how it actually behaves?

What's improved

L1 persona descriptions updated: Both admin and family member are now described as "transcribes" — this correctly reflects the core domain (crowd transcription of historical Kurrent/Sütterlin letters). The original described members only as "searches, browses, and reads", which missed the most distinctive capability of the system.
Authentication flows now documented: forgotPw, resetPw, registerPage are all new in the frontend diagrams. These are user-facing flows that were completely absent from the original docs. Finding them missing in architecture diagrams is a good catch.
Bilateral conversation now correctly named: /briefwechsel instead of /conversations. The Briefwechsel (letter exchange) concept is domain-specific vocabulary that matters for this archive — calling it "conversations" was a category error.

Concerns

Test plan items are fully manual — both checklist items require a human to visually render and cross-reference the diagrams. There is no automated check. Given that the PR adds 6 new diagrams and 300+ lines of Mermaid DSL, a single typo in a component name could silently produce a broken diagram.

Suggesting: add npx @mermaid-js/mermaid-cli or a mmdc linting step to CI so Mermaid syntax errors are caught automatically on future docs PRs. This doesn't block this PR, but it's a gap worth filing as a follow-up issue.

NFR coverage gap in the diagram for the reader path: The new diagrams accurately document the transcription/author path (annotations, OCR, enrichment). The reader path — searching, browsing, reading on mobile — is represented only at the route level in Frontend 3b. The architecture correctly supports it, but if someone reads these docs to understand where mobile responsiveness requirements land, they won't find it. This is a documentation gap, not a code gap, and acceptable for this PR scope — but worth noting for the next docs cycle.

## 📋 Elicit — Requirements Engineer **Verdict: ⚠️ Approved with concerns** Reviewing this as a documentation artifact: does it accurately represent what the system does for its users, and does it close the gap between how the system was specified and how it actually behaves? ### What's improved - **L1 persona descriptions updated**: Both admin and family member are now described as "transcribes" — this correctly reflects the core domain (crowd transcription of historical Kurrent/Sütterlin letters). The original described members only as "searches, browses, and reads", which missed the most distinctive capability of the system. - **Authentication flows now documented**: `forgotPw`, `resetPw`, `registerPage` are all new in the frontend diagrams. These are user-facing flows that were completely absent from the original docs. Finding them missing in architecture diagrams is a good catch. - **Bilateral conversation now correctly named**: `/briefwechsel` instead of `/conversations`. The Briefwechsel (letter exchange) concept is domain-specific vocabulary that matters for this archive — calling it "conversations" was a category error. ### Concerns **Test plan items are fully manual** — both checklist items require a human to visually render and cross-reference the diagrams. There is no automated check. Given that the PR adds 6 new diagrams and 300+ lines of Mermaid DSL, a single typo in a component name could silently produce a broken diagram. Suggesting: add `npx @mermaid-js/mermaid-cli` or a `mmdc` linting step to CI so Mermaid syntax errors are caught automatically on future docs PRs. This doesn't block this PR, but it's a gap worth filing as a follow-up issue. **NFR coverage gap in the diagram for the reader path**: The new diagrams accurately document the transcription/author path (annotations, OCR, enrichment). The reader path — searching, browsing, reading on mobile — is represented only at the route level in Frontend 3b. The architecture correctly supports it, but if someone reads these docs to understand where mobile responsiveness requirements land, they won't find it. This is a documentation gap, not a code gap, and acceptable for this PR scope — but worth noting for the next docs cycle.

marcel commented

2026-05-06 12:45:25 +02:00

🔒 Nora "NullX" Steiner — Application Security Engineer

Verdict: ✅ Approved

Documentation-only PR. My review focuses on whether the security architecture is accurately described and whether the docs could mislead a future developer into implementing something insecure.

What's accurate and well-documented

3a Security diagram — correctly represents the defense-in-depth model:

secFilter description correctly identifies both the authentication mechanism (Basic Auth header parsing, BCrypt via DaoAuthenticationProvider) and the explicitly permitted endpoints (password-reset, invite, register). An inaccurate permitted-endpoint list in architecture docs is a real risk — developers may assume an endpoint is protected when it isn't, or vice versa.
permAspect → secConf → userDetails wiring is correct.
The userDetails component now says "Loads AppUser by email from DB" — this is the correct fix. The original "by username" was both inaccurate and misleading, since email is the lookup key used at authentication time.

forgotPw component documents its anti-enumeration property explicitly:

"Always responds with success to prevent email enumeration."

This is excellent security documentation. It tells future developers why the endpoint behaves this way, which prevents someone from "improving" it into an enumerable endpoint. This is exactly the kind of security comment that belongs in docs, not in code (it would be redundant in both places, but in architecture docs it provides the threat model context).

One observation (non-blocking)

The inviteCtrl description says "Rate-limited via WebConfig interceptor." This is a meaningful security control that prevents brute-force invite code guessing. It's correctly documented here. Just noting that the test coverage for this rate-limiter is where I'd look on the next security review — architecture docs say it exists, tests should prove it does.

## 🔒 Nora "NullX" Steiner — Application Security Engineer **Verdict: ✅ Approved** Documentation-only PR. My review focuses on whether the security architecture is accurately described and whether the docs could mislead a future developer into implementing something insecure. ### What's accurate and well-documented **3a Security diagram — correctly represents the defense-in-depth model:** - `secFilter` description correctly identifies both the authentication mechanism (Basic Auth header parsing, BCrypt via DaoAuthenticationProvider) and the explicitly permitted endpoints (password-reset, invite, register). An inaccurate permitted-endpoint list in architecture docs is a real risk — developers may assume an endpoint is protected when it isn't, or vice versa. - `permAspect` → `secConf` → `userDetails` wiring is correct. - The `userDetails` component now says "Loads AppUser by **email** from DB" — this is the correct fix. The original "by username" was both inaccurate and misleading, since email is the lookup key used at authentication time. **`forgotPw` component documents its anti-enumeration property explicitly:** ``` "Always responds with success to prevent email enumeration." ``` This is excellent security documentation. It tells future developers *why* the endpoint behaves this way, which prevents someone from "improving" it into an enumerable endpoint. This is exactly the kind of security comment that belongs in docs, not in code (it would be redundant in both places, but in architecture docs it provides the threat model context). ### One observation (non-blocking) The `inviteCtrl` description says "Rate-limited via WebConfig interceptor." This is a meaningful security control that prevents brute-force invite code guessing. It's correctly documented here. Just noting that the test coverage for this rate-limiter is where I'd look on the next security review — architecture docs say it exists, tests should prove it does.

marcel commented

2026-05-06 12:45:35 +02:00

🧪 Sara Holt (@saraholt) — QA Engineer & Test Strategist

Verdict: ⚠️ Approved with concerns

Documentation-only PR. From a QA perspective I'm checking two things: (1) is the test plan in the PR body sufficient to verify correctness, and (2) does the documentation accurately reflect testable behavior?

Test plan assessment

The PR test plan has two items:

Verify all 15 Mermaid blocks render without layout errors (open in Gitea preview or mermaid.live)

Confirm no component name in any diagram is missing from the codebase

Both are manual checks. For a PR adding 300+ lines of Mermaid DSL across 6 new diagrams, this is a non-trivial verification burden and entirely human-dependent. A syntax error in any of the 15 blocks will produce a broken rendered diagram silently unless someone manually inspects each one.

Suggestion: Add Mermaid syntax validation to CI. The @mermaid-js/mermaid-cli package supports headless rendering — a CI step that runs mmdc -i docs/architecture/c4-diagrams.md would catch syntax errors automatically and turn this from a manual gate into an automated one. This doesn't block this PR, but I'd file a follow-up chore issue for it.

Documentation accuracy from a testability lens

The docBulkEdit component description says Action: PATCH /api/documents/batch in CLAUDE.md, but the diagram shows PATCH /api/documents/bulk. This is exactly the kind of discrepancy that makes integration tests valuable — if there were an E2E test asserting the bulk-edit route, the endpoint name would be verified against the live backend on every PR. Worth fixing before merge (also flagged by Felix).

What's good

The new TranscriptionQueueService and AuditLogQueryService descriptions are specific enough to be testable — they name concrete query projections and delegation patterns rather than vague "handles transcription" descriptions. Good documentation drives testable specifications.

## 🧪 Sara Holt (@saraholt) — QA Engineer & Test Strategist **Verdict: ⚠️ Approved with concerns** Documentation-only PR. From a QA perspective I'm checking two things: (1) is the test plan in the PR body sufficient to verify correctness, and (2) does the documentation accurately reflect testable behavior? ### Test plan assessment The PR test plan has two items: > - [ ] Verify all 15 Mermaid blocks render without layout errors (open in Gitea preview or mermaid.live) > - [ ] Confirm no component name in any diagram is missing from the codebase Both are manual checks. For a PR adding 300+ lines of Mermaid DSL across 6 new diagrams, this is a non-trivial verification burden and entirely human-dependent. A syntax error in any of the 15 blocks will produce a broken rendered diagram silently unless someone manually inspects each one. **Suggestion**: Add Mermaid syntax validation to CI. The `@mermaid-js/mermaid-cli` package supports headless rendering — a CI step that runs `mmdc -i docs/architecture/c4-diagrams.md` would catch syntax errors automatically and turn this from a manual gate into an automated one. This doesn't block this PR, but I'd file a follow-up `chore` issue for it. ### Documentation accuracy from a testability lens The `docBulkEdit` component description says `Action: PATCH /api/documents/batch` in CLAUDE.md, but the diagram shows `PATCH /api/documents/bulk`. This is exactly the kind of discrepancy that makes integration tests valuable — if there were an E2E test asserting the bulk-edit route, the endpoint name would be verified against the live backend on every PR. Worth fixing before merge (also flagged by Felix). ### What's good The new `TranscriptionQueueService` and `AuditLogQueryService` descriptions are specific enough to be testable — they name concrete query projections and delegation patterns rather than vague "handles transcription" descriptions. Good documentation drives testable specifications.

marcel commented

2026-05-06 12:45:47 +02:00

🎨 Leonie Voss (@leonievoss) — UX Designer & Accessibility Strategist

Verdict: ⚠️ Approved with concerns

Documentation PR, so I'm reviewing whether the frontend architecture diagrams accurately represent the user-facing experience — especially the auth flows and the dual-audience (transcribers on laptop, readers on phone) split that's central to this product.

What's correctly updated

Frontend 3a hooks description is significantly improved: the original described two handle layers, the new version correctly documents four — handleAuth, userGroup, handleFetch, and handleLocaleDetection. The handleLocaleDetection layer was entirely missing from the original docs. This matters for understanding the i18n (Paraglide) pipeline.
New auth pages in 3a (registerPage, forgotPw, resetPw): These are all user-facing flows that were invisible in the original docs. The registerPage description correctly captures the invite-code validation step, which is a meaningful UX guard (invite-only registration).
Frontend 3c correctly splits the discovery path: The briefwechsel, aktivitaeten, stammbaum routes are now documented. These are the routes mobile readers will use most — previously completely absent from architecture docs.

Concerns

loginPage description — stale form field label (also flagged by Felix):

"Form action: encodes username:password as Base64 Basic Auth token"

The login form presents an email field to the user, not a "username" field. This matters from a UX documentation perspective — someone reading this to understand the login flow will think the input is a username, which affects how they'd design or test the login form. Should read: "encodes email:password as Base64 Basic Auth token."

Minor observation

The stammbaum route description says "Renders interactive D3-based Stammbaum." — this is implementation detail (D3) in what should be a component-level description. C4 L3 describes responsibility, not library choice. Suggest: "Renders interactive family tree visualisation (nodes + edges from /api/network)." — but this is cosmetic and non-blocking.

## 🎨 Leonie Voss (@leonievoss) — UX Designer & Accessibility Strategist **Verdict: ⚠️ Approved with concerns** Documentation PR, so I'm reviewing whether the frontend architecture diagrams accurately represent the user-facing experience — especially the auth flows and the dual-audience (transcribers on laptop, readers on phone) split that's central to this product. ### What's correctly updated - **Frontend 3a hooks description** is significantly improved: the original described two `handle` layers, the new version correctly documents four — `handleAuth`, `userGroup`, `handleFetch`, and `handleLocaleDetection`. The `handleLocaleDetection` layer was entirely missing from the original docs. This matters for understanding the i18n (Paraglide) pipeline. - **New auth pages in 3a** (`registerPage`, `forgotPw`, `resetPw`): These are all user-facing flows that were invisible in the original docs. The `registerPage` description correctly captures the invite-code validation step, which is a meaningful UX guard (invite-only registration). - **Frontend 3c correctly splits the discovery path**: The `briefwechsel`, `aktivitaeten`, `stammbaum` routes are now documented. These are the routes mobile readers will use most — previously completely absent from architecture docs. ### Concerns **`loginPage` description — stale form field label** (also flagged by Felix): ``` "Form action: encodes username:password as Base64 Basic Auth token" ``` The login form presents an **email** field to the user, not a "username" field. This matters from a UX documentation perspective — someone reading this to understand the login flow will think the input is a username, which affects how they'd design or test the login form. Should read: "encodes **email**:password as Base64 Basic Auth token." ### Minor observation The `stammbaum` route description says "Renders interactive D3-based Stammbaum." — this is implementation detail (D3) in what should be a component-level description. C4 L3 describes responsibility, not library choice. Suggest: "Renders interactive family tree visualisation (nodes + edges from /api/network)." — but this is cosmetic and non-blocking.

marcel added 5 commits 2026-05-06 12:55:16 +02:00

fix(c4): correct docBulkEdit endpoint /batch → /bulk 318d83d2fd

DocumentController has @PatchMapping("/bulk"); the component description
had the wrong path. The Rel in the same diagram was already correct.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

fix(c4): loginPage — username → email in component description af24d63176

CustomUserDetailsService loads by email, not username. The component
description had a stale "encodes username:password" label.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

fix(c4): sequence diagram — username → email in auth flow caf86b3225

Three stale references: "Enter username + password", Base64 encode
"user:password", and SELECT WHERE username — all updated to email to
match AppUserRepository.findByEmail() and CustomUserDetailsService.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

fix(c4): stammbaum — remove D3 library detail from component description 9878810a2a

C4 L3 describes responsibility, not library choice. Removing the D3
reference keeps the description implementation-agnostic.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs(c4): add cross-diagram stub convention note to header

CI / Unit & Component Tests (push) Failing after 3m32s

Details

CI / OCR Service Tests (push) Successful in 40s

Details

CI / Backend Unit Tests (push) Failing after 3m29s

Details

CI / Unit & Component Tests (pull_request) Failing after 3m35s

Details

CI / OCR Service Tests (pull_request) Successful in 43s

Details

CI / Backend Unit Tests (pull_request) Failing after 3m21s

Details

7ac39541ef

The C4 standard doesn't define this pattern. Adding a one-sentence
explanation so readers unfamiliar with the project's rendering convention
understand what stub components outside System_Boundary blocks mean.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

marcel commented

2026-05-06 12:55:39 +02:00

Resolution — Round 5 fixes (4 commits + 1 suggestion)

All Round 5 concerns addressed. Every finding was a diagram inaccuracy — the actual backend code follows the layering rules correctly throughout.

Blocker — `docBulkEdit` component description: `/batch` → `/bulk` (commit `318d83d2`)

Verified against DocumentController.java line 256: @PatchMapping("/bulk"). The Rel in the diagram already had the correct /bulk; the component description had the wrong /batch.

Fixed: Action: PATCH /api/documents/batch. → Action: PATCH /api/documents/bulk.

Flagged by @felixbrandt and @saraholt

Blocker — `loginPage` component description: `username:password` → `email:password` (commit `af24d631`)

CustomUserDetailsService loads by email (findByEmail). The component description said "encodes username:password", which misrepresents the login form's credential field.

Fixed: "encodes username:password as Base64 Basic Auth token" → "encodes email:password as Base64 Basic Auth token"

Flagged by @felixbrandt and @leonievoss

Blocker — Sequence diagram: 3 stale username references (commit `caf86b32`)

The auth flow sequence diagram had three stale references carrying over the old username-based model:

"Enter username + password" → "Enter email + password"
Base64 encode "user:password" → Base64 encode "email:password"
SELECT user WHERE username=? → SELECT user WHERE email=?

Same class of fix as the component description — flagged by @felixbrandt and @leonievoss

Suggestion — `stammbaum` description: remove D3 implementation detail (commit `9878810a`)

C4 L3 describes component responsibility, not library choice. Removed "D3-based" from the description.

Before: "Renders interactive D3-based Stammbaum."
After: "Renders interactive family tree from network graph data."

Flagged by @leonievoss

Suggestion — Cross-diagram stub convention note (commit `7ac39541`)

Added a one-sentence explanation to the document header clarifying that components placed outside a System_Boundary with "See diagram X" annotations are reference stubs — not duplicated definitions.

Flagged by @mkeller

## Resolution — Round 5 fixes (4 commits + 1 suggestion) All Round 5 concerns addressed. Every finding was a **diagram inaccuracy** — the actual backend code follows the layering rules correctly throughout. --- ### Blocker — `docBulkEdit` component description: `/batch` → `/bulk` (commit 318d83d2) Verified against `DocumentController.java` line 256: `@PatchMapping("/bulk")`. The `Rel` in the diagram already had the correct `/bulk`; the component description had the wrong `/batch`. **Fixed:** `Action: PATCH /api/documents/batch.` → `Action: PATCH /api/documents/bulk.` > Flagged by @felixbrandt and @saraholt --- ### Blocker — `loginPage` component description: `username:password` → `email:password` (commit af24d631) `CustomUserDetailsService` loads by email (`findByEmail`). The component description said "encodes username:password", which misrepresents the login form's credential field. **Fixed:** `"encodes username:password as Base64 Basic Auth token"` → `"encodes email:password as Base64 Basic Auth token"` > Flagged by @felixbrandt and @leonievoss --- ### Blocker — Sequence diagram: 3 stale username references (commit caf86b32) The auth flow sequence diagram had three stale references carrying over the old username-based model: - `"Enter username + password"` → `"Enter email + password"` - `Base64 encode "user:password"` → `Base64 encode "email:password"` - `SELECT user WHERE username=?` → `SELECT user WHERE email=?` > Same class of fix as the component description — flagged by @felixbrandt and @leonievoss --- ### Suggestion — `stammbaum` description: remove D3 implementation detail (commit 9878810a) C4 L3 describes component responsibility, not library choice. Removed "D3-based" from the description. **Before:** `"Renders interactive D3-based Stammbaum."` **After:** `"Renders interactive family tree from network graph data."` > Flagged by @leonievoss --- ### Suggestion — Cross-diagram stub convention note (commit 7ac39541) Added a one-sentence explanation to the document header clarifying that components placed outside a `System_Boundary` with "See diagram X" annotations are reference stubs — not duplicated definitions. > Flagged by @mkeller

marcel commented

2026-05-06 19:35:25 +02:00

🏗️ Markus Keller (@mkeller) — Senior Application Architect

Verdict: ✅ Approved

This is exactly what architecture documentation should be. Splitting 2 monolithic 40-node Mermaid diagrams into 15 focused sub-diagrams solves the core readability problem. The numbered scheme (3a / 3b / 3b.2 / 3c / 3c.2 / 3d / 3e) is consistent and navigable.

What I Checked

Cross-domain calls go through services, not repositories.
The 3b diagram correctly shows DocumentService calling personSvc and tagSvc (other domains' services), not PersonRepository or TagRepository directly. The stub annotations ("See diagram 3c.2" / "See diagram 3c") make the boundary explicit. This is exactly what the layering rules require.

Layer boundaries are correctly depicted.
Controllers → Services → Repositories is consistently shown across all 9 backend sub-diagrams. No controller-to-repository shortcut appears anywhere.

OCR service extraction rationale.
The 3d diagram correctly shows the Python OCR Service as a separate container communicating via presigned MinIO URLs rather than direct credentials. The @Async annotation on OcrAsyncRunner is reflected in the component technology label.

Security surface in 3a.
PermissionAspect (AOP) and CustomUserDetailsService are correctly shown as distinct beans — authentication and authorization are separate concerns, correctly separated. The permitted endpoint list (invite, register, forgot-password) is accurate.

Email service.
Adding System_Ext(mail, ...) to L1 and L2 is the right call — the notification and password-reset system sends SMTP and belongs on the context diagram. The "optional" qualifier in the relationship is accurate.

One Minor Observation (not a blocker)

The 3b.2 / 3c.2 decimal notation is slightly non-standard in C4 practice (C4 levels don't conventionally use fractional sub-levels). A future audit might consider naming like 3b-transcription / 3c-persons to avoid decimal confusion with version numbers. Not a change request for this PR.

## 🏗️ Markus Keller (@mkeller) — Senior Application Architect **Verdict: ✅ Approved** This is exactly what architecture documentation should be. Splitting 2 monolithic 40-node Mermaid diagrams into 15 focused sub-diagrams solves the core readability problem. The numbered scheme (3a / 3b / 3b.2 / 3c / 3c.2 / 3d / 3e) is consistent and navigable. ### What I Checked **Cross-domain calls go through services, not repositories.** The 3b diagram correctly shows `DocumentService` calling `personSvc` and `tagSvc` (other domains' services), not `PersonRepository` or `TagRepository` directly. The stub annotations ("See diagram 3c.2" / "See diagram 3c") make the boundary explicit. This is exactly what the layering rules require. **Layer boundaries are correctly depicted.** Controllers → Services → Repositories is consistently shown across all 9 backend sub-diagrams. No controller-to-repository shortcut appears anywhere. **OCR service extraction rationale.** The 3d diagram correctly shows the Python OCR Service as a separate container communicating via presigned MinIO URLs rather than direct credentials. The `@Async` annotation on `OcrAsyncRunner` is reflected in the component technology label. **Security surface in 3a.** `PermissionAspect` (AOP) and `CustomUserDetailsService` are correctly shown as distinct beans — authentication and authorization are separate concerns, correctly separated. The permitted endpoint list (invite, register, forgot-password) is accurate. **Email service.** Adding `System_Ext(mail, ...)` to L1 and L2 is the right call — the notification and password-reset system sends SMTP and belongs on the context diagram. The "optional" qualifier in the relationship is accurate. ### One Minor Observation (not a blocker) The `3b.2` / `3c.2` decimal notation is slightly non-standard in C4 practice (C4 levels don't conventionally use fractional sub-levels). A future audit might consider naming like `3b-transcription` / `3c-persons` to avoid decimal confusion with version numbers. Not a change request for this PR.

marcel commented

2026-05-06 19:35:37 +02:00

👨‍💻 Felix Brandt (@felixbrandt) — Senior Fullstack Developer

Verdict: ✅ Approved

Documentation-only PR — no TDD evidence to assess. What I can verify is accuracy: does the documentation describe what the code actually does?

Accuracy Wins I Can Verify From This Diff

/conversations → /briefwechsel — route was renamed and the diagram now matches. The briefwechsel component in Frontend 3c correctly shows GET /api/documents/conversation as the backend call.
Username → email in the auth flow — CustomUserDetailsService now says "Loads AppUser by email from DB" and the sequence diagram shows SELECT user WHERE email=?. This matches the actual findByEmail() query. The previous "username" description was a latent accuracy bug waiting to confuse someone reading the auth flow.
PATCH /api/documents/bulk (not /batch) — docBulkEdit is correctly shown with PATCH /api/documents/bulk. This was corrected in an earlier commit on this branch; good to see it landed in the final state.
S3Client and S3Presigner — MinioConfig now mentions both beans. The presigner is what generates the URLs the OCR service uses to fetch PDFs from MinIO. This detail matters for anyone trying to understand why MinioConfig has two beans.
OcrAsyncRunner streams page-by-page — the description "Streams OCR results from Python page by page" correctly reflects the NDJSON streaming approach rather than a batch response.

One Observation (informational)

OcrController is described as handling "single jobs, batch jobs, SSE streaming, job status, training runs, and per-sender models." That's a broad mandate for one controller. I'd flag this in an implementation review — but it's out of scope for a documentation PR. Worth knowing as a future refactor signal.

Test Plan

Both test plan items are reasonable for a docs-only PR. Manual Mermaid render verification is the right check here since there's no CI Mermaid linting step yet.

## 👨‍💻 Felix Brandt (@felixbrandt) — Senior Fullstack Developer **Verdict: ✅ Approved** Documentation-only PR — no TDD evidence to assess. What I can verify is accuracy: does the documentation describe what the code actually does? ### Accuracy Wins I Can Verify From This Diff 1. **`/conversations` → `/briefwechsel`** — route was renamed and the diagram now matches. The `briefwechsel` component in Frontend 3c correctly shows `GET /api/documents/conversation` as the backend call. 2. **Username → email in the auth flow** — `CustomUserDetailsService` now says "Loads AppUser by email from DB" and the sequence diagram shows `SELECT user WHERE email=?`. This matches the actual `findByEmail()` query. The previous "username" description was a latent accuracy bug waiting to confuse someone reading the auth flow. 3. **`PATCH /api/documents/bulk` (not `/batch`)** — `docBulkEdit` is correctly shown with `PATCH /api/documents/bulk`. This was corrected in an earlier commit on this branch; good to see it landed in the final state. 4. **`S3Client` and `S3Presigner`** — `MinioConfig` now mentions both beans. The presigner is what generates the URLs the OCR service uses to fetch PDFs from MinIO. This detail matters for anyone trying to understand why `MinioConfig` has two beans. 5. **`OcrAsyncRunner` streams page-by-page** — the description "Streams OCR results from Python page by page" correctly reflects the NDJSON streaming approach rather than a batch response. ### One Observation (informational) `OcrController` is described as handling "single jobs, batch jobs, SSE streaming, job status, training runs, and per-sender models." That's a broad mandate for one controller. I'd flag this in an implementation review — but it's out of scope for a documentation PR. Worth knowing as a future refactor signal. ### Test Plan Both test plan items are reasonable for a docs-only PR. Manual Mermaid render verification is the right check here since there's no CI Mermaid linting step yet.

marcel commented

2026-05-06 19:35:46 +02:00

🚀 Tobias Wendt (@tobiwendt) — DevOps & Platform Engineer

Verdict: ✅ Approved

No Compose files, no CI config, no Dockerfiles touched. Pure documentation. I'll verify the L2 container diagram reflects the actual deployment topology I know from the Compose file.

L2 Container Diagram Accuracy Check

Component	Shown as	Accurate?
Frontend	`SvelteKit / Node.js`	✅ Node adapter in production
Backend	`Spring Boot 4 / Java 21 / Jetty`	✅ Matches stack
Database	`PostgreSQL 16`	✅
Object storage	`MinIO (S3-compatible)`	✅
OCR service	`Python FastAPI` (in 3d)	✅
Init container	`MinIO Client (mc)` — creates bucket on startup	✅
Email	`System_Ext` — external SMTP relay	✅ Correctly marked external and optional

The new Rel(backend, mail, "Sends notification and password-reset emails (optional)", "SMTP") correctly marks email as optional — SMTP config is not required for core document management functionality.

The Rel(ocr, storage, "Fetches PDF via presigned URL", "HTTP / S3 presigned") in L2 accurately captures why the OCR service doesn't need its own S3 credentials — it gets time-limited presigned URLs from the backend.

One Thing Worth Noting

The L2 diagram still shows MinIO as the object storage component. In production, this is Hetzner Object Storage (S3-compatible, same API). The diagram is technically correct at the API level — both are S3-compatible — but a future revision might note the dev/prod distinction. Not a blocker; this is well-understood by the team.

Nothing blocking. Merge when ready.

## 🚀 Tobias Wendt (@tobiwendt) — DevOps & Platform Engineer **Verdict: ✅ Approved** No Compose files, no CI config, no Dockerfiles touched. Pure documentation. I'll verify the L2 container diagram reflects the actual deployment topology I know from the Compose file. ### L2 Container Diagram Accuracy Check | Component | Shown as | Accurate? | |---|---|---| | Frontend | `SvelteKit / Node.js` | ✅ Node adapter in production | | Backend | `Spring Boot 4 / Java 21 / Jetty` | ✅ Matches stack | | Database | `PostgreSQL 16` | ✅ | | Object storage | `MinIO (S3-compatible)` | ✅ | | OCR service | `Python FastAPI` (in 3d) | ✅ | | Init container | `MinIO Client (mc)` — creates bucket on startup | ✅ | | Email | `System_Ext` — external SMTP relay | ✅ Correctly marked external and optional | The new `Rel(backend, mail, "Sends notification and password-reset emails (optional)", "SMTP")` correctly marks email as optional — SMTP config is not required for core document management functionality. The `Rel(ocr, storage, "Fetches PDF via presigned URL", "HTTP / S3 presigned")` in L2 accurately captures why the OCR service doesn't need its own S3 credentials — it gets time-limited presigned URLs from the backend. ### One Thing Worth Noting The L2 diagram still shows `MinIO` as the object storage component. In production, this is Hetzner Object Storage (S3-compatible, same API). The diagram is technically correct at the API level — both are S3-compatible — but a future revision might note the dev/prod distinction. Not a blocker; this is well-understood by the team. Nothing blocking. Merge when ready.

marcel commented

2026-05-06 19:36:00 +02:00

📋 Elicit — Requirements Engineer & Business Analyst

Verdict: ⚠️ Approved with concerns

Architecture diagrams are a form of requirements documentation — they specify what the system does and who it serves. I'm checking whether the actors, system boundaries, and functional scope are consistently and completely represented.

What's Well Done

The L1 actor descriptions now include "transcribes" for both Administrator and Family Member — this accurately reflects the product's core use case (crowd transcription of historical Kurrent/Sütterlin letters). That verb was missing before and its absence undersold the primary user activity.

The email service is correctly marked optional in the L1/L2 relationships. Invite-based registration and notification email are real features, but they're optional dependencies — the system functions without an SMTP relay. This distinction is correctly captured.

Concerns

1. Invite-gated registration not visible at L1 (Minor)

The L1 System Context diagram shows a register endpoint in the frontend via the registerPage component, but L1 doesn't communicate that registration is invite-only. A new reader of the L1 diagram would conclude anyone can self-register. The invite constraint is architecturally significant — it shapes the user lifecycle and the security surface. Consider a note or a relationship qualifier like "Joins by invite" on the Member actor at L1. This is intentionally high-level, so I won't call it a blocker, but it's a representation gap.

2. Route name consistency: /aktivitaeten vs "Chronik" (Verify)

Frontend 3c shows the component as /aktivitaeten with description "Unified activity feed (Chronik)." CLAUDE.md lists the route as aktivitaeten/ and names the feature "Chronik." As long as /aktivitaeten is the actual SvelteKit route and "Chronik" is only the user-facing label, this is fine. But worth a quick sanity check before merge — if someone named the route /chronik at some point, the diagram would be wrong.

3. /hilfe/transkription understated (Cosmetic)

The help page description says "Static transcription style guide for Kurrent and Sütterlin character recognition. No backend calls." Accurate. But from a requirements perspective, this is the system's primary onboarding artifact for the 60+ transcriber persona — the people who do the most important work in the system. The word "static" makes it sound incidental. Not a diagram change request, just an observation about documentation framing.

## 📋 Elicit — Requirements Engineer & Business Analyst **Verdict: ⚠️ Approved with concerns** Architecture diagrams are a form of requirements documentation — they specify what the system does and who it serves. I'm checking whether the actors, system boundaries, and functional scope are consistently and completely represented. ### What's Well Done The L1 actor descriptions now include "transcribes" for both Administrator and Family Member — this accurately reflects the product's core use case (crowd transcription of historical Kurrent/Sütterlin letters). That verb was missing before and its absence undersold the primary user activity. The email service is correctly marked optional in the L1/L2 relationships. Invite-based registration and notification email are real features, but they're optional dependencies — the system functions without an SMTP relay. This distinction is correctly captured. ### Concerns **1. Invite-gated registration not visible at L1 (Minor)** The L1 System Context diagram shows a `register` endpoint in the frontend via the `registerPage` component, but L1 doesn't communicate that registration is invite-only. A new reader of the L1 diagram would conclude anyone can self-register. The invite constraint is architecturally significant — it shapes the user lifecycle and the security surface. Consider a note or a relationship qualifier like "Joins by invite" on the Member actor at L1. This is intentionally high-level, so I won't call it a blocker, but it's a representation gap. **2. Route name consistency: `/aktivitaeten` vs "Chronik" (Verify)** Frontend 3c shows the component as `/aktivitaeten` with description "Unified activity feed (Chronik)." CLAUDE.md lists the route as `aktivitaeten/` and names the feature "Chronik." As long as `/aktivitaeten` is the actual SvelteKit route and "Chronik" is only the user-facing label, this is fine. But worth a quick sanity check before merge — if someone named the route `/chronik` at some point, the diagram would be wrong. **3. `/hilfe/transkription` understated (Cosmetic)** The help page description says "Static transcription style guide for Kurrent and Sütterlin character recognition. No backend calls." Accurate. But from a requirements perspective, this is the system's primary onboarding artifact for the 60+ transcriber persona — the people who do the most important work in the system. The word "static" makes it sound incidental. Not a diagram change request, just an observation about documentation framing.

marcel commented

2026-05-06 19:36:12 +02:00

🔒 Nora "NullX" Steiner — Application Security Engineer

Verdict: ✅ Approved

I'm verifying that the security subsystem diagram (3a) accurately represents the authentication and authorization architecture, and that no security-relevant components are misrepresented or missing.

3a — Security & Authentication Review

CustomUserDetailsService authentication field: ✅ Fixed

The prior diagram said "Loads AppUser by username from DB." The new diagram correctly says "Loads AppUser by email from DB." This is security-critical accuracy — the lookup field in Spring Security's UserDetailsService determines what the Basic Auth username field is compared against. Documenting the wrong field would mislead anyone auditing the authentication flow.

SecurityConfig permitted endpoints: ✅ Accurate

The description "Permits password-reset, invite, and register endpoints without authentication" correctly reflects the actual permitAll() configuration. These are the only unauthenticated entry points to the backend. The diagram makes the security surface explicit and auditable.

PermissionAspect separation: ✅ Architecturally correct

Authentication (SecurityConfig / CustomUserDetailsService) and authorization (PermissionAspect) are shown as distinct components. This is the correct representation — they are separate beans with separate concerns. A reader can follow the flow: filter chain authenticates → AOP aspect authorizes → method executes.

CSRF disabled: ✅ Implicit, correct

The diagram notes CSRF is disabled in SecurityConfig. The reason (Basic Auth credentials in cookies with Authorization header injection in hooks.server.ts makes CSRF structurally impossible) belongs in source-level comments, not in the diagram. The diagram correctly documents the what and defers the why to the code.

One Observation (informational, not blocking)

The 3c diagram shows InviteController at /api/auth/invite and AuthController at /api/auth but doesn't list which specific endpoints are permitAll(). A reader of 3c alone wouldn't know these endpoints are unauthenticated. This is fine — 3a is the authoritative source for the security perimeter, and the cross-diagram structure handles it. Just noting the cross-diagram dependency for anyone reading 3c in isolation.

No security misrepresentations found.

## 🔒 Nora "NullX" Steiner — Application Security Engineer **Verdict: ✅ Approved** I'm verifying that the security subsystem diagram (3a) accurately represents the authentication and authorization architecture, and that no security-relevant components are misrepresented or missing. ### 3a — Security & Authentication Review **`CustomUserDetailsService` authentication field: ✅ Fixed** The prior diagram said "Loads AppUser by username from DB." The new diagram correctly says "Loads AppUser by **email** from DB." This is security-critical accuracy — the lookup field in Spring Security's `UserDetailsService` determines what the Basic Auth username field is compared against. Documenting the wrong field would mislead anyone auditing the authentication flow. **`SecurityConfig` permitted endpoints: ✅ Accurate** The description "Permits password-reset, invite, and register endpoints without authentication" correctly reflects the actual `permitAll()` configuration. These are the only unauthenticated entry points to the backend. The diagram makes the security surface explicit and auditable. **`PermissionAspect` separation: ✅ Architecturally correct** Authentication (`SecurityConfig` / `CustomUserDetailsService`) and authorization (`PermissionAspect`) are shown as distinct components. This is the correct representation — they are separate beans with separate concerns. A reader can follow the flow: filter chain authenticates → AOP aspect authorizes → method executes. **CSRF disabled: ✅ Implicit, correct** The diagram notes CSRF is disabled in `SecurityConfig`. The reason (Basic Auth credentials in cookies with Authorization header injection in `hooks.server.ts` makes CSRF structurally impossible) belongs in source-level comments, not in the diagram. The diagram correctly documents the *what* and defers the *why* to the code. ### One Observation (informational, not blocking) The 3c diagram shows `InviteController` at `/api/auth/invite` and `AuthController` at `/api/auth` but doesn't list which specific endpoints are `permitAll()`. A reader of 3c alone wouldn't know these endpoints are unauthenticated. This is fine — 3a is the authoritative source for the security perimeter, and the cross-diagram structure handles it. Just noting the cross-diagram dependency for anyone reading 3c in isolation. **No security misrepresentations found.**

marcel commented

2026-05-06 19:36:28 +02:00

🧪 Sara Holt (@saraholt) — QA Engineer & Test Strategist

Verdict: ⚠️ Approved with concerns

No production code changed — my review focuses on: (1) does the test plan adequately verify this documentation? (2) are the component descriptions precise enough to write tests from? (3) are there accuracy claims I can cross-reference?

What's Well Done

The component descriptions are precise enough to drive test design. Examples:

"Creates OcrJob and OcrJobDocument records, checks Python service health, and delegates async execution to OcrAsyncRunner" — tells a tester exactly what OcrService should be verified to do in a unit test.
"Tracks import state (IDLE/RUNNING/DONE/FAILED)" — tells a tester the four states to assert on.
"Always responds with success to prevent email enumeration" (forgot-password) — this is a behavioral requirement that should have an explicit test. Good to see it documented.

Concerns

1. Test plan is entirely manual (Minor)

Both checklist items require human execution:

"Verify all 15 Mermaid blocks render without layout errors" — no CI step validates Mermaid syntax
"Confirm no component name in any diagram is missing from the codebase" — no automated check

For a docs-only PR this is acceptable, but worth noting: npx @mermaid-js/mermaid-cli can validate Mermaid syntax in CI without a browser. One pipeline step would catch syntax regressions automatically on future docs changes.

2. No mechanism to catch documentation drift (Minor)

Several commits on this branch fixed accuracy regressions that accumulated since the original diagrams were written (email/username, /batch vs /bulk, /conversations vs /briefwechsel). The final state is correct, but there's no automated protection against the next accumulation. As new endpoints are added, C4 diagrams will silently drift again. A tracking issue or a documentation-review checklist item in the feature PR template would mitigate this.

3. Cross-diagram dependency in 3b.2 to verify (Informational)

TranscriptionQueueService in diagram 3b.2 depends on DocumentService (from 3b) and AuditLogQueryService (from 3e), while 3e's DashboardService depends back on TranscriptionService (from 3b.2). This bidirectional dependency across sub-diagrams could obscure a real coupling concern. I'd verify against the actual service code that neither of these is a circular Spring injection dependency — given the Spring Boot 4 constraint on constructor injection cycles, if this is real code it either uses @Lazy (now discouraged) or delegates through a different path.

## 🧪 Sara Holt (@saraholt) — QA Engineer & Test Strategist **Verdict: ⚠️ Approved with concerns** No production code changed — my review focuses on: (1) does the test plan adequately verify this documentation? (2) are the component descriptions precise enough to write tests from? (3) are there accuracy claims I can cross-reference? ### What's Well Done The component descriptions are precise enough to drive test design. Examples: - `"Creates OcrJob and OcrJobDocument records, checks Python service health, and delegates async execution to OcrAsyncRunner"` — tells a tester exactly what `OcrService` should be verified to do in a unit test. - `"Tracks import state (IDLE/RUNNING/DONE/FAILED)"` — tells a tester the four states to assert on. - `"Always responds with success to prevent email enumeration"` (forgot-password) — this is a behavioral requirement that should have an explicit test. Good to see it documented. ### Concerns **1. Test plan is entirely manual (Minor)** Both checklist items require human execution: - "Verify all 15 Mermaid blocks render without layout errors" — no CI step validates Mermaid syntax - "Confirm no component name in any diagram is missing from the codebase" — no automated check For a docs-only PR this is acceptable, but worth noting: `npx @mermaid-js/mermaid-cli` can validate Mermaid syntax in CI without a browser. One pipeline step would catch syntax regressions automatically on future docs changes. **2. No mechanism to catch documentation drift (Minor)** Several commits on this branch fixed accuracy regressions that accumulated since the original diagrams were written (email/username, /batch vs /bulk, /conversations vs /briefwechsel). The final state is correct, but there's no automated protection against the next accumulation. As new endpoints are added, C4 diagrams will silently drift again. A tracking issue or a documentation-review checklist item in the feature PR template would mitigate this. **3. Cross-diagram dependency in 3b.2 to verify (Informational)** `TranscriptionQueueService` in diagram 3b.2 depends on `DocumentService` (from 3b) and `AuditLogQueryService` (from 3e), while 3e's `DashboardService` depends back on `TranscriptionService` (from 3b.2). This bidirectional dependency across sub-diagrams could obscure a real coupling concern. I'd verify against the actual service code that neither of these is a circular Spring injection dependency — given the Spring Boot 4 constraint on constructor injection cycles, if this is real code it either uses `@Lazy` (now discouraged) or delegates through a different path.

marcel commented

2026-05-06 19:36:42 +02:00

🎨 Leonie Voss (@leonievoss) — UI/UX Design Lead & Accessibility Strategist

Verdict: ✅ Approved

Architecture diagrams aren't UI components, but they are information design artifacts. I'm evaluating this as an information architecture problem: can the intended audience navigate and understand these diagrams?

What Works Well

The split is a readability win. 2 monolithic 40-node diagrams → 15 focused 8–14 node diagrams. A 40-node Mermaid graph in Gitea's markdown viewer requires diagonal scrolling and mental context-switching. Sub-diagrams that fit in a single browser viewport can be scanned in seconds. This is the right call for the audience (developers navigating the codebase, not conference attendees with a full-screen projector).

The cross-diagram stub convention note is well-placed. It appears in the document header before the reader encounters any stubs. This is the correct pattern — establish the vocabulary before using it. Without this note, a reader seeing a component outside a System_Boundary block would be confused.

Section headers are scannable. "### 3a — Security & Authentication" renders as a proper heading in Gitea's markdown, giving each diagram a title that appears in the page outline. A developer who knows they want the OCR section can jump directly without reading top-to-bottom.

One Design Observation (informational, not blocking)

The decimal numbering 3b.2 and 3c.2 can be misread as version numbers or patch releases rather than sub-diagram identifiers. Someone skimming the markdown source might wonder if 3b.2 supersedes 3b or is an incremental revision of it. Flatter naming — e.g., "3c — Documents", "3d — Transcription Pipeline", "3e — Persons & Family Graph" — would eliminate this ambiguity. Worth considering on the next documentation audit, not a change request for this PR.

Rendering Consideration

Mermaid diagrams render as horizontally scrollable containers in Gitea's mobile view. The transcriber audience (60+, typically on laptop or tablet) won't encounter this — they're on wider screens. Younger family members reading on phones may need to scroll horizontally on the larger sub-diagrams. This is inherent to Mermaid rendering, not introduced by this PR, and the split into smaller diagrams actually reduces horizontal scroll width compared to the original monoliths.

## 🎨 Leonie Voss (@leonievoss) — UI/UX Design Lead & Accessibility Strategist **Verdict: ✅ Approved** Architecture diagrams aren't UI components, but they are information design artifacts. I'm evaluating this as an information architecture problem: can the intended audience navigate and understand these diagrams? ### What Works Well **The split is a readability win.** 2 monolithic 40-node diagrams → 15 focused 8–14 node diagrams. A 40-node Mermaid graph in Gitea's markdown viewer requires diagonal scrolling and mental context-switching. Sub-diagrams that fit in a single browser viewport can be scanned in seconds. This is the right call for the audience (developers navigating the codebase, not conference attendees with a full-screen projector). **The cross-diagram stub convention note is well-placed.** It appears in the document header before the reader encounters any stubs. This is the correct pattern — establish the vocabulary before using it. Without this note, a reader seeing a component outside a `System_Boundary` block would be confused. **Section headers are scannable.** "### 3a — Security & Authentication" renders as a proper heading in Gitea's markdown, giving each diagram a title that appears in the page outline. A developer who knows they want the OCR section can jump directly without reading top-to-bottom. ### One Design Observation (informational, not blocking) The decimal numbering `3b.2` and `3c.2` can be misread as version numbers or patch releases rather than sub-diagram identifiers. Someone skimming the markdown source might wonder if `3b.2` supersedes `3b` or is an incremental revision of it. Flatter naming — e.g., "3c — Documents", "3d — Transcription Pipeline", "3e — Persons & Family Graph" — would eliminate this ambiguity. Worth considering on the next documentation audit, not a change request for this PR. ### Rendering Consideration Mermaid diagrams render as horizontally scrollable containers in Gitea's mobile view. The transcriber audience (60+, typically on laptop or tablet) won't encounter this — they're on wider screens. Younger family members reading on phones may need to scroll horizontally on the larger sub-diagrams. This is inherent to Mermaid rendering, not introduced by this PR, and the split into smaller diagrams actually *reduces* horizontal scroll width compared to the original monoliths.

marcel added 1 commit 2026-05-06 19:59:40 +02:00

fix(c4): flatten decimal sub-diagram numbering; note invite gate at L1

CI / Unit & Component Tests (push) Failing after 3m42s

Details

CI / OCR Service Tests (push) Successful in 40s

Details

CI / Backend Unit Tests (push) Failing after 3m33s

Details

CI / Unit & Component Tests (pull_request) Failing after 3m38s

Details

CI / OCR Service Tests (pull_request) Successful in 1m8s

Details

CI / Backend Unit Tests (pull_request) Failing after 3m32s

Details

b60ad66816

- Rename 3b.2→3c, 3c→3d, 3c.2→3e, 3d→3f, 3e→3g to eliminate
  decimal notation that read as version numbers rather than sub-levels
- Update all seven "See diagram X" cross-references to match
- Correct backend intro: "three focused views" → "seven focused sub-diagrams"
- Add "Access by administrator invite." to L1 Family Member description
  to surface the invite-only registration constraint at the context level

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

marcel merged commit 12f0e21b21 into main

2026-05-06 20:00:16 +02:00

marcel deleted branch docs/post-refactor-accuracy-audit

2026-05-06 20:00:16 +02:00

marcel referenced this issue from a commit

2026-05-06 20:00:17 +02:00

docs(c4): fix diagram 3c service layer and add missing 3e components

marcel referenced this pull request

2026-05-06 23:49:43 +02:00

docs(db): add ER and ORM diagrams (PlantUML) #452

Sign in to join this conversation.

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: marcel/familienarchiv#448

docs(c4): accuracy audit — split L3 diagrams, add 6 new sub-diagrams, fix all stale content #448

Summary

Diagrams added or changed

Test plan

🏗️ Markus Keller — Senior Application Architect

Blockers

Suggestions

👨‍💻 Felix Brandt — Senior Fullstack Developer

Blockers

Suggestions

🚀 Tobias Wendt — DevOps & Platform Engineer

📋 Elicit — Requirements Engineer

Blockers

Suggestions

🔐 Nora "NullX" Steiner — Application Security Engineer

What's documented well

Suggestion (non-blocking)

🧪 Sara Holt — Senior QA Engineer

Blockers

Suggestions

🎨 Leonie Voss — UX Designer & Accessibility Strategist

Resolution — Blocker fixes (commit 42c6e58d)

Blocker 1 + 2 — Diagram 3c: GroupController and TagController service layer

Blocker 3 — Diagram 3e: missing DashboardController and StatsController outgoing rel

Sara's concern — verification method for component name accuracy

🏗️ Markus Keller — Senior Application Architect

Blockers

Suggestions

👨‍💻 Felix Brandt — Senior Fullstack Developer

Blockers

Suggestions

🚀 Tobias Wendt — DevOps & Platform Engineer

What I checked

📋 Elicit — Requirements Engineer

What I checked

Suggestions (non-blocking)

🔐 Nora "NullX" Steiner — Application Security Engineer

What I checked

Suggestions (non-blocking)

🧪 Sara Holt — QA Engineer & Test Strategist

What I checked

Suggestions (non-blocking)

🎨 Leonie Voss — UX Designer & Accessibility Strategist

What I noted

Resolution — Round 2 blocker fixes (commit 83c3d85b)

Blocker — Diagram 3b: docSvc → personRepo/tagRepo direct relationships

Suggestion (Felix) — Diagram 3b.2: services routing directly to db

🏗️ Markus Keller — Senior Application Architect

Blockers

Suggestions

👨‍💻 Felix Brandt — Senior Fullstack Developer

Blockers

Suggestions

🔧 Tobias Wendt — DevOps & Platform Engineer

What I checked

One note (not a blocker)

📋 Elicit — Requirements Engineer

What I checked

One open question

🔐 Nora "NullX" Steiner — Application Security Engineer

What I checked

🧪 Sara Holt — QA Engineer

What I checked

🎨 Leonie Voss — UI/UX Design Lead

What I verified

Resolution — Round 3 blocker and suggestion fixes

Blocker — Diagram 3d: OcrAsyncRunner description vs. relationship contradiction (commit 35d82814)

Suggestion — Diagram 3c.2: RelationshipService/RelationshipInferenceService direct db arrows (commit a20a46b2)

Suggestion — Diagram frontend 3b: incorrect bulk-edit endpoint path (commit 632fb9ef)

Suggestion — Diagram backend 3a: secFilter→permAspect label (commit e3bfbde8)

Suggestion — Diagram frontend 3d: /hilfe/transkription actor (commit bbded5b9)

🏛️ Markus Keller (@mkeller) — Senior Application Architect

Blockers

Suggestions

Positives

👨‍💻 Felix Brandt (@felixbrandt) — Senior Fullstack Developer

No Blockers

Suggestions

Positives

🛠️ Tobias Wendt (@tobiwendt) — DevOps & Platform Engineer

Resolution — Blocker fixes (commit `42c6e58d`)

Blocker 1 + 2 — Diagram 3c: `GroupController` and `TagController` service layer

Blocker 3 — Diagram 3e: missing `DashboardController` and `StatsController` outgoing rel

Resolution — Round 2 blocker fixes (commit `83c3d85b`)

Blocker — Diagram 3b: `docSvc → personRepo/tagRepo` direct relationships

Suggestion (Felix) — Diagram 3b.2: services routing directly to `db`

Blocker — Diagram 3d: OcrAsyncRunner description vs. relationship contradiction (commit `35d82814`)

Suggestion — Diagram 3c.2: RelationshipService/RelationshipInferenceService direct db arrows (commit `a20a46b2`)

Suggestion — Diagram frontend 3b: incorrect bulk-edit endpoint path (commit `632fb9ef`)

Suggestion — Diagram backend 3a: `secFilter→permAspect` label (commit `e3bfbde8`)

Suggestion — Diagram frontend 3d: `/hilfe/transkription` actor (commit `bbded5b9`)

Blocker — Diagram 3e: DashboardService missing cross-domain relationships (commit `3fafaf29`)

Suggestion — Diagram 3d: OcrAsyncRunner → bare db arrow (commit `83f4f835`)

Security smell — Diagram 3a: secFilter BCrypt description (commit `61ca5aac`)

Concern — L1 + L2: Email Service not shown as external system (commit `818246a2`)

Blocker — `docBulkEdit` component description: `/batch` → `/bulk` (commit `318d83d2`)

Blocker — `loginPage` component description: `username:password` → `email:password` (commit `af24d631`)

Blocker — Sequence diagram: 3 stale username references (commit `caf86b32`)

Suggestion — `stammbaum` description: remove D3 implementation detail (commit `9878810a`)

Suggestion — Cross-diagram stub convention note (commit `7ac39541`)