690514:2019 204-rfa-approval-refactor #01

2026-05-14 20:19:21 +07:00
parent 07cc6d47b1
commit 0240d80da5
183 changed files with 20050 additions and 1017 deletions
@@ -0,0 +1,218 @@
+# Phase 0 Research: RFA Approval System Refactor
+
+**Date**: 2026-05-11  
+**Purpose**: Research technical patterns and validate design decisions
+
+---
+
+## Research Topics
+
+### 1. Parallel Review in Workflow Engine
+
+**Research Task**: Can Unified Workflow Engine (ADR-001) support parallel tasks with consensus rules?
+
+**Decision**: ✅ **Yes, with DSL Extension (Lead Consolidation)**
+
+**Rationale**:
+- Current DSL supports sequential states and transitions
+- Parallel review requires: (a) Task splitting on state entry, (b) Task completion by all Disciplines, (c) **Lead Discipline Consolidation** - Lead reviews all comments and makes the final decision
+- Pattern: `ParallelReviewState` - enters sub-workflows for each Discipline, aggregates on completion of all, then enables Lead review step
+
+**Implementation Pattern**:
+```typescript
+// DSL Extension: Parallel Review with Consolidation
+{
+  type: 'parallel_review',
+  config: {
+    splitBy: 'discipline',
+    requiredDisciplines: 'all',  // Must wait for all to finish
+    consolidator: 'leadDiscipline', // Final decision maker
+    visibility: 'full' // Transparency enabled
+  }
+}
+```
+
+**Alternatives Considered**:
+- Option A: Majority with Veto (rejected - too rigid for complex engineering projects)
+- Option B: Sequential with fast-forward (rejected - doesn't truly parallelize)
+- Option C: **Lead Consolidation in DSL** (selected - provides expert review and flexibility)
+
+**References**: 
+- BPMN 2.0 Parallel Gateway pattern
+- Existing `workflow-dsl.schema.ts` in codebase
+
+---
+
+### 2. Response Code Matrix Storage
+
+**Research Task**: Best structure for Master Approval Matrix with 5 categories × 11 codes?
+
+**Decision**: **Normalized Relational Model with JSON for flexibility**
+
+**Rationale**:
+- Core codes (1A-1G, 2, 3, 4) are stable relational data
+- Category mappings (which codes apply to which doc types) need flexibility
+- Project overrides need inheritance tracking
+
+**Schema Design**:
+```sql
+-- Core Response Codes (stable)
+response_codes (id, code, sub_status, description_th, description_en, category)
+
+-- Matrix Rules (project-specific overrides)
+response_code_rules (
+  id, 
+  project_id NULLABLE,  -- NULL = global default
+  document_type_id,
+  response_code_id,
+  is_enabled,
+  requires_comments,
+  triggers_notification,
+  parent_rule_id  -- For inheritance tracking
+)
+```
+
+**Alternatives Considered**:
+- Single JSON column for entire matrix (rejected - hard to query, validate, index)
+- Full EAV (Entity-Attribute-Value) (rejected - too complex for this use case)
+
+---
+
+### 3. Delegation Pattern & Circular Detection
+
+**Research Task**: Best approach for delegation with chain depth limit and circular detection?
+
+**Decision**: **Simple Adjacency List, Max Depth = 1 (Single Level Only)**
+
+**Rationale**:
+- Adjacency List: Simple, fast for immediate lookup (`delegator_id → delegatee_id`)
+- **Single Level Only**: Prevents accountability loss and complex chain management
+- Circular Detection: Trivial check (`A -> B -> A`)
+
+**Circular Detection Logic**:
+```typescript
+function detectCircularDelegation(delegatorId: string, proposedDelegateeId: string): boolean {
+  // Check if proposedDelegatee has already delegated to delegatorId
+  const existing = getActiveDelegation(proposedDelegateeId);
+  return existing?.delegateeId === delegatorId;
+}
+```
+
+**Alternatives Considered**:
+- Max Depth 3 (rejected - too complex for standard accountability)
+- Closure Table (rejected - overkill for simple chains)
+
+**Alternatives Considered**:
+- Nested Set Model (rejected - overkill for simple chains)
+- Closure Table (rejected - requires maintenance on delegation expiry)
+
+---
+
+### 4. BullMQ Pattern for Reminders & Distribution
+
+**Research Task**: Best BullMQ patterns for scheduled reminders and async distribution?
+
+**Decision**: **Delayed Jobs + Repeatable Jobs + Flows**
+
+**Pattern Breakdown**:
+
+**Reminders**:
+- **Delayed Jobs**: Schedule individual reminder at due date
+- **Repeatable Jobs**: Daily reminder for overdue items (cron pattern)
+- **Job Data**: `{ rfaId, reviewerId, reminderType, escalationLevel }`
+
+**Distribution**:
+- **Job Flow**: Parent (distribution coordinator) → Children (individual deliveries)
+- **Retry**: 3 attempts with exponential backoff
+- **Dead Letter**: Failed distributions logged for manual intervention
+
+```typescript
+// Reminder Queue Pattern
+await reminderQueue.add('rfa-reminder', {
+  rfaRevisionId,
+  reviewerId,
+  reminderType: 'DUE_SOON'
+}, {
+  delay: calculateDelay(dueDate, reminderDaysBefore)
+});
+
+// Distribution Flow Pattern
+await distributionFlow.add({
+  name: 'rfa-distribution',
+  data: { rfaId, responseCode, recipients: [...] },
+  children: recipients.map(r => ({
+    name: 'deliver-document',
+    data: { recipientId: r.id, method: r.deliveryMethod }
+  }))
+});
+```
+
+**Alternatives Considered**:
+- node-cron for scheduling (rejected - no persistence, no retry)
+- Custom scheduler service (rejected - BullMQ already provides this)
+
+---
+
+### 5. Review Task Status Aggregation
+
+**Research Task**: How to efficiently calculate aggregate status for parallel reviews?
+
+**Decision**: **Materialized View + Real-time Counter**
+
+**Rationale**:
+- Materialized View: Fast reads for list views ("2 of 3 approved")
+- Real-time Counter: Immediate update on each review action
+- Trigger: Update counter on ReviewTask status change
+
+**Aggregation Logic**:
+```sql
+-- Materialized view for fast reads
+CREATE VIEW review_task_summary AS
+SELECT 
+  rfa_revision_id,
+  COUNT(*) as total_disciplines,
+  SUM(CASE WHEN status = 'COMPLETED' THEN 1 ELSE 0 END) as completed,
+  GROUP_CONCAT(discipline_id ORDER BY discipline_id) as discipline_list
+FROM review_tasks
+GROUP BY rfa_revision_id;
+
+-- Lead Consolidation Check
+-- RFA moves to 'Lead Consolidation' step only when completed = total_disciplines
+```
+
+**Alternatives Considered**:
+- Calculate on-demand (rejected - slow with many disciplines)
+- Application-level cache (rejected - stale data risk)
+
+**Alternatives Considered**:
+- Calculate on-demand (rejected - slow with many disciplines)
+- Application-level cache (rejected - stale data risk)
+
+---
+
+## Summary of Decisions
+
+| Topic | Decision | Key Rationale |
+|-------|----------|---------------|
+| Parallel Review | DSL Lead Consolidation | Expert-driven summaries, flexibility |
+| Response Code Storage | Normalized + JSON | Balance of structure and flexibility |
+| Delegation | Adjacency List (Level 1) | Accountability, simple circular detection |
+| Queue Pattern | BullMQ Delayed + Flows | Industry standard, reliable |
+| Status Aggregation | Materialized View + Counter | Fast reads, real-time updates |
+
+---
+
+## Risk Assessment
+
+| Risk | Probability | Mitigation |
+|------|-------------|------------|
+| DSL Parallel Gateway complexity | Medium | Prototype with simple 2-discipline case first |
+| Response Code migration from existing | Low | New tables, existing data untouched |
+| Performance on large Review Teams | Low | Pagination on aggregation, Redis caching |
+| Circular delegation algorithm | Low | Unit test with 3-level chains |
+
+---
+
+## Next Phase
+
+**Phase 1**: Design data model and API contracts based on these research decisions.