np-dms/lcbp3

Fork 0

Files

T

admin 961ee72343

CI / CD Pipeline / build (push) Failing after 4m53s

Details

CI / CD Pipeline / deploy (push) Has been skipped

Details

690406:2310 Done Task BE-ERR-01

2026-04-06 23:10:56 +07:00

29 KiB

Raw Blame History

ADR-020: AI Intelligence Integration Architecture

Status: Proposed Date: 2026-04-03 Version: 1.8.5 Review Cycle: Core ADR (Review every 6 months or Major Version upgrade) Decision Makers: Development Team, AI Integration Lead, System Architect Gap Resolution: Addresses requirement for automated document processing efficiency (Product Vision v1.8.5, Section 3.2) and acceptance criteria for AI-assisted metadata extraction (UAT Criteria, Section 4.7) Version Dependency:

Effective From: v1.9.0
Applies To: v1.9.0+ (Full implementation)
Backward Compatible: v1.8.5 (API endpoints only)
Required For: v2.0.0 (Core AI features)

Related Documents:

ADR-017: Ollama Data Migration Architecture
ADR-017B: AI Document Classification
ADR-018: AI Boundary Policy — AI Physical Isolation
ADR-019: Hybrid Identifier Strategy — UUID Strategy
n8n Migration Setup Guide

หมายุ: ADR-020 กำหนดสถาปัตยกรรมการผสานรวม AI Intelligence เข้ากับระบบ NAP-DMS แบบครบวงจร โดยใช้แนวทาง "RFA-First" เพื่อให้ครอบคลุมทั้งการนำเข้าเอกสารเก่า (Legacy Migration) และการสร้างเอกสารใหม่ (New Ingestion)

Context and Problem Statement

ปัญหาที่ต้องการแก้ไข

ระบบ NAP-DMS v1.8.5 ต้องการเพิ่มประสิทธิภาพการทำงานกับเอกสารวิศวกรรมโดยใช้ AI Intelligence ใน 2 สถานการณ์หลัก:

Legacy Document Migration: มีเอกสาร PDF เก่าจำนวนมากที่ต้องนำเข้าระบบ พร้อมตรวจสอบความถูกต้องระหว่าง Metadata ใน Excel กับเนื้อหาใน PDF
New Document Ingestion: ผู้ใช้งานอัปโหลดเอกสารใหม่และต้องการความช่วยเหลือจาก AI ในการสกัดข้อมูลอัตโนมัติ

ข้อจำกัดและข้อกำหนด

Security (ADR-018): AI ต้องรันบน Admin Desktop (Desk-5439) แยกส่วนกับระบบหลัก
Data Privacy: ห้ามส่งข้อมูลขึ้น Cloud Provider ต้องประมวลผลภายในองค์กรเท่านั้น
Human-in-the-Loop: ข้อมูลที่ AI สกัดต้องผ่านการตรวจสอบโดยมนุษย์เสมอ
Thai Language Support: ต้องรองรับเอกสารภาษาไทยและวิศวกรรม

Decision Drivers

RFA-First Approach: เริ่มจากเอกสาร RFA (Request for Approval) ที่มีความซับซ้อนสูง
Unified Architecture: ใช้ Pipeline และ Component ร่วมกันทั้ง 2 รูปแบบการทำงาน
Data Integrity: รักษาความถูกต้องของข้อมูลเป็นสำคัญสูงสุด
User Experience: จัดหมวดหมู่ระหว่าง Batch Throughput กับ Real-time UX
Cost Efficiency: ใช้ Ollama แบบ On-Premise เพื่อลดต้นทุน
Maintainability: แยก Logic ของ AI ออกจาก Core Application

Considered Options

Option 1: Separate AI Systems per Use Case

Pros:

✅ เชี่ยวชาญเฉพาะทาง (Specialized)
✅ แยก Failure Domain

Cons:

❌ Code Dupification สูง
❌ บำรุงรักษายาก (Multiple systems)
❌ Inconsistent AI Behavior

Option 2: Unified AI Pipeline with Different Frontends ⭐ (Selected)

Pros:

✅ Single Source of Truth: Pipeline กลางเดียว
✅ Reusable Components: DocumentReviewForm ใช้ร่วมกันได้
✅ Consistent Quality: Prompt และ Model เดียวกัน
✅ Easier Maintenance: แก้ไขที่เดียว ใช้ได้ทั้งหมด
✅ Cost Effective: ใช้ Ollama รุ่นเดียว (Gemma 4)

Cons:

❌ ต้องออกแบบให้รองรับทั้ง Batch และ Real-time
❌ Complex Component Design

Decision Outcome

Chosen Option: Option 2 — Unified AI Pipeline with Different Frontends

Rationale:

การสร้าง Pipeline กลางเดียวสำหรับ AI และใช้ Component ร่วมกันทาง Frontend จะช่วยลดความซับซ้อนในการบำรุงรักษา และรับประกันความสม่ำเสมอของคุณภาพ AI ทั้งในการนำเข้าเอกสารเก่าและใหม่

Impact Analysis

Affected Components

Component	Impact Level	Description
Backend Architecture	High	New AiModule, MigrationService, database schema changes
Frontend Components	Medium	DocumentReviewForm, Migration Dashboard enhancements
Infrastructure	High	Admin Desktop AI services, n8n workflows, Docker setup
Security Model	Medium	ADR-018 boundary enforcement, new API endpoints
Database Schema	Medium	migration_logs table, ai_audit_logs table
API Layer	Medium	New AI endpoints, authentication scopes
Testing Framework	Medium	AI accuracy tests, integration tests
Documentation	Low	User guides, admin procedures

Required Changes

Change Category	Specific Changes	Priority
Database	Create `migration_logs` table (SQL First) Create `ai_audit_logs` table Update data dictionary	Critical
Backend	Implement AiModule with n8n integration Create MigrationService with business logic Add AI endpoints with CASL guards Update validation layer for AI responses	Critical
Frontend	Build DocumentReviewForm reusable component Create Admin Migration Dashboard Integrate AI suggestions in RFA form Add confidence score indicators	High
Infrastructure	Setup n8n on Admin Desktop (Desk-5439) Deploy Ollama with Gemma 4 Configure PaddleOCR service Setup Docker containers	Critical
Security	Implement ADR-018 AI boundaries Add AI-specific authentication scopes Create audit logging for AI interactions Setup rate limiting for AI endpoints	Critical
Testing	AI accuracy validation tests End-to-end pipeline tests Security boundary verification Performance benchmarking	High
Documentation	Admin workflow procedures User AI assistance guide Troubleshooting procedures API documentation updates	Medium

Cross-Component Dependencies

Dependency	Source	Target	Impact
AI Service → Database	AiService extraction calls	migration_logs table	Data persistence
Frontend → AI Gateway	DocumentReviewForm	/api/ai/extract endpoint	Real-time suggestions
n8n → Backend API	AI workflows	Validation endpoints	Human-in-the-loop
Admin Desktop → QNAP NAS	AI services	DMS backend API	Security boundary
Migration Service → Storage	Batch processing	File storage system	Document handling

Architecture Overview

Core Technology Stack

Component	Technology	Host	Purpose
AI Engine	Ollama + Gemma 4	Admin Desktop (Desk-5439)	LLM Inference
OCR Engine	PaddleOCR	Admin Desktop (Desk-5439)	Thai/English Text Extraction
Orchestrator	n8n	QNAP NAS (Docker)	Workflow Management
AI Gateway	NestJS AiModule	QNAP NAS (Docker)	API Gateway & Validation

Data Flow Architecture

┌─────────────────────────────────────────────────────────────────┐
│                    AI Processing Flow                           │
├─────────────────────────────────────────────────────────────────┤
│                                                                 │
│  ┌─────────────┐    ┌─────────────┐    ┌─────────────────┐     │
│  │   Input     │───▶│    n8n      │───▶│  AI Services    │     │
│  │  (PDF/Excel)│    │  Workflow   │    │ (OCR+LLM)       │     │
│  └─────────────┘    └─────────────┘    └────────┬────────┘     │
│                                              │                 │
│                                              ▼                 │
│  ┌─────────────────────────────────────────────────────────┐   │
│  │              DMS Backend API                           │   │
│  │  ┌─────────────┐    ┌─────────────┐    ┌────────────┐  │   │
│  │  │AiService    │    │Validation   │    │Audit Log   │  │   │
│  │  │Gateway      │◀───│Layer        │◀───│Service     │  │   │
│  │  └─────────────┘    └─────────────┘    └────────────┘  │   │
│  └─────────────────────────────────────────────────────────┘   │
│                              │                                 │
│                              ▼                                 │
│  ┌─────────────────────────────────────────────────────────┐   │
│  │                 Frontend Layer                           │   │
│  │  ┌─────────────────────┐    ┌─────────────────────┐     │   │
│  │  │  Migration Dashboard │    │  Document Review   │     │   │
│  │  │      (Admin)        │    │     Form (User)     │     │   │
│  │  └─────────────────────┘    └─────────────────────┘     │   │
│  └─────────────────────────────────────────────────────────┘   │
└─────────────────────────────────────────────────────────────────┘

Implementation Modules

Backend Components

1. AiModule & AiService

@Injectable()
export class AiService {
  // Single entry point for all AI operations
  async extractMetadata(fileId: string): Promise<AiExtractionResult> {
    // 1. Send to n8n workflow
    // 2. Wait for OCR + LLM processing
    // 3. Validate results
    // 4. Return structured data
  }

  async validateExtraction(result: AiExtractionResult): Promise<ValidationResult> {
    // Confidence scoring, enum validation, audit logging
  }
}

2. Migration Entity

CREATE TABLE migration_logs (
  id INT AUTO_INCREMENT PRIMARY KEY,
  publicId BINARY(16) DEFAULT (UUID_TO_BIN(UUID(), 1)),
  source_file VARCHAR(255) NOT NULL,
  source_metadata JSON, -- Data from Excel
  ai_extracted JSON, -- Data from AI
  confidence_score DECIMAL(3,2),
  status ENUM('PENDING_REVIEW', 'APPROVED', 'REJECTED'),
  reviewed_by INT,
  reviewed_at TIMESTAMP NULL,
  created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
  updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP ON UPDATE CURRENT_TIMESTAMP
);

3. API Endpoints

Endpoint	Purpose	Access
`POST /api/ai/extract`	Real-time extraction	Authenticated Users
`POST /api/migration/batch`	Batch migration	Admin Only
`GET /api/migration/queue`	Review queue	Admin Only
`POST /api/migration/commit`	Commit approved items	Admin Only

Frontend Components

1. DocumentReviewForm (Reusable Component)

interface DocumentReviewFormProps {
  // Source: Migration Table or AI API Response
  sourceData: MigrationItem | AiExtractionResult;
  // Mode: 'migration' | 'new'
  mode: 'migration' | 'new';
  onSubmit: (validatedData: ValidatedDocument) => void;
}

// Features:
// - Highlight AI-suggested fields
// - Show confidence scores
// - Allow human correction
// - Track feedback for AI improvement

2. Migration Dashboard (Admin)

// Features:
// - Filter by confidence level
// - Bulk approve/reject
// - Compare source vs AI data
// - Export review reports

Workflow Specifications

Workflow 1: Legacy Migration (Batch Processing)

Input: Excel Metadata + PDF Files
  │
  ▼
n8n Workflow:
  1. Read Excel row
  2. Send PDF to PaddleOCR
  3. Extract Thai/English text
  4. Send text + metadata to Gemma 4
  5. AI validates consistency
  6. Generate confidence score
  7. Store in migration_logs (PENDING_REVIEW)
  │
  ▼
Output: Migration Dashboard for Admin Review
  │
  ▼
Action: Admin approves → Commit to permanent storage

Workflow 2: New Ingestion (Real-time Processing)

Input: User uploads PDF in RFA creation form
  │
  ▼
n8n Workflow (Real-time):
  1. OCR extraction (PaddleOCR)
  2. AI analysis (Gemma 4)
  3. Return suggestions to frontend
  │
  ▼
Output: Form auto-fill with AI suggestions
  │
  ▼
Action: User reviews/edits → Saves to database

AI Model Configuration

Gemma 4 Prompt Strategy

You are an AI assistant for Laem Chabang Phase 3 construction project document analysis.

TASK: Extract and validate metadata from engineering documents.

RULES:
1. Extract: Subject, Date, Discipline, Drawing Reference, Contract Number
2. Validate: Check consistency between provided metadata and document content
3. Confidence: Rate accuracy (0-100%) for each extracted field
4. Language: Support Thai and English engineering terms
5. Format: Return structured JSON

OUTPUT FORMAT:
{
  "extracted_metadata": {
    "subject": "...",
    "date": "YYYY-MM-DD",
    "discipline": "Civil|Mechanical|Electrical|Architectural",
    "drawing_reference": "...",
    "contract_number": "..."
  },
  "validation": {
    "is_consistent": true|false,
    "discrepancies": ["..."],
    "confidence_score": 0.95
  }
}

Confidence Scoring Strategy

Score Range	Action
95-100%	Auto-approve (migration only)
85-94%	Low priority review
60-84%	High priority review
< 60%	Reject / Requires manual entry

Security & Compliance

AI Boundary Enforcement (ADR-018)

Rule	Implementation
Physical Isolation	AI runs on Admin Desktop only
No Direct DB Access	All communication via DMS API
API Authentication	JWT tokens with `ai:invoke` scope
Audit Logging	Every AI interaction logged
Human Validation	No auto-commit without review

Data Privacy Measures

Local Processing Only: No data leaves corporate network
Temporary Storage: AI processes data in memory only
Encryption: All API calls use TLS 1.3
Data Retention: AI logs retained for 90 days only

Implementation Roadmap

Phase 1: Pipeline Infrastructure (Task BE-AI-01)

Week 1-2: AI Pipeline Foundation

Docker Environment Setup on Admin Desktop (Desk-5439)
- n8n service with Basic Authentication
- Ollama with Gemma 4 model (GPU optimized)
- PaddleOCR service with Thai language support
n8n Workflow Development
- Webhook trigger for DMS integration
- OCR → AI → JSON processing pipeline
- Error handling and retry logic
Prompt Engineering
- Thai engineering document templates
- JSON schema validation
- Confidence scoring implementation
Integration Testing
- End-to-end pipeline validation
- Security boundary verification
- Performance benchmarking

Phase 2: Backend AI Gateway (Task BE-AI-02)

Week 3-4: NestJS Integration Layer

Database Schema Design (SQL First)
- migration_logs table with UUIDv7 primary keys
- ai_audit_logs for performance tracking
- Data dictionary updates
AI Module Architecture
- AiService with n8n webhook integration
- MigrationService for business logic
- Validation layer with confidence thresholds
API Endpoints & Security
- Admin migration endpoints with CASL guards
- Real-time extraction endpoint (/api/ai/extract)
- Idempotency and rate limiting implementation
Configuration Management
- Service account authentication
- Environment variables for AI endpoints
- Monitoring and logging setup

Phase 3: Frontend Human-in-the-Loop (Task FE-AI-03)

Week 5-6: User Experience & Validation

Reusable AI Components
- AiSuggestionField with confidence indicators
- DocumentComparisonView with PDF sidebar
- Client-side validation with Zod + React Hook Form
Admin Migration Dashboard
- Paginated table with filtering/sorting
- Bulk actions for high-confidence items
- Error logging and retry mechanisms
Real-time Ingestion Integration
- RFA creation flow enhancement
- AI processing state indicators
- Auto-fill with user override capability
Human-AI Feedback Loop
- User correction tracking
- Performance analytics dashboard
- Accuracy metrics and reporting

Phase 4: Testing & Deployment

Week 7-8: Production Readiness

Comprehensive Testing
- Thai/English document validation
- Confidence scoring accuracy verification
- Load testing and performance optimization
Security Audit
- ADR-018 boundary verification
- Penetration testing of AI endpoints
- Data privacy compliance check
User Training & Documentation
- Admin workflow training
- User guide for AI-assisted document creation
- Troubleshooting and support procedures
Production Deployment
- Blue-green deployment strategy
- Monitoring and alerting setup
- Rollback procedures and contingency plans

Success Metrics

Technical Performance Metrics

Metric	Target	Measurement Method
Thai OCR Accuracy	>90%	Character-by-character comparison with ground truth
AI JSON Validity	100%	Automated validation of all AI responses
Processing Time	<15s/document	End-to-end timing from upload to suggestion
GPU Memory Usage	<6GB per doc	Resource monitoring on Admin Desktop
System Uptime	>99%	Service availability monitoring

Business Impact Metrics

Metric	Target	Measurement Method
Data Entry Time Reduction	70%	Time comparison manual vs AI-assisted workflows
AI Accuracy Rate	85%+	Human verification of AI extractions
Migration Throughput	1000 docs/day	Batch processing capacity with admin review
User Correction Rate	<15%	Percentage of AI suggestions modified by users
Admin Productivity	3x improvement	Documents processed per admin hour

User Experience Metrics

Metric	Target	Measurement Method
User Satisfaction	4.0/5.0	Post-deployment user survey
Task Completion Rate	>95%	Successful document creation rate
Learning Curve	<30 min	Time to proficiency for new users
Error Rate	<2%	Failed AI extractions requiring manual intervention

Security & Compliance Metrics

Metric	Target	Measurement Method
Security Incidents	0	Audit log monitoring and breach detection
Data Privacy Compliance	100%	Adherence to ADR-018 and PDPA requirements
Audit Trail Completeness	100%	All AI interactions logged and traceable
API Response Times	<200ms	DMS API performance under load

Risk Assessment & Mitigation

🔴 High-Risk Items

Risk	Impact	Probability	Mitigation Strategy
AI Accuracy on Thai Documents	High	Medium	Custom prompt engineering + Extensive testing with Thai engineering docs
Admin Desktop Hardware Failure	High	Low	Backup desktop ready + Cloud AI fallback plan (emergency only)
Data Privacy Violations	Critical	Low	Strict ADR-018 enforcement + Regular security audits
Performance Bottlenecks	Medium	Medium	Queue system + GPU monitoring + Load balancing

🟡 Medium-Risk Items

Risk	Impact	Probability	Mitigation Strategy
User Adoption Resistance	Medium	Medium	Comprehensive training + UI/UX optimization + Early user involvement
Thai OCR Quality Issues	Medium	Medium	Multiple OCR engines + Manual correction workflow
Integration Complexity	Medium	Low	Phased deployment + Extensive testing + Rollback procedures
Cost Overruns	Medium	Low	On-premise AI eliminates per-use costs

🟢 Low-Risk Items

Risk	Impact	Probability	Mitigation Strategy
Technology Stack Changes	Low	Low	Containerized deployment + Version pinning
Vendor Dependency	Low	Low	Open-source stack + Multiple model options
Regulatory Changes	Medium	Low	Flexible architecture + Compliance monitoring

Architecture Decision Records

ADR-017: Ollama Data Migration — Foundation migration architecture
ADR-017B: Smart Categorization — AI categorization use cases
ADR-018: AI Boundary Policy — Security isolation requirements (CRITICAL)
ADR-019: Hybrid Identifier Strategy — UUID usage patterns (CRITICAL)

Implementation Tasks

Task BE-AI-01: Pipeline Infrastructure Setup — n8n + PaddleOCR + Gemma 4 setup
Task BE-AI-02: Backend AI Gateway Development — NestJS integration layer
Task FE-AI-03: Frontend Human-in-the-Loop Interface — User experience and validation

Technical Specifications

03-05-n8n-migration-setup-guide.md — n8n configuration details
05-02-backend-guidelines.md — NestJS patterns and conventions
05-03-frontend-guidelines.md — Next.js patterns and UI standards
03-01-data-dictionary.md — Field definitions and business rules

Compliance & Security

ADR-016: Security & Authentication — Overall security framework
04-08-release-management-policy.md — Deployment procedures

ADR Review Cycle

Review Classification

Core ADR Status: This ADR is classified as a Core Architecture Decision due to its fundamental impact on system architecture and security boundaries.

Review Schedule

Review Type	Frequency	Trigger	Scope
Regular Review	Every 6 months	Calendar-based	Validity assessment, performance metrics
Major Version Review	Every major version (v2.0.0, v3.0.0)	Version planning	Architecture relevance, compatibility
Security Review	Annually or after security incident	Security audit	ADR-018 compliance, threat model
Technology Review	As needed	Tech stack changes	AI model updates, infrastructure changes

Review Process

Phase 1: Preparation (1 week before review)

Metrics Collection
- AI accuracy rates and trends
- Performance benchmarks vs targets
- Security incident reports
- User feedback and satisfaction scores
- Technology stack currency assessment
Stakeholder Notification
- Development Team
- AI Integration Lead
- System Architect
- Security Team
- Product Management

Phase 2: Review Meeting (2-hour session)

Current State Assessment
- Review success metrics achievement
- Identify gaps or deviations from original decision
- Assess technology relevance and currency
Impact Evaluation
- Measure actual vs expected business impact
- Evaluate implementation challenges
- Identify unintended consequences
Future Considerations
- Emerging AI technologies
- Changing business requirements
- Scalability concerns
- Security landscape changes

Phase 3: Decision & Documentation (1 week after review)

Review Outcomes
- No Change: ADR remains valid and effective
- Update Required: Minor adjustments to implementation
- Supersede: New ADR created to replace this one
- Retire: ADR no longer relevant
Documentation Updates
- Update review date and findings
- Add new version notes
- Link to related ADRs if created
- Update implementation roadmap

Review Criteria

Criterion	Question	Pass/Fail Threshold
Effectiveness	Is AI achieving target accuracy (>85%)?	Pass: ≥85%, Fail: <85%
Performance	Are processing times within targets (<15s)?	Pass: ≤15s, Fail: >15s
Security	Is ADR-018 compliance maintained?	Pass: 100% compliant, Fail: Any violation
Adoption	Are users utilizing AI features?	Pass: >70% adoption, Fail: ≤70%
Maintainability	Is system supportable with current resources?	Pass: Yes, Fail: Requires additional resources
Technology Currency	Are AI models and infrastructure up-to-date?	Pass: Current version, Fail: >1 version behind

Review History Template

## Review Cycle [YYYY-MM-DD]

**Review Type:** [Regular/Major Version/Security/Technology]
**Reviewers:** [Names and roles]
**Duration:** [Meeting date]

### Findings
- [Key findings from metrics and assessment]

### Issues Identified
- [Problems or concerns discovered]

### Recommendations
- [Action items and decisions]

### Outcome
- [No Change/Update Required/Supersede/Retire]

### Next Review Date
- [YYYY-MM-DD]

Document History

Version	Date	Author	Changes
1.8.5	2026-04-03	AI Integration Lead	Initial ADR — AI Intelligence Integration Architecture
1.8.6	2026-04-03	Tech Lead	Updated — Aligned with detailed task specifications and implementation requirements
1.8.7	2026-04-04	System Architect	Enhanced — Added Impact Analysis template, ADR Review Cycle process, Gap Linking to requirements, and Version Dependency tracking

Last Updated: 2026-04-04 Status: Proposed Review Date: 2026-04-10 Implementation Target: v1.9.0 Next Review Date: 2026-10-04 (6-month regular review)

29 KiB Raw Blame History