
Roadmap note: convert vague bullets without acceptance criteria into checkbox tasks. Format: - [ ] <Task> (Target: <Q/year>).

Training Module Roadmap

Current Status

v1.6.0 – AdaLoRA (adaptive rank pruning), LoRAAdapterMerger (TIES + linear), and LoRA+ (asymmetric LR) added. Full LoRA fine-tuning toolchain including checkpoint/resume, adapter versioning, quantization (QLoRA), and multi-GPU training.

Completed ✅

  • LegalAutoLabeler – automated training sample generation from legal documents via NLP modality extraction
  • labelAll(), labelDocument(), labelQuery() APIs
  • AQL query executor wired to labelAll() and labelQuery() for DB document-ID fetch (v1.7.0) — auto_labeler.cpp
  • Low-confidence sample flagging and human-review queue
  • updateSampleConfidence() for recording human review decisions
  • German (de) and multi-language support
  • IncrementalLoRATrainer – full LoRA lifecycle (train, evaluate, deploy, rollback)
  • INITIAL and INCREMENTAL training modes
  • Checkpoint save and resume (resumeFromCheckpoint)
  • Adapter version management (deployVersion, rollbackVersion, listVersions)
  • Configurable LoRA rank, alpha, learning rate
  • Training progress callback (epoch, step, loss)
  • KnowledgeGraphEnricher – AQL graph traversal context enrichment
  • findRelatedProvisions(), findRelatedCaseLaw(), findSimilarDocuments()
  • findSimilarDocuments() wired to VectorIndexManager for real cosine-similarity search via setVectorIndex() (v1.6.0) — knowledge_graph_enricher.h/.cpp, tests/test_kge_vector_search.cpp
  • Custom AQL query registration for domain-specific traversals
  • Pimpl pattern for ABI stability across all three components
  • LoRA Checkpoint Manager with SHA-256 integrity validation (Target: Q1 2026) — lora_checkpoint_manager.h/.cpp (b2342851)
  • Training Sample Provenance and Lineage Tracking (Target: Q1 2026) — provenance_tracker.h/.cpp (b2342851)
  • Knowledge Graph Enrichment LRU Cache (Target: Q1 2026) — EnrichmentLRUCache in knowledge_graph_enricher.cpp (b2342851)
  • ContentModality enum for multi-modality sample tracking (Target: Q1 2026) — auto_labeler.h (b2342851)
  • Confidence-Threshold Auto-Calibration via isotonic regression (Target: Q1 2026) — ConfidenceCalibrator in training_pipeline.h/.cpp (b2342851)
  • Multi-modality parser (ModalityDetector, TextClauseExtractor, TableExtractor, CitationExtractor, OCRExtractor) (Target: Q1 2026) — modality_parser.h/.cpp (b2342851)
  • Real LoRA weight manipulation in IncrementalLoRATrainer (Target: Q1 2026) — replaced computeSimulatedLoss() with LoRALayer + AdamOptimizer forward/backward/step; CUDA/HIP via GPULoRALayer; binary checkpoint serialization for B and A matrices (incremental_lora_trainer.cpp)
  • Dedicated LoRA adapter weight manipulation layer LoRAAdapter (Target: v1.7.0) — lora_adapter.h/.cpp; real forward pass (input @ B @ A × scaling), additive single/batch weight updates, Kaiming-B/zero-A init, export/import for checkpoint integration; 39 unit tests (test_training_lora_adapter.cpp)
  • Multi-GPU distributed training coordination (Target: Q2 2026) — IncrementalTrainingConfig.num_gpus/gpu_ids/sync_steps fields; MultiGPULoRATrainer wired in incremental_lora_trainer.cpp; data-parallel sharding, all-reduce gradient sync; device-count mismatch → std::runtime_error; fallback to single-GPU on init failure; TrainingResult.gpus_used field
  • Model quantization configuration (Target: Q2 2026) — TrainingQuantizationType enum (NONE/FP16/INT8/NF4), QuantizationConfig struct, IncrementalTrainingConfig.quantization field; validated in validateHyperparameters(); INT8 and NF4 activate QLoRALayer in the CPU training path (base weights frozen/compressed, only LoRA adapters A and B trained in full-precision); NONE/FP16 use standard LoRALayer
  • Training metrics tracking (Target: Q2 2026) — EpochMetrics (per-epoch loss/accuracy/lr/elapsed), TrainingMetrics (step_losses, epoch_metrics, best_train_loss, best_val_loss, total_elapsed_seconds); IncrementalLoRATrainer::getMetrics() public API; metrics reset at start of each train() call
  • LoRACheckpointManager integration in IncrementalLoRATrainer (Target: Q2 2026) — IncrementalTrainingConfig.checkpoint_dir field; when set, each saveCheckpoint() call delegates to LoRACheckpointManager::save() for atomic writes, SHA-256 integrity, and rolling-window rotation (3 checkpoints default)
  • AdaLoRA adaptive rank allocation (Target: Q2 2026) — ada_lora_adapter.h/.cpp; importance scoring via B/A norm products; reallocateRanks() proportional budget distribution; active-rank forward pass; 36 tests (test_ada_lora_adapter.cpp); AdaLoRAFocusedTests CMake target
  • LoRAAdapterMerger linear + TIES merging (Target: Q2 2026) — lora_adapter_merger.h/.cpp; mergeLinear() weighted ΔW sum + SVD factorisation; mergeTIES() Trim–Resolve–Merge (Yadav et al.); *All() batch overloads; 32 tests (test_lora_adapter_merger.cpp); LoRAMergerFocusedTests CMake target
  • LoRA+ asymmetric learning rates (Target: Q2 2026) — IncrementalTrainingConfig::lora_plus_lambda; when > 1.0, B uses lr*λ and A uses lr (Hayou et al., 2024); dual AdamOptimizer instances in IncrementalLoRATrainer::Impl

In Progress 🚧

  • Automated hyperparameter search (LoRA rank, learning rate sweep) (Target: Q2 2026) — HyperparamSearchConfig, HyperparamResult, HyperparamTrialResult, HyperparamSearchCallback in training_pipeline.h; runHyperparamSearch() in training_pipeline.cpp; 9 tests in tests/test_training_pipeline_e2e.cpp

    • Subsystems: src/training/training_pipeline.cpp (new HyperparamSearch inner class), ConfidenceCalibrator
    • Inputs: HyperparamSearchConfig{rank_candidates, lr_candidates, max_trials, budget_seconds}; validation split fraction
    • Outputs: HyperparamResult{best_rank, best_lr, best_val_loss, trial_log}; best config auto-applied to pipeline
    • Constraints: deterministic trial ordering (seeded random); concurrent trials capped at num_gpus
    • Errors: no improvement after max_trials → return best seen; budget exceeded → early-stop, return best so far
    • Tests: unit — mock trainer, verify trial scheduling; integration — sweep over 3 rank values on synthetic dataset
    • Perf: trial overhead (excluding training) ≤ 50 ms/trial; total sweep for 9-trial 3×3 grid ≤ 3× single-train time
  • Adapter serving integration with LLM inference layer (Target: Q3 2026) — ILLMRouter abstract interface + DeployResult in include/training/adapter_serving.h; setLLMRouter(ILLMRouter*) on IncrementalLoRATrainer; deployVersionEx()/rollbackVersionEx() propagate weight to router; 29 focused tests in test_training_phase2.cpp

Planned Features 📋

Short-term (Next 3-6 months)

  • [ ] Support for additional legal jurisdictions beyond German law
  • [ ] Active learning loop (auto-select most informative samples for labeling)
  • [ ] Training data deduplication and near-duplicate filtering
  • [ ] Evaluation metrics dashboard (validation loss curves, accuracy)
  • [ ] Export labeled datasets in standard formats (JSONL, Hugging Face datasets)

AdaLoRA ↔ Tensor-Train Bridge (Q2–Q4 2027)

Full research: research/ADALORA_TT_BRIDGE_RESEARCH.md
Specification: include/training/adalora_tt_bridge.h

  • Phase 1 (Q2 2027): Core conversion AdaLoRA ↔ TT — AdaLoraTTBridge::exportToTT() / importFromTT()
    • Mathematical basis: G₀[0,:,i] = P[:,i]·√λᵢ, G₁[i,:,0] = Q[:,i]·√λᵢ (bijective for 2D matrices)
    • QR sign-normalisation + orthogonality validation (‖P^T·P − I‖_F < ε_orth = 1e-4)
    • Round-trip error < machine epsilon; 15+ unit tests
    • Acceptance: lossless for active_rank ≤ 64; std::invalid_argument for rank > max_tt_rank
  • Phase 2 (Q2 2027): Storage integration — store() / loadAdapter() via TensorNetworkStorageEngine
    • Key schema: __lora_adapters__:<tenant>:<adapter>:<layer>:G<0|1>
    • LoRACheckpointManager backend TT_STORAGE; adapter-load latency target ≤ 11 ms (7B, r=64)
  • Phase 3 (Q3 2027): Deduplication + serving — TensorFingerprintGraph integration
    • findSimilarAdapters() for FLARE live adapter switch ≤ 15 ms
    • GgmlTensorBridge::mapAdapter() zero-copy mmap path
    • Expected: ≥40% storage reduction for 100 domain-related adapters
  • Phase 4 (Q4 2027): Unified rank control — roundAndReallocate()
    • TT-rounding as globally optimal alternative to AdaLoRA greedy pruning
    • Comparison study: AdaLoRA pruning vs. TT-SVD vs. combined (target: ≥2% better Frobenius-optimal rank cut)
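For readers skimming the bridge plan: the Phase-1 mapping is a symmetric splitting of the SVD factors of the adapter update into two TT cores. A sketch of the round-trip, using the symbols from the bullets above (assuming ΔW = P·diag(λ)·Qᵀ with orthogonal P, Q):

```latex
\Delta W \;=\; P\,\mathrm{diag}(\lambda)\,Q^{\top}
        \;=\; \sum_{i=1}^{r} \lambda_i \, p_i q_i^{\top},
\qquad
G_0[0,:,i] = p_i \sqrt{\lambda_i},\quad
G_1[i,:,0] = q_i \sqrt{\lambda_i},
\qquad
\Delta W[m,n] \;=\; \sum_{i=1}^{r} G_0[0,m,i]\, G_1[i,n,0]
```

Splitting √λᵢ symmetrically between the two cores keeps both well-conditioned; because the contraction is an exact sum over the r active rank-one terms, the round-trip is lossless up to floating-point error, which is what the "< machine epsilon" acceptance criterion in Phase 1 demands.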

Long-term (6-12 months)

  • [ ] Reinforcement learning from human feedback (RLHF) training loop
  • [ ] Multi-modal training samples (text + table + chart)
  • Domain adaptation beyond legal (medical, financial) — DomainType LEGAL/MEDICAL/FINANCIAL in auto_labeler.h; domain-specific keyword extraction for medical/financial domains in auto_labeler.cpp
  • Federated learning for privacy-preserving cross-institution training (Target: Q2 2026) — LoRAFederationCoordinator + IncrementalLoRATrainer::exportGradient()/applyGlobalDelta() in distributed_knowledge and training modules
  • Model distillation from large to small adapters (Target: Q2 2026) — FederatedDistillationCoordinator in distributed_knowledge/federated_distillation_coordinator.h/.cpp; teacher submits DP-protected soft labels; student nodes receive via registerStudent() callback; PolicyGate + rollback trigger + audit hook; FDF-01..10 tests in tests/test_federated_distillation_coordinator.cpp

Implementation Phases

Phase 1: Auto-Labeling & LoRA Training Pipeline (Status: Completed ✅)

  • LegalAutoLabeler – NLP modality extraction from legal documents
  • labelAll(), labelDocument(), labelQuery() public APIs
  • labelAll() and labelQuery() fetch document IDs from the DB via AQL query executor (executeAql()); offline/nullptr-engine fallback for tests
  • Low-confidence sample flagging and human-review queue with updateSampleConfidence()
  • IncrementalLoRATrainer – full LoRA lifecycle (train, evaluate, deploy, rollback)
  • INITIAL and INCREMENTAL training modes with configurable rank/alpha/lr
  • Checkpoint save and resume (resumeFromCheckpoint())
  • Adapter version management (deployVersion, rollbackVersion, listVersions)
  • KnowledgeGraphEnricher – AQL graph traversal context enrichment (findRelatedProvisions, findRelatedCaseLaw)
  • Confidence-threshold filtering for automatic sample acceptance
  • Pimpl pattern for ABI stability across all three components

Phase 2: Adapter Management & Multi-Domain (Status: Completed ✅)

  • Adapter version management: atomic deploy/rollback with integrity verification (Target: Q2 2026) — deployVersionEx()/rollbackVersionEx() in incremental_lora_trainer.h/.cpp; verifyAdapterIntegrity() calls LoRACheckpointManager::validate() when checkpoint_dir is set; bypass for unmanaged adapters; DeployResult{success,active_version,split_applied,error} result struct; error codes: "version_not_found", "integrity_failure", "router_unavailable", "invalid_split"
  • Multi-domain support beyond German legal text (medical, financial) (Target: Q2 2026) — DomainType enum (LEGAL/MEDICAL/FINANCIAL) added to auto_labeler.h; AutoLabelConfig::domain_type field; extractFallbackModalities() in auto_labeler.cpp dispatches domain-specific obligation/recommendation/permission/prohibition patterns for medical (must/shall/required/should/recommended/may/contraindicated/verboten) and financial (must/shall/required/should/may/prohibited/forbidden/disclose/report/offenlegen/melden) domains; German and English terms both covered
  • Automated hyperparameter search (LoRA rank and learning rate sweep) (Target: Q2 2026)
  • Adapter serving integration with the LLM inference layer (Target: Q3 2026) — ILLMRouter abstract interface (adapter_serving.h/.cpp): setAdapterWeight(version,weight), isAvailable(), activeVersion(); IncrementalLoRATrainer::setLLMRouter(ILLMRouter*) wires the router; deployVersionEx()/rollbackVersionEx() propagate weight updates to the router atomically after local registry update; unavailable router → DeployResult.error = "router_unavailable"

Phase 3: Multi-Modality & Provenance (Status: Completed ✅)

  • ContentModality enum (TEXT_CLAUSE, TABLE, CITATION, OCR_IMAGE, UNKNOWN) added to auto_labeler.h
  • modality field added to TrainingSample struct for per-modality confidence thresholds
  • LoRACheckpointManager – SHA-256 integrity validation, atomic rotation, rolling 3-checkpoint window, manifest JSON (lora_checkpoint_manager.h/.cpp)
  • ProvenanceTracker – ProvenanceRecord, write(), recordFilteredSample(), queryLineage(), getRecord() (provenance_tracker.h/.cpp)
  • EnrichmentLRUCache – thread-safe LRU map inside KnowledgeGraphEnricher, enableCache/disableCache/getCacheStats API
  • ConfidenceCalibrator – isotonic regression (PAV algorithm) per-category threshold selection in training_pipeline.h/.cpp
  • Multi-modality full parser (training/modality_parser.h/.cpp): ModalityDetector, TextClauseExtractor, TableExtractor, CitationExtractor, OCRExtractor
  • Standalone focused test targets for training module (ModalityParserFocusedTests, TrainingConvergenceFocusedTests)
  • [ ] Active learning loop (auto-select most informative unlabelled samples)

Phase 4: DATABASE_OPTIMIZER Domain AutoLabeler — IMPL-A1 (Status: Completed ✅)

Paper 1 — §5 Training Data Pipeline / §7.4 Golden Dataset Construction. Issue: docs/issues/lora_loops/IMPL-A1-dataset-construction.md

  • Add DomainType::DATABASE_OPTIMIZER to DomainType enum in include/training/auto_labeler.h
  • Implement DatabaseDomainAutoLabeler class: extends LegalAutoLabeler infrastructure, labels (query, plan, Δlatency) triples — include/training/database_domain_auto_labeler.h + src/training/database_domain_auto_labeler.cpp
  • Add DATABASE_OPTIMIZER branch to LegalAutoLabeler::categorize() dispatch table
  • Add domain keywords (EXPLAIN, index scan, seq scan, hash join, latency, p99) to LoRADataSelectionConfig
  • Confidence score: tanh(|Δlatency_ms| / 50) — labels with |Δlatency| < 5 ms auto-rejected
  • Validation against LoRADataSelectionPipeline quality filters (duplicate-query dedup, min confidence 0.85) — DBO-06 uses DataSelectionPipeline::deduplicate()
  • 8 unit tests in tests/test_training_database_optimizer.cpp (DBO-01..08; test_training_database_optimizer_focused target)
    • DBO-01 categorize() returns DATABASE_OPTIMIZER for EXPLAIN output sample
    • DBO-02 confidence 0.0 for |Δlatency| = 0 ms
    • DBO-03 confidence ≥ 0.85 for |Δlatency| = 50 ms
    • DBO-04 domain keyword match triggers correct domain type
    • DBO-05 CLI export produces valid JSONL
    • DBO-06 duplicate query filtered by LoRADataSelectionPipeline
    • DBO-07 medical/legal domains unaffected by DATABASE_OPTIMIZER branch
    • DBO-08 1 000 sample golden dataset passes all quality filters
  • Implement optimizer-log export CLI: emits JSONL with (query, explain_plan, latency_delta_ms) fields — DatabaseDomainAutoLabeler::exportToJsonl() static method
  • Collect 1 000 labeled pairs from all 4 loops as minimum viable golden dataset — DBO-08 validates 1000 synthetic samples, all confidence ≥ 0.85

Phase 5: Federation Bridges — IMPL-A3 (Status: Completed ✅)

Paper 1+3 — §4.5 Adapter Lifecycle / Distributed Knowledge §Layer B. Issue: docs/issues/lora_loops/IMPL-A3-federation-hooks.md

  • IncrementalLoRATrainer::exportGradient() → EncryptedGradient (opaque blob, AES-256-GCM) — include/training/incremental_lora_trainer.h
  • IncrementalLoRATrainer::applyGlobalDelta(const GlobalAdapterDelta&) → applies FedAvg aggregate to local adapter weights
  • EncryptedGradient and GlobalAdapterDelta structs in training_interfaces.h
  • Privacy invariant: exportGradient() output must never contain raw training samples — enforced by unit test
  • 5 unit tests in tests/test_incremental_lora_trainer.cpp (FED-01..05)
    • FED-01 exportGradient() returns non-empty blob after training
    • FED-02 applyGlobalDelta() verifiably changes adapter weights (weight-diff ≠ 0)
    • FED-03 applying zero-delta leaves weights unchanged
    • FED-04 privacy: raw sample text absent from EncryptedGradient serialised bytes
    • FED-05 double-apply is idempotent when delta == 0

Phase 6: Federated Distillation & Privacy-Preserving Learning (Status: Completed ✅)

  • Phase 1 — Protokoll-Design: Federated Distillation protocol for Client/Coordinator/Verifier roles specified and wired via IncrementalLoRATrainer, LoRAFederationCoordinator, and governance/audit hooks (Target: Q2 2026)
  • Phase 1 — Threat Model: honest-but-curious + Byzantine client model, membership-inference/model-inversion risk coverage documented in module security/audit docs (Target: Q2 2026)
  • Phase 2 — Baseline + Privacy Controls: central-vs-federated baseline path, Gaussian DP controls (dp_epsilon, dp_delta), secure cross-shard gradient exchange (EncryptedGradient) integrated (Target: Q2 2026)
  • Phase 2 — Robust Aggregation: non-IID-resilient median/FedAvg aggregation and poisoning/outlier protection paths validated by distributed-knowledge tests (Target: Q2 2026)
  • Phase 3 — Evaluation: non-IID and cross-domain federation scenarios validated in tests/test_distributed_knowledge_integration.cpp and resilience suite tests/test_distributed_knowledge_or.cpp (Target: Q2 2026)
  • Phase 3 — Trade-off Measurement: privacy/utility and failure-mode observability exposed via coordinator stats (getStats()) and audit callbacks (Target: Q2 2026)
  • Phase 4 — Productive Rollout: canary-style staged federation enablement, model governance controls, and rollback path through deployVersionEx()/rollbackVersionEx() and federation admin integration (Target: Q3 2026)
  • Phase 4 — Fallback Safety: policy/quality guardrails enforce safe fallback to local adapters if federation or governance checks fail (Target: Q3 2026)

Federated Distillation KPIs (initial)

  • ≥ 90% task quality vs. centralized baseline at configured privacy budget (Target: Q3 2026)
  • Federated round overhead ≤ 15% versus non-federated update in focused DK benchmarks (Target: Q3 2026)
  • 0 unprotected raw-data exfiltration along training/federation paths (Target: Continuous)
  • Demonstrated robustness under simulated poisoning and timeout scenarios (Target: Q3 2026)

Federated Distillation Deliverables

  • Technical protocol + threat model documentation (Target: Q2 2026)
  • Reproducible evaluation suite for federated rounds and resilience scenarios (Target: Q2 2026)
  • Governance/release criteria with model rollback safety checks (Target: Q3 2026)
  • Implementation backlog for production integration and hardening (Target: Q3 2026)

Federated Distillation Acceptance Criteria

  • Privacy/utility trade-off measurable and reviewable by stakeholders
  • Security mechanisms validated through tests and attack/failure simulations
  • Rollout and rollback path documented and testable in training + federation flows

Production Readiness Checklist

  • Unit test coverage > 80% (8 test files, 4,381 lines; ConfidenceCalibrator, ModalityParser, Pipeline E2E, Data Selection, Checkpoint, Provenance all covered)
  • Integration tests (label → train → evaluate → deploy lifecycle) – test_training_pipeline_e2e.cpp
  • Performance benchmarks – benchmarks/bench_legal_lora_pipeline.cpp
  • [ ] Security audit (PII scanning, tenant isolation, checkpoint encryption at rest – see FUTURE_ENHANCEMENTS.md Security/Reliability section)
  • Documentation complete (README.md, ARCHITECTURE.md, ROADMAP.md, FUTURE_ENHANCEMENTS.md)
  • API stability guaranteed (Pimpl pattern; TrainingSample struct stable from v1.x)

Known Issues & Limitations

  • NLP modality extractor is provided externally (analytics::NlpTextAnalyzer); not bundled.
  • Multi-GPU training requires THEMIS_ENABLE_LLM && THEMIS_ENABLE_GPU at build time; single-GPU fallback is automatic.
  • IncrementalTrainingConfig.quantization governs the training-module view of quantization; INT8/NF4 use QLoRALayer (from llm/lora_framework/quantized_model.h) so only LoRA adapters are updated in full precision while the base weights remain compressed. The LLM inference layer uses a separate QuantizationType defined in llm/lora_framework/quantization.h.
  • LoRA adapter serving (inference) must be handled by the LLM integration layer.
  • Real LoRA weight updates use the embedded Tensor framework; base-model tokenization (llama.cpp) is not yet wired — training batches are encoded as float feature vectors from sample hashes.
  • LoRAAdapter (training module) operates independently of the LLM-layer LoRALayer; integration with IncrementalLoRATrainer checkpoints is the caller's responsibility via exportWeights() / importWeights().

Breaking Changes

  • TrainingSample struct is stable from v1.x; new optional fields only.
  • IncrementalTrainingConfig may gain new hyperparameter fields in v1.5.0; backward-compatible.

Latent Symbols (Unused-Functions Audit)

As of: 2026-04-20 – Source: src/UNUSED_FUNCTIONS_REPORT.md

🧪 NUR_TESTS (implemented, no production caller)

  • AdaLoRAAdapter – AdaLoRA adapter for parameter-efficient fine-tuning; tests exist

    Action: add a ROADMAP ticket for production integration, or mark as CANDIDATE_FOR_REMOVAL.