πŸ“Š Comparison

Design Recovery Control vs RL-Based Control vs LLM-Based Control


🎯 Purpose

This document provides a strict, explicit, and non-negotiable comparison between:

Its purpose is to prevent conceptual mixing,
especially in safety-critical, audited, or certified engineering contexts.


πŸ”‘ Fundamental Conceptual Difference

The decisive difference is what is being controlled.

Framework What Is Directly Controlled
πŸ›  DRC Control design assumptions
πŸ” RL-based control Control inputs or learned policies
🧠 LLM-based control Control decisions or actions

This distinction is architectural, not stylistic.


🧩 Architectural Comparison

Aspect πŸ›  DRC πŸ” RL-Based Control 🧠 LLM-Based Control
Real-time control PID / FSM only Learned policy LLM inference
Learning element None Central Central
LLM role Design supervisor only None Primary controller
Execution timing Asynchronous, discrete Continuous / online Continuous or event-driven
Safety authority PID + FSM (explicit) External or learned Often implicit
Determinism Deterministic Often stochastic Non-deterministic
Inspectability Full Partial Low
Certification suitability High Low–Medium Very low

πŸ”’ Control Authority Boundary

πŸ›  Design Recovery Control (DRC)

πŸ‘‰ Control authority remains fully classical and deterministic.


πŸ” Reinforcement Learning–Based Control

⚠ Common risks:


🧠 LLM-Based Control

⚠ Common risks:


πŸ”„ Learning vs Recovery

Concept πŸ›  DRC πŸ” RL 🧠 LLM Control
Online learning ❌ No βœ… Yes ⚠ Sometimes
Self-modifying behavior ❌ No βœ… Yes ❌ Often
Design intent preservation βœ… Yes ❌ No ❌ No
Assumption recovery βœ… Yes ❌ No ❌ No

DRC restores design validity,
not behavior.


⚠ Failure Handling Philosophy

πŸ›  DRC


πŸ” RL / 🧠 LLM Control

🚫 These philosophies are fundamentally irreconcilable.


πŸ›‘ Safety and Certification Perspective

Criterion πŸ›  DRC πŸ” RL 🧠 LLM Control
Real-time determinism βœ… ❌ ❌
Explicit safety guards βœ… FSM ⚠ Optional ❌ Rare
Auditability βœ… ⚠ Partial ❌
Formal verification βœ… ❌ ❌
Human approval gating βœ… ❌ ❌

🧭 When Each Approach Is Appropriate

Use πŸ›  DRC when:


Use πŸ” RL when:


Use 🧠 LLM-Based Control when:


🚫 Explicit Non-Equivalence Statement

Design Recovery Control is NOT a form of reinforcement learning.
Design Recovery Control is NOT an LLM-based controller.

Any system that allows an RL agent or LLM
to directly influence control inputs
must not be described as DRC.


πŸ”’ Design Intent Freeze

This document fixes the conceptual boundaries
between DRC, RL-based control, and LLM-based control.

Future documents may expand examples,
but must not blur, merge, or reinterpret these categories.


End of document.