Executive Summary
The choice of modeling approach significantly impacts the success of advanced process control implementations. This whitepaper compares three fundamental approaches—white-box (first-principles), black-box (data-driven), and gray-box (hybrid)—examining their strengths, limitations, and applicability to chemical process control.
Our analysis and practical experience demonstrate that gray-box models, which combine physical knowledge with machine learning, deliver superior performance for most industrial applications by balancing interpretability, accuracy, and data efficiency.
Introduction
Model-based control requires accurate predictions of process behavior. The quality of these predictions directly determines control performance. Engineers face a fundamental choice: should they rely on physical understanding (first-principles), learn entirely from data (machine learning), or combine both approaches?
This decision has significant implications for:
- Development time and cost
- Data requirements
- Prediction accuracy
- Extrapolation capability
- Interpretability and regulatory acceptance
- Maintenance and adaptation
White-Box Models (First-Principles)
White-box models are built entirely from physical and chemical principles: mass balances, energy balances, reaction kinetics, thermodynamics, and transport phenomena.
Structure
A typical reactor model includes:
Mass Balance: dC/dt = F/V(C_in - C) - r(C, T)
Energy Balance: dT/dt = F/V(T_in - T) + (-ΔH)r/ρCp - UA(T - T_j)/VρCp
Kinetics: r = k₀ exp(-E_a/RT) C^n
Advantages
- Interpretability: Every parameter has physical meaning
- Extrapolation: Valid across operating ranges if physics understood
- Low data needs: Can work with minimal process data
- Regulatory acceptance: Well-understood by inspectors
Limitations
- Development time: Requires extensive engineering effort
- Unknown parameters: Kinetic constants often unavailable or uncertain
- Simplifications: Real processes have unmodeled phenomena
- Maintenance: Models drift as equipment ages
When to Use
White-box models work well when the underlying physics is well understood, when data is scarce, and when regulatory requirements demand full traceability. They are common in early process development and safety analysis.
Black-Box Models (Data-Driven)
Black-box models learn input-output relationships directly from data without incorporating physical knowledge. Modern approaches include deep neural networks, recurrent networks (LSTM, GRU), and transformer architectures.
Structure
A neural network model:
y = f_NN(x₁, x₂, ..., x_n; θ)
where θ represents learned weights and biases
Advantages
- No physics required: Works without process understanding
- Handles complexity: Can capture nonlinear interactions
- Automatic feature learning: Discovers patterns in data
- Rapid development: Quick to train with sufficient data
Limitations
- Data hungry: Requires large, diverse datasets
- Poor extrapolation: Unreliable outside training data range
- Interpretability: Difficult to understand or explain predictions
- Safety concerns: May produce physically impossible outputs
- Regulatory skepticism: "Trust but verify" is difficult
When to Use
Black-box models suit applications with abundant data, limited physical understanding, and lower safety criticality. They excel at pattern recognition tasks like soft sensors and quality prediction.
Gray-Box Models (Hybrid)
Gray-box models combine physical knowledge with data-driven components. The physical structure constrains the model to behave reasonably while machine learning captures unknown or complex phenomena.
Architectures
Several hybrid architectures are possible:
1. Serial Hybrid
First-principles model with neural network correction:
y = f_physics(x) + f_NN(x)
The neural network learns the residual between physics and reality
2. Parallel Hybrid
Neural network for specific unknown terms:
dC/dt = F/V(C_in - C) - r_NN(C, T)
Physics structure maintained; kinetics learned from data
3. Physics-Informed Neural Networks (PINNs)
Neural networks trained with physics-based loss functions:
Loss = L_data + λ · L_physics
Model penalized for violating physical constraints
Advantages
- Best of both worlds: Physical interpretability with learning capability
- Data efficient: Physics provides structure; less data needed
- Safer extrapolation: Physical constraints prevent unreasonable predictions
- Regulatory acceptable: Explainable structure with documented learning
- Adaptable: ML components can update without re-engineering physics
Limitations
- More complex: Requires both domain and ML expertise
- Architecture choices: Many ways to combine—selection requires experience
- Validation: Both physical and learned components need verification
Quantitative Comparison
We compared the three approaches on a batch reactor temperature control problem using identical training data (100 batches) and validation data (50 batches).
| Metric | White-Box | Black-Box | Gray-Box |
|---|---|---|---|
| RMSE (in-distribution) | 2.1°C | 0.8°C | 0.9°C |
| RMSE (extrapolation) | 3.5°C | 12.4°C | 2.8°C |
| Physical constraint violations | 0% | 8% | 0% |
| Data for 95% accuracy | 10 batches | 500 batches | 50 batches |
| Development time | High | Low | Medium |
| Interpretability | High | Low | Medium-High |
Key observations:
- Black-box achieves best in-distribution accuracy but fails on extrapolation
- White-box extrapolates reasonably but struggles with model mismatch
- Gray-box combines good accuracy with robust extrapolation
- Gray-box requires ~10x less data than black-box for similar performance
The Acaysia Approach
Acaysia uses gray-box models as the foundation for process control. Our architecture:
Physical Foundation
- Mass and energy balances provide model structure
- Thermodynamic relationships constrain predictions
- Known kinetics incorporated where available
Learned Components
- Neural networks capture reaction kinetics
- Heat transfer correlations learned from data
- Fouling and degradation effects modeled adaptively
Online Adaptation
- Continuous comparison of predictions to measurements
- Automatic retraining when drift detected
- Operator notifications when models update
Safety Guarantees
- Physical constraints enforced in model output
- Uncertainty quantification identifies low-confidence regions
- Automatic fallback when predictions unreliable
Practical Recommendations
For New Implementations
- Start with available physical knowledge—even approximate models help
- Identify which phenomena are well-understood vs. uncertain
- Use ML for uncertain components while maintaining physical structure
- Validate extensively, especially at operating envelope boundaries
For Existing Systems
- Assess current model performance and identify gaps
- Collect data specifically targeting model weaknesses
- Add ML components incrementally, validating each addition
- Maintain ability to revert to previous model version
For Regulatory Environments
- Document physical basis and ML components separately
- Maintain clear audit trails of model updates
- Demonstrate model performance across validated ranges
- Establish procedures for model change control
Conclusion
The choice between white-box, black-box, and gray-box models involves tradeoffs among accuracy, interpretability, data requirements, and extrapolation capability. For industrial process control applications, gray-box models offer the most attractive balance:
- Physical structure ensures reasonable behavior
- ML components capture complex, unknown phenomena
- Data efficiency enables deployment in real facilities
- Interpretability supports regulatory compliance
- Adaptability maintains performance over time
As ML techniques continue to advance, we expect gray-box approaches to become the standard for safety-critical industrial applications where purely data-driven methods carry too much risk.
References
- Psichogios, D.C. & Ungar, L.H. (1992). "A hybrid neural network-first principles approach to process modeling." AIChE Journal.
- Raissi, M., Perdikaris, P., & Karniadakis, G.E. (2019). "Physics-informed neural networks." Journal of Computational Physics.
- Thompson, M.L. & Kramer, M.A. (1994). "Modeling chemical processes using prior knowledge and neural networks." AIChE Journal.
- von Stosch, M., Oliveira, R., Peres, J., & Feyo de Azevedo, S. (2014). "Hybrid semi-parametric modeling in process systems engineering." Computers & Chemical Engineering.