feat: Add MLP surrogate with Turbo Mode for 100x faster optimization
Neural Acceleration (MLP Surrogate): - Add run_nn_optimization.py with hybrid FEA/NN workflow - MLP architecture: 4-layer (64->128->128->64) with BatchNorm/Dropout - Three workflow modes: - --all: Sequential export->train->optimize->validate - --hybrid-loop: Iterative Train->NN->Validate->Retrain cycle - --turbo: Aggressive single-best validation (RECOMMENDED) - Turbo mode: 5000 NN trials + 50 FEA validations in ~12 minutes - Separate nn_study.db to avoid overloading dashboard Performance Results (bracket_pareto_3obj study): - NN prediction errors: mass 1-5%, stress 1-4%, stiffness 5-15% - Found minimum mass designs at boundary (angle~30deg, thick~30mm) - 100x speedup vs pure FEA exploration Protocol Operating System: - Add .claude/skills/ with Bootstrap, Cheatsheet, Context Loader - Add docs/protocols/ with operations (OP_01-06) and system (SYS_10-14) - Update SYS_14_NEURAL_ACCELERATION.md with MLP Turbo Mode docs NX Automation: - Add optimization_engine/hooks/ for NX CAD/CAE automation - Add study_wizard.py for guided study creation - Fix FEM mesh update: load idealized part before UpdateFemodel() New Study: - bracket_pareto_3obj: 3-objective Pareto (mass, stress, stiffness) - 167 FEA trials + 5000 NN trials completed - Demonstrates full hybrid workflow 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
564
docs/protocols/system/SYS_14_NEURAL_ACCELERATION.md
Normal file
564
docs/protocols/system/SYS_14_NEURAL_ACCELERATION.md
Normal file
@@ -0,0 +1,564 @@
|
||||
# SYS_14: Neural Network Acceleration
|
||||
|
||||
<!--
|
||||
PROTOCOL: Neural Network Surrogate Acceleration
|
||||
LAYER: System
|
||||
VERSION: 2.0
|
||||
STATUS: Active
|
||||
LAST_UPDATED: 2025-12-06
|
||||
PRIVILEGE: user
|
||||
LOAD_WITH: [SYS_10_IMSO, SYS_11_MULTI_OBJECTIVE]
|
||||
-->
|
||||
|
||||
## Overview
|
||||
|
||||
Atomizer provides **neural network surrogate acceleration** enabling 100-1000x faster optimization by replacing expensive FEA evaluations with instant neural predictions.
|
||||
|
||||
**Two approaches available**:
|
||||
1. **MLP Surrogate** (Simple, integrated) - 4-layer MLP trained on FEA data, runs within study
|
||||
2. **GNN Field Predictor** (Advanced) - Graph neural network for full field predictions
|
||||
|
||||
**Key Innovation**: Train once on FEA data, then explore 5,000-50,000+ designs in the time it takes to run 50 FEA trials.
|
||||
|
||||
---
|
||||
|
||||
## When to Use
|
||||
|
||||
| Trigger | Action |
|
||||
|---------|--------|
|
||||
| >50 trials needed | Consider neural acceleration |
|
||||
| "neural", "surrogate", "NN" mentioned | Load this protocol |
|
||||
| "fast", "acceleration", "speed" needed | Suggest neural acceleration |
|
||||
| Training data available | Enable surrogate |
|
||||
|
||||
---
|
||||
|
||||
## Quick Reference
|
||||
|
||||
**Performance Comparison**:
|
||||
|
||||
| Metric | Traditional FEA | Neural Network | Improvement |
|
||||
|--------|-----------------|----------------|-------------|
|
||||
| Time per evaluation | 10-30 minutes | 4.5 milliseconds | **2,000-500,000x** |
|
||||
| Trials per hour | 2-6 | 800,000+ | **1000x** |
|
||||
| Design exploration | ~50 designs | ~50,000 designs | **1000x** |
|
||||
|
||||
**Model Types**:
|
||||
|
||||
| Model | Purpose | Use When |
|
||||
|-------|---------|----------|
|
||||
| **MLP Surrogate** | Direct objective prediction | Simple studies, quick setup |
|
||||
| Field Predictor GNN | Full displacement/stress fields | Need field visualization |
|
||||
| Parametric Predictor GNN | Direct objective prediction | Complex geometry, need accuracy |
|
||||
| Ensemble | Uncertainty quantification | Need confidence bounds |
|
||||
|
||||
---
|
||||
|
||||
## MLP Surrogate (Recommended for Quick Start)
|
||||
|
||||
### Overview
|
||||
|
||||
The MLP (Multi-Layer Perceptron) surrogate is a simple but effective neural network that predicts objectives directly from design parameters. It's integrated into the study workflow via `run_nn_optimization.py`.
|
||||
|
||||
### Architecture
|
||||
|
||||
```
|
||||
Input Layer (N design variables)
|
||||
↓
|
||||
Linear(N, 64) + ReLU + BatchNorm + Dropout(0.1)
|
||||
↓
|
||||
Linear(64, 128) + ReLU + BatchNorm + Dropout(0.1)
|
||||
↓
|
||||
Linear(128, 128) + ReLU + BatchNorm + Dropout(0.1)
|
||||
↓
|
||||
Linear(128, 64) + ReLU + BatchNorm + Dropout(0.1)
|
||||
↓
|
||||
Linear(64, M objectives)
|
||||
```
|
||||
|
||||
**Parameters**: ~34,000 trainable
|
||||
|
||||
### Workflow Modes
|
||||
|
||||
#### 1. Standard Hybrid Mode (`--all`)
|
||||
|
||||
Run all phases sequentially:
|
||||
```bash
|
||||
python run_nn_optimization.py --all
|
||||
```
|
||||
|
||||
Phases:
|
||||
1. **Export**: Extract training data from existing FEA trials
|
||||
2. **Train**: Train MLP surrogate (300 epochs default)
|
||||
3. **NN-Optimize**: Run 1000 NN trials with NSGA-II
|
||||
4. **Validate**: Validate top 10 candidates with FEA
|
||||
|
||||
#### 2. Hybrid Loop Mode (`--hybrid-loop`)
|
||||
|
||||
Iterative refinement:
|
||||
```bash
|
||||
python run_nn_optimization.py --hybrid-loop --iterations 5 --nn-trials 500
|
||||
```
|
||||
|
||||
Each iteration:
|
||||
1. Train/retrain surrogate from current FEA data
|
||||
2. Run NN optimization
|
||||
3. Validate top candidates with FEA
|
||||
4. Add validated results to training set
|
||||
5. Repeat until convergence (max error < 5%)
|
||||
|
||||
#### 3. Turbo Mode (`--turbo`) ⚡ RECOMMENDED
|
||||
|
||||
Aggressive single-best validation:
|
||||
```bash
|
||||
python run_nn_optimization.py --turbo --nn-trials 5000 --batch-size 100 --retrain-every 10
|
||||
```
|
||||
|
||||
Strategy:
|
||||
- Run NN in small batches (100 trials)
|
||||
- Validate ONLY the single best candidate with FEA
|
||||
- Add to training data immediately
|
||||
- Retrain surrogate every N FEA validations
|
||||
- Repeat until total NN budget exhausted
|
||||
|
||||
**Example**: 5,000 NN trials with batch=100 → 50 FEA validations in ~12 minutes
|
||||
|
||||
### Configuration
|
||||
|
||||
```json
|
||||
{
|
||||
"neural_acceleration": {
|
||||
"enabled": true,
|
||||
"min_training_points": 50,
|
||||
"auto_train": true,
|
||||
"epochs": 300,
|
||||
"validation_split": 0.2,
|
||||
"nn_trials": 1000,
|
||||
"validate_top_n": 10,
|
||||
"model_file": "surrogate_best.pt",
|
||||
"separate_nn_database": true
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
**Important**: `separate_nn_database: true` stores NN trials in `nn_study.db` instead of `study.db` to avoid overloading the dashboard with thousands of NN-only results.
|
||||
|
||||
### Typical Accuracy
|
||||
|
||||
| Objective | Expected Error |
|
||||
|-----------|----------------|
|
||||
| Mass | 1-5% |
|
||||
| Stress | 1-4% |
|
||||
| Stiffness | 5-15% |
|
||||
|
||||
### Output Files
|
||||
|
||||
```
|
||||
2_results/
|
||||
├── study.db # Main FEA + validated results (dashboard)
|
||||
├── nn_study.db # NN-only results (not in dashboard)
|
||||
├── surrogate_best.pt # Trained model weights
|
||||
├── training_data.json # Normalized training data
|
||||
├── nn_optimization_state.json # NN optimization state
|
||||
├── nn_pareto_front.json # NN-predicted Pareto front
|
||||
├── validation_report.json # FEA validation results
|
||||
└── turbo_report.json # Turbo mode results (if used)
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
## GNN Field Predictor (Advanced)
|
||||
|
||||
### Core Components
|
||||
|
||||
| Component | File | Purpose |
|
||||
|-----------|------|---------|
|
||||
| BDF/OP2 Parser | `neural_field_parser.py` | Convert NX files to neural format |
|
||||
| Data Validator | `validate_parsed_data.py` | Physics and quality checks |
|
||||
| Field Predictor | `field_predictor.py` | GNN for full field prediction |
|
||||
| Parametric Predictor | `parametric_predictor.py` | GNN for direct objectives |
|
||||
| Physics Loss | `physics_losses.py` | Physics-informed training |
|
||||
| Neural Surrogate | `neural_surrogate.py` | Integration with Atomizer |
|
||||
| Neural Runner | `runner_with_neural.py` | Optimization with NN acceleration |
|
||||
|
||||
### Workflow Diagram
|
||||
|
||||
```
|
||||
Traditional:
|
||||
Design → NX Model → Mesh → Solve (30 min) → Results → Objective
|
||||
|
||||
Neural (after training):
|
||||
Design → Neural Network (4.5 ms) → Results → Objective
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
## Neural Model Types
|
||||
|
||||
### 1. Field Predictor GNN
|
||||
|
||||
**Use Case**: When you need full field predictions (stress distribution, deformation shape).
|
||||
|
||||
```
|
||||
Input Features (12D per node):
|
||||
├── Node coordinates (x, y, z)
|
||||
├── Material properties (E, nu, rho)
|
||||
├── Boundary conditions (fixed/free per DOF)
|
||||
└── Load information (force magnitude, direction)
|
||||
|
||||
GNN Layers (6 message passing):
|
||||
├── MeshGraphConv (custom for FEA topology)
|
||||
├── Layer normalization
|
||||
├── ReLU activation
|
||||
└── Dropout (0.1)
|
||||
|
||||
Output (per node):
|
||||
├── Displacement (6 DOF: Tx, Ty, Tz, Rx, Ry, Rz)
|
||||
└── Von Mises stress (1 value)
|
||||
```
|
||||
|
||||
**Parameters**: ~718,221 trainable
|
||||
|
||||
### 2. Parametric Predictor GNN (Recommended)
|
||||
|
||||
**Use Case**: Direct optimization objective prediction (fastest option).
|
||||
|
||||
```
|
||||
Design Parameters (ND) → Design Encoder (MLP) → GNN Backbone → Scalar Heads
|
||||
|
||||
Output (objectives):
|
||||
├── mass (grams)
|
||||
├── frequency (Hz)
|
||||
├── max_displacement (mm)
|
||||
└── max_stress (MPa)
|
||||
```
|
||||
|
||||
**Parameters**: ~500,000 trainable
|
||||
|
||||
### 3. Ensemble Models
|
||||
|
||||
**Use Case**: Uncertainty quantification.
|
||||
|
||||
1. Train 3-5 models with different random seeds
|
||||
2. At inference, run all models
|
||||
3. Use mean for prediction, std for uncertainty
|
||||
4. High uncertainty → trigger FEA validation
|
||||
|
||||
---
|
||||
|
||||
## Training Pipeline
|
||||
|
||||
### Step 1: Collect Training Data
|
||||
|
||||
Enable export in workflow config:
|
||||
|
||||
```json
|
||||
{
|
||||
"training_data_export": {
|
||||
"enabled": true,
|
||||
"export_dir": "atomizer_field_training_data/my_study"
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
Output structure:
|
||||
```
|
||||
atomizer_field_training_data/my_study/
|
||||
├── trial_0001/
|
||||
│ ├── input/model.bdf # Nastran input
|
||||
│ ├── output/model.op2 # Binary results
|
||||
│ └── metadata.json # Design params + objectives
|
||||
├── trial_0002/
|
||||
│ └── ...
|
||||
└── study_summary.json
|
||||
```
|
||||
|
||||
**Recommended**: 100-500 FEA samples for good generalization.
|
||||
|
||||
### Step 2: Parse to Neural Format
|
||||
|
||||
```bash
|
||||
cd atomizer-field
|
||||
python batch_parser.py ../atomizer_field_training_data/my_study
|
||||
```
|
||||
|
||||
Creates HDF5 + JSON files per trial.
|
||||
|
||||
### Step 3: Train Model
|
||||
|
||||
**Parametric Predictor** (recommended):
|
||||
```bash
|
||||
python train_parametric.py \
|
||||
--train_dir ../training_data/parsed \
|
||||
--val_dir ../validation_data/parsed \
|
||||
--epochs 200 \
|
||||
--hidden_channels 128 \
|
||||
--num_layers 4
|
||||
```
|
||||
|
||||
**Field Predictor**:
|
||||
```bash
|
||||
python train.py \
|
||||
--train_dir ../training_data/parsed \
|
||||
--epochs 200 \
|
||||
--model FieldPredictorGNN \
|
||||
--hidden_channels 128 \
|
||||
--num_layers 6 \
|
||||
--physics_loss_weight 0.3
|
||||
```
|
||||
|
||||
### Step 4: Validate
|
||||
|
||||
```bash
|
||||
python validate.py --checkpoint runs/my_model/checkpoint_best.pt
|
||||
```
|
||||
|
||||
Expected output:
|
||||
```
|
||||
Validation Results:
|
||||
├── Mean Absolute Error: 2.3% (mass), 1.8% (frequency)
|
||||
├── R² Score: 0.987
|
||||
├── Inference Time: 4.5ms ± 0.8ms
|
||||
└── Physics Violations: 0.2%
|
||||
```
|
||||
|
||||
### Step 5: Deploy
|
||||
|
||||
```json
|
||||
{
|
||||
"neural_surrogate": {
|
||||
"enabled": true,
|
||||
"model_checkpoint": "atomizer-field/runs/my_model/checkpoint_best.pt",
|
||||
"confidence_threshold": 0.85
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
## Configuration
|
||||
|
||||
### Full Neural Configuration Example
|
||||
|
||||
```json
|
||||
{
|
||||
"study_name": "bracket_neural_optimization",
|
||||
|
||||
"surrogate_settings": {
|
||||
"enabled": true,
|
||||
"model_type": "parametric_gnn",
|
||||
"model_path": "models/bracket_surrogate.pt",
|
||||
"confidence_threshold": 0.85,
|
||||
"validation_frequency": 10,
|
||||
"fallback_to_fea": true
|
||||
},
|
||||
|
||||
"training_data_export": {
|
||||
"enabled": true,
|
||||
"export_dir": "atomizer_field_training_data/bracket_study",
|
||||
"export_bdf": true,
|
||||
"export_op2": true,
|
||||
"export_fields": ["displacement", "stress"]
|
||||
},
|
||||
|
||||
"neural_optimization": {
|
||||
"initial_fea_trials": 50,
|
||||
"neural_trials": 5000,
|
||||
"retraining_interval": 500,
|
||||
"uncertainty_threshold": 0.15
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
### Configuration Parameters
|
||||
|
||||
| Parameter | Type | Default | Description |
|
||||
|-----------|------|---------|-------------|
|
||||
| `enabled` | bool | false | Enable neural surrogate |
|
||||
| `model_type` | string | "parametric_gnn" | Model architecture |
|
||||
| `model_path` | string | - | Path to trained model |
|
||||
| `confidence_threshold` | float | 0.85 | Min confidence for predictions |
|
||||
| `validation_frequency` | int | 10 | FEA validation every N trials |
|
||||
| `fallback_to_fea` | bool | true | Use FEA when uncertain |
|
||||
|
||||
---
|
||||
|
||||
## Hybrid FEA/Neural Workflow
|
||||
|
||||
### Phase 1: FEA Exploration (50-100 trials)
|
||||
- Run standard FEA optimization
|
||||
- Export training data automatically
|
||||
- Build landscape understanding
|
||||
|
||||
### Phase 2: Neural Training
|
||||
- Parse collected data
|
||||
- Train parametric predictor
|
||||
- Validate accuracy
|
||||
|
||||
### Phase 3: Neural Acceleration (1000s of trials)
|
||||
- Use neural network for rapid exploration
|
||||
- Periodic FEA validation
|
||||
- Retrain if distribution shifts
|
||||
|
||||
### Phase 4: FEA Refinement (10-20 trials)
|
||||
- Validate top candidates with FEA
|
||||
- Ensure results are physically accurate
|
||||
- Generate final Pareto front
|
||||
|
||||
---
|
||||
|
||||
## Adaptive Iteration Loop
|
||||
|
||||
For complex optimizations, use iterative refinement:
|
||||
|
||||
```
|
||||
┌─────────────────────────────────────────────────────────────────┐
|
||||
│ Iteration 1: │
|
||||
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
|
||||
│ │ Initial FEA │ -> │ Train NN │ -> │ NN Search │ │
|
||||
│ │ (50-100) │ │ Surrogate │ │ (1000 trials)│ │
|
||||
│ └──────────────┘ └──────────────┘ └──────────────┘ │
|
||||
│ │ │
|
||||
│ Iteration 2+: ▼ │
|
||||
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
|
||||
│ │ Validate Top │ -> │ Retrain NN │ -> │ NN Search │ │
|
||||
│ │ NN with FEA │ │ with new data│ │ (1000 trials)│ │
|
||||
│ └──────────────┘ └──────────────┘ └──────────────┘ │
|
||||
└─────────────────────────────────────────────────────────────────┘
|
||||
```
|
||||
|
||||
### Adaptive Configuration
|
||||
|
||||
```json
|
||||
{
|
||||
"adaptive_settings": {
|
||||
"enabled": true,
|
||||
"initial_fea_trials": 50,
|
||||
"nn_trials_per_iteration": 1000,
|
||||
"fea_validation_per_iteration": 5,
|
||||
"max_iterations": 10,
|
||||
"convergence_threshold": 0.01,
|
||||
"retrain_epochs": 100
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
### Convergence Criteria
|
||||
|
||||
Stop when:
|
||||
- No improvement for 2-3 consecutive iterations
|
||||
- Reached FEA budget limit
|
||||
- Objective improvement < 1% threshold
|
||||
|
||||
### Output Files
|
||||
|
||||
```
|
||||
studies/my_study/3_results/
|
||||
├── adaptive_state.json # Current iteration state
|
||||
├── surrogate_model.pt # Trained neural network
|
||||
└── training_history.json # NN training metrics
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
## Loss Functions
|
||||
|
||||
### Data Loss (MSE)
|
||||
Standard prediction error:
|
||||
```python
|
||||
data_loss = MSE(predicted, target)
|
||||
```
|
||||
|
||||
### Physics Loss
|
||||
Enforce physical constraints:
|
||||
```python
|
||||
physics_loss = (
|
||||
equilibrium_loss + # Force balance
|
||||
boundary_loss + # BC satisfaction
|
||||
compatibility_loss # Strain compatibility
|
||||
)
|
||||
```
|
||||
|
||||
### Combined Training
|
||||
```python
|
||||
total_loss = data_loss + 0.3 * physics_loss
|
||||
```
|
||||
|
||||
Physics loss weight typically 0.1-0.5.
|
||||
|
||||
---
|
||||
|
||||
## Uncertainty Quantification
|
||||
|
||||
### Ensemble Method
|
||||
```python
|
||||
# Run N models
|
||||
predictions = [model_i(x) for model_i in ensemble]
|
||||
|
||||
# Statistics
|
||||
mean_prediction = np.mean(predictions)
|
||||
uncertainty = np.std(predictions)
|
||||
|
||||
# Decision
|
||||
if uncertainty > threshold:
|
||||
# Use FEA instead
|
||||
result = run_fea(x)
|
||||
else:
|
||||
result = mean_prediction
|
||||
```
|
||||
|
||||
### Confidence Thresholds
|
||||
|
||||
| Uncertainty | Action |
|
||||
|-------------|--------|
|
||||
| < 5% | Use neural prediction |
|
||||
| 5-15% | Use neural, flag for validation |
|
||||
| > 15% | Fall back to FEA |
|
||||
|
||||
---
|
||||
|
||||
## Troubleshooting
|
||||
|
||||
| Symptom | Cause | Solution |
|
||||
|---------|-------|----------|
|
||||
| High prediction error | Insufficient training data | Collect more FEA samples |
|
||||
| Out-of-distribution warnings | Design outside training range | Retrain with expanded range |
|
||||
| Slow inference | Large mesh | Use parametric predictor instead |
|
||||
| Physics violations | Low physics loss weight | Increase `physics_loss_weight` |
|
||||
|
||||
---
|
||||
|
||||
## Cross-References
|
||||
|
||||
- **Depends On**: [SYS_10_IMSO](./SYS_10_IMSO.md) for optimization framework
|
||||
- **Used By**: [OP_02_RUN_OPTIMIZATION](../operations/OP_02_RUN_OPTIMIZATION.md), [OP_05_EXPORT_TRAINING_DATA](../operations/OP_05_EXPORT_TRAINING_DATA.md)
|
||||
- **See Also**: [modules/neural-acceleration.md](../../.claude/skills/modules/neural-acceleration.md)
|
||||
|
||||
---
|
||||
|
||||
## Implementation Files
|
||||
|
||||
```
|
||||
atomizer-field/
|
||||
├── neural_field_parser.py # BDF/OP2 parsing
|
||||
├── field_predictor.py # Field GNN
|
||||
├── parametric_predictor.py # Parametric GNN
|
||||
├── train.py # Field training
|
||||
├── train_parametric.py # Parametric training
|
||||
├── validate.py # Model validation
|
||||
├── physics_losses.py # Physics-informed loss
|
||||
└── batch_parser.py # Batch data conversion
|
||||
|
||||
optimization_engine/
|
||||
├── neural_surrogate.py # Atomizer integration
|
||||
└── runner_with_neural.py # Neural runner
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
## Version History
|
||||
|
||||
| Version | Date | Changes |
|
||||
|---------|------|---------|
|
||||
| 2.0 | 2025-12-06 | Added MLP Surrogate with Turbo Mode |
|
||||
| 1.0 | 2025-12-05 | Initial consolidation from neural docs |
|
||||
Reference in New Issue
Block a user