
Antoine 602560c46a feat: Add MLP surrogate with Turbo Mode for 100x faster optimization

Neural Acceleration (MLP Surrogate):
- Add run_nn_optimization.py with hybrid FEA/NN workflow
- MLP architecture: 4-layer (64->128->128->64) with BatchNorm/Dropout
- Three workflow modes:
  - --all: Sequential export->train->optimize->validate
  - --hybrid-loop: Iterative Train->NN->Validate->Retrain cycle
  - --turbo: Aggressive single-best validation (RECOMMENDED)
- Turbo mode: 5000 NN trials + 50 FEA validations in ~12 minutes
- Separate nn_study.db to avoid overloading dashboard

Performance Results (bracket_pareto_3obj study):
- NN prediction errors: mass 1-5%, stress 1-4%, stiffness 5-15%
- Found minimum mass designs at boundary (angle~30deg, thick~30mm)
- 100x speedup vs pure FEA exploration

Protocol Operating System:
- Add .claude/skills/ with Bootstrap, Cheatsheet, Context Loader
- Add docs/protocols/ with operations (OP_01-06) and system (SYS_10-14)
- Update SYS_14_NEURAL_ACCELERATION.md with MLP Turbo Mode docs

NX Automation:
- Add optimization_engine/hooks/ for NX CAD/CAE automation
- Add study_wizard.py for guided study creation
- Fix FEM mesh update: load idealized part before UpdateFemodel()

New Study:
- bracket_pareto_3obj: 3-objective Pareto (mass, stress, stiffness)
- 167 FEA trials + 5000 NN trials completed
- Demonstrates full hybrid workflow

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

2025-12-06 20:01:59 -05:00


SYS_14: Neural Network Acceleration

Overview

Atomizer provides neural network surrogate acceleration, enabling 100-1000x faster optimization by replacing expensive FEA evaluations with near-instant neural predictions.

Two approaches are available:

MLP Surrogate (Simple, integrated) - 4-layer MLP trained on FEA data, runs within study
GNN Field Predictor (Advanced) - Graph neural network for full field predictions

Key Innovation: Train once on FEA data, then explore 5,000-50,000+ designs in the time it takes to run 50 FEA trials.

When to Use

Trigger	Action
>50 trials needed	Consider neural acceleration
"neural", "surrogate", "NN" mentioned	Load this protocol
"fast", "acceleration", "speed" needed	Suggest neural acceleration
Training data available	Enable surrogate

Quick Reference

Performance Comparison:

Metric	Traditional FEA	Neural Network	Improvement
Time per evaluation	10-30 minutes	4.5 milliseconds	~130,000-400,000x
Trials per hour	2-6	800,000+	~130,000-400,000x
Design exploration	~50 designs	~50,000 designs	1000x
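As a quick check of the per-evaluation figures in the table, the ratio can be worked out directly. This is a back-of-the-envelope sketch that uses only the numbers quoted above (10-30 minutes per FEA solve, 4.5 ms per NN prediction). It ignores training, retraining, and FEA validation overhead, so end-to-end gains for a full study will be much lower. The `speedup` helper is hypothetical.

```python
FEA_MINUTES = (10, 30)   # per-evaluation FEA time range quoted in the table
NN_SECONDS = 4.5e-3      # per-evaluation NN inference time quoted in the table


def speedup(fea_minutes: float, nn_seconds: float = NN_SECONDS) -> float:
    """Ratio of FEA wall time to NN inference time for a single evaluation."""
    return fea_minutes * 60.0 / nn_seconds


low, high = (speedup(m) for m in FEA_MINUTES)
nn_per_hour = 3600.0 / NN_SECONDS

print(f"per-evaluation speedup: {low:,.0f}x to {high:,.0f}x")
print(f"NN evaluations per hour: {nn_per_hour:,.0f}")
```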

Model Types:

Model	Purpose	Use When
MLP Surrogate	Direct objective prediction	Simple studies, quick setup
Field Predictor GNN	Full displacement/stress fields	Need field visualization
Parametric Predictor GNN	Direct objective prediction	Complex geometry, need accuracy
Ensemble	Uncertainty quantification	Need confidence bounds

MLP Surrogate (Recommended for Quick Start)

Overview

The MLP (Multi-Layer Perceptron) surrogate is a simple but effective neural network that predicts objectives directly from design parameters. It's integrated into the study workflow via run_nn_optimization.py.

Architecture

Input Layer (N design variables)
    ↓
Linear(N, 64) + ReLU + BatchNorm + Dropout(0.1)
    ↓
Linear(64, 128) + ReLU + BatchNorm + Dropout(0.1)
    ↓
Linear(128, 128) + ReLU + BatchNorm + Dropout(0.1)
    ↓
Linear(128, 64) + ReLU + BatchNorm + Dropout(0.1)
    ↓
Linear(64, M objectives)

Parameters: ~34,000 trainable

Workflow Modes

1. Standard Hybrid Mode (`--all`)

Run all phases sequentially:

python run_nn_optimization.py --all

Phases:

Export: Extract training data from existing FEA trials
Train: Train MLP surrogate (300 epochs default)
NN-Optimize: Run 1000 NN trials with NSGA-II
Validate: Validate top 10 candidates with FEA

2. Hybrid Loop Mode (`--hybrid-loop`)

Iterative refinement:

python run_nn_optimization.py --hybrid-loop --iterations 5 --nn-trials 500

Each iteration:

Train/retrain surrogate from current FEA data
Run NN optimization
Validate top candidates with FEA
Add validated results to training set
Repeat until convergence (max error < 5%)
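The convergence check in the last step could look like the sketch below. The document does not say how "max error" is computed, so this assumes it is the largest relative error between NN prediction and FEA result across objectives for a validated candidate. The function name, objective keys, and numbers are illustrative only.

```python
def max_relative_error(predicted: dict, fea: dict) -> float:
    """Largest |NN - FEA| / |FEA| over all objectives (FEA values must be nonzero)."""
    return max(abs(predicted[k] - fea[k]) / abs(fea[k]) for k in fea)


# Made-up values for one validated candidate.
nn_pred = {"mass": 102.0, "stress": 198.0, "stiffness": 9.2}
fea_result = {"mass": 100.0, "stress": 200.0, "stiffness": 10.0}

err = max_relative_error(nn_pred, fea_result)
converged = err < 0.05   # the 5% threshold quoted above

print(f"max relative error: {err:.1%}, converged: {converged}")
```

Here stiffness is off by 8%, so the loop would run another iteration even though mass and stress are already within 2%.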

3. Turbo Mode (`--turbo`) ⚡ RECOMMENDED

Aggressive single-best validation:

python run_nn_optimization.py --turbo --nn-trials 5000 --batch-size 100 --retrain-every 10

Strategy:

Run NN in small batches (100 trials)
Validate ONLY the single best candidate with FEA
Add to training data immediately
Retrain surrogate every N FEA validations
Repeat until total NN budget exhausted

Example: 5,000 NN trials with batch=100 → 50 FEA validations in ~12 minutes
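The strategy above can be sketched as a small loop. This is not the code in run_nn_optimization.py: it is a simplified, single-objective illustration in which `propose`, `predict`, `run_fea`, and `retrain` are hypothetical callables, and toy lambdas stand in for the sampler, the surrogate, and the FEA solve.

```python
import random


def turbo_loop(nn_trials, batch_size, retrain_every,
               propose, predict, run_fea, retrain):
    """Run NN batches and FEA-validate only the single best candidate of each."""
    training_data = []            # (design, fea_result) pairs from validation
    fea_runs = retrains = 0
    for _ in range(nn_trials // batch_size):
        batch = [propose() for _ in range(batch_size)]
        best = min(batch, key=predict)          # lowest predicted objective
        training_data.append((best, run_fea(best)))
        fea_runs += 1
        if fea_runs % retrain_every == 0:       # periodic surrogate refresh
            retrain(training_data)
            retrains += 1
    return fea_runs, retrains, training_data


# Toy stand-ins so the sketch runs without NX or a trained model.
rng = random.Random(0)
retrain_calls = []
fea_runs, retrains, data = turbo_loop(
    nn_trials=5000, batch_size=100, retrain_every=10,
    propose=lambda: rng.uniform(0.0, 1.0),
    predict=lambda x: (x - 0.3) ** 2,           # pretend surrogate
    run_fea=lambda x: (x - 0.3) ** 2 + 0.01,    # pretend FEA solve
    retrain=lambda d: retrain_calls.append(len(d)),
)
print(fea_runs, retrains)
```

With the flag values from the command above, the loop performs 50 FEA validations and 5 retrains, which matches the 5,000 / 100 = 50 arithmetic in the example.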

Configuration

{
  "neural_acceleration": {
    "enabled": true,
    "min_training_points": 50,
    "auto_train": true,
    "epochs": 300,
    "validation_split": 0.2,
    "nn_trials": 1000,
    "validate_top_n": 10,
    "model_file": "surrogate_best.pt",
    "separate_nn_database": true
  }
}

Important: separate_nn_database: true stores NN trials in nn_study.db instead of study.db to avoid overloading the dashboard with thousands of NN-only results.
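A minimal sketch of how a script might read these settings. The helper names are hypothetical, and treating `min_training_points` as the minimum number of completed FEA trials required before training is an assumption, not something the configuration block states.

```python
import json

CONFIG_TEXT = """
{
  "neural_acceleration": {
    "enabled": true,
    "min_training_points": 50,
    "separate_nn_database": true
  }
}
"""


def nn_database_name(cfg: dict) -> str:
    """Pick the database file that NN-only trials are written to."""
    neural = cfg.get("neural_acceleration", {})
    return "nn_study.db" if neural.get("separate_nn_database", False) else "study.db"


def can_train(cfg: dict, n_fea_trials: int) -> bool:
    """Assumed rule: train only when enabled and enough FEA trials exist."""
    neural = cfg.get("neural_acceleration", {})
    return bool(neural.get("enabled", False)
                and n_fea_trials >= neural.get("min_training_points", 0))


cfg = json.loads(CONFIG_TEXT)
print(nn_database_name(cfg), can_train(cfg, 167), can_train(cfg, 20))
```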

Typical Accuracy

Objective	Expected Error
Mass	1-5%
Stress	1-4%
Stiffness	5-15%

Output Files

2_results/
├── study.db                    # Main FEA + validated results (dashboard)
├── nn_study.db                 # NN-only results (not in dashboard)
├── surrogate_best.pt           # Trained model weights
├── training_data.json          # Normalized training data
├── nn_optimization_state.json  # NN optimization state
├── nn_pareto_front.json        # NN-predicted Pareto front
├── validation_report.json      # FEA validation results
└── turbo_report.json           # Turbo mode results (if used)

GNN Field Predictor (Advanced)

Core Components

Component	File	Purpose
BDF/OP2 Parser	`neural_field_parser.py`	Convert NX files to neural format
Data Validator	`validate_parsed_data.py`	Physics and quality checks
Field Predictor	`field_predictor.py`	GNN for full field prediction
Parametric Predictor	`parametric_predictor.py`	GNN for direct objectives
Physics Loss	`physics_losses.py`	Physics-informed training
Neural Surrogate	`neural_surrogate.py`	Integration with Atomizer
Neural Runner	`runner_with_neural.py`	Optimization with NN acceleration

Workflow Diagram

Traditional:
Design → NX Model → Mesh → Solve (30 min) → Results → Objective

Neural (after training):
Design → Neural Network (4.5 ms) → Results → Objective

Neural Model Types

1. Field Predictor GNN

Use Case: When you need full field predictions (stress distribution, deformation shape).

Input Features (12D per node):
├── Node coordinates (x, y, z)
├── Material properties (E, nu, rho)
├── Boundary conditions (fixed/free per DOF)
└── Load information (force magnitude, direction)

GNN Layers (6 message passing):
├── MeshGraphConv (custom for FEA topology)
├── Layer normalization
├── ReLU activation
└── Dropout (0.1)

Output (per node):
├── Displacement (6 DOF: Tx, Ty, Tz, Rx, Ry, Rz)
└── Von Mises stress (1 value)

Parameters: ~718,221 trainable

2. Parametric Predictor GNN (Recommended)

Use Case: Direct optimization objective prediction (fastest option).

Design Parameters (ND) → Design Encoder (MLP) → GNN Backbone → Scalar Heads

Output (objectives):
├── mass (grams)
├── frequency (Hz)
├── max_displacement (mm)
└── max_stress (MPa)

Parameters: ~500,000 trainable

3. Ensemble Models

Use Case: Uncertainty quantification.

Train 3-5 models with different random seeds
At inference, run all models
Use mean for prediction, std for uncertainty
High uncertainty → trigger FEA validation

Training Pipeline

Step 1: Collect Training Data

Enable export in workflow config:

{
  "training_data_export": {
    "enabled": true,
    "export_dir": "atomizer_field_training_data/my_study"
  }
}

Output structure:

atomizer_field_training_data/my_study/
├── trial_0001/
│   ├── input/model.bdf       # Nastran input
│   ├── output/model.op2      # Binary results
│   └── metadata.json         # Design params + objectives
├── trial_0002/
│   └── ...
└── study_summary.json

Recommended: 100-500 FEA samples for good generalization.
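Before parsing, it can help to confirm that each exported trial is complete. The sketch below walks the directory layout shown above and loads `metadata.json` for trials that have all three files. `load_trial_metadata` is a hypothetical helper, and the `{"trial": ...}` metadata content in the demo is invented, since the real schema is not shown here.

```python
import json
import tempfile
from pathlib import Path


def load_trial_metadata(export_dir) -> dict:
    """Return {trial_name: metadata} for trials with BDF, OP2 and metadata present."""
    trials = {}
    for trial_dir in sorted(Path(export_dir).glob("trial_*")):
        bdf = trial_dir / "input" / "model.bdf"
        op2 = trial_dir / "output" / "model.op2"
        meta = trial_dir / "metadata.json"
        if bdf.is_file() and op2.is_file() and meta.is_file():
            trials[trial_dir.name] = json.loads(meta.read_text())
    return trials


# Demo on a throwaway directory: trial 1 is complete, trial 2 lacks its OP2.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp) / "my_study"
    for i, complete in [(1, True), (2, False)]:
        t = root / f"trial_{i:04d}"
        (t / "input").mkdir(parents=True)
        (t / "output").mkdir()
        (t / "input" / "model.bdf").write_text("$ placeholder deck\n")
        if complete:
            (t / "output" / "model.op2").write_bytes(b"")
        (t / "metadata.json").write_text(json.dumps({"trial": i}))
    found = load_trial_metadata(root)

print(sorted(found))
```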

Step 2: Parse to Neural Format

cd atomizer-field
python batch_parser.py ../atomizer_field_training_data/my_study

Creates HDF5 + JSON files per trial.

Step 3: Train Model

Parametric Predictor (recommended):

python train_parametric.py \
  --train_dir ../training_data/parsed \
  --val_dir ../validation_data/parsed \
  --epochs 200 \
  --hidden_channels 128 \
  --num_layers 4

Field Predictor:

python train.py \
  --train_dir ../training_data/parsed \
  --epochs 200 \
  --model FieldPredictorGNN \
  --hidden_channels 128 \
  --num_layers 6 \
  --physics_loss_weight 0.3

Step 4: Validate

python validate.py --checkpoint runs/my_model/checkpoint_best.pt

Expected output:

Validation Results:
├── Mean Absolute Error: 2.3% (mass), 1.8% (frequency)
├── R² Score: 0.987
├── Inference Time: 4.5ms ± 0.8ms
└── Physics Violations: 0.2%
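For reference, the first two metrics in that report can be computed as in the sketch below. The exact definitions used by validate.py are not shown in this document, so this assumes the percentage error is a mean absolute relative error and that R² is the usual coefficient of determination. The sample values are made up.

```python
def mean_abs_pct_error(pred, true) -> float:
    """Mean of |pred - true| / |true|, in percent (true values must be nonzero)."""
    return 100.0 * sum(abs(p - t) / abs(t) for p, t in zip(pred, true)) / len(true)


def r2_score(pred, true) -> float:
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(true) / len(true)
    ss_res = sum((t - p) ** 2 for p, t in zip(pred, true))
    ss_tot = sum((t - mean_true) ** 2 for t in true)
    return 1.0 - ss_res / ss_tot


# Made-up FEA masses and NN predictions for four held-out designs.
fea_mass = [100.0, 120.0, 140.0, 160.0]
nn_mass = [102.0, 118.0, 141.0, 158.0]

mape = mean_abs_pct_error(nn_mass, fea_mass)
r2 = r2_score(nn_mass, fea_mass)
print(f"MAPE: {mape:.2f}%  R2: {r2:.4f}")
```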

Step 5: Deploy

{
  "neural_surrogate": {
    "enabled": true,
    "model_checkpoint": "atomizer-field/runs/my_model/checkpoint_best.pt",
    "confidence_threshold": 0.85
  }
}

Configuration

Full Neural Configuration Example

{
  "study_name": "bracket_neural_optimization",

  "surrogate_settings": {
    "enabled": true,
    "model_type": "parametric_gnn",
    "model_path": "models/bracket_surrogate.pt",
    "confidence_threshold": 0.85,
    "validation_frequency": 10,
    "fallback_to_fea": true
  },

  "training_data_export": {
    "enabled": true,
    "export_dir": "atomizer_field_training_data/bracket_study",
    "export_bdf": true,
    "export_op2": true,
    "export_fields": ["displacement", "stress"]
  },

  "neural_optimization": {
    "initial_fea_trials": 50,
    "neural_trials": 5000,
    "retraining_interval": 500,
    "uncertainty_threshold": 0.15
  }
}

Configuration Parameters

Parameter	Type	Default	Description
`enabled`	bool	false	Enable neural surrogate
`model_type`	string	"parametric_gnn"	Model architecture
`model_path`	string	-	Path to trained model
`confidence_threshold`	float	0.85	Min confidence for predictions
`validation_frequency`	int	10	FEA validation every N trials
`fallback_to_fea`	bool	true	Use FEA when uncertain
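One way to apply the defaults in the table when loading `surrogate_settings` is a small dataclass, sketched below. `SurrogateSettings` is a hypothetical class for illustration, not the loader Atomizer uses. Rejecting unknown keys is a design choice made here to catch typos early.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class SurrogateSettings:
    # Defaults copied from the parameter table above.
    enabled: bool = False
    model_type: str = "parametric_gnn"
    model_path: Optional[str] = None      # "-" in the table: no default
    confidence_threshold: float = 0.85
    validation_frequency: int = 10
    fallback_to_fea: bool = True

    @classmethod
    def from_dict(cls, raw: dict) -> "SurrogateSettings":
        """Build settings from a config dict, rejecting unrecognized keys."""
        known = {f.name for f in fields(cls)}
        unknown = set(raw) - known
        if unknown:
            raise ValueError(f"unknown surrogate_settings keys: {sorted(unknown)}")
        return cls(**raw)


settings = SurrogateSettings.from_dict(
    {"enabled": True, "model_path": "models/bracket_surrogate.pt"}
)
print(settings)
```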

Hybrid FEA/Neural Workflow

Phase 1: FEA Exploration (50-100 trials)

Run standard FEA optimization
Export training data automatically
Build landscape understanding

Phase 2: Neural Training

Parse collected data
Train parametric predictor
Validate accuracy

Phase 3: Neural Acceleration (1000s of trials)

Use neural network for rapid exploration
Periodic FEA validation
Retrain if distribution shifts

Phase 4: FEA Refinement (10-20 trials)

Validate top candidates with FEA
Ensure results are physically accurate
Generate final Pareto front

Adaptive Iteration Loop

For complex optimizations, use iterative refinement:

┌─────────────────────────────────────────────────────────────────┐
│  Iteration 1:                                                    │
│  ┌──────────────┐    ┌──────────────┐    ┌──────────────┐       │
│  │ Initial FEA  │ -> │ Train NN     │ -> │ NN Search    │       │
│  │ (50-100)     │    │ Surrogate    │    │ (1000 trials)│       │
│  └──────────────┘    └──────────────┘    └──────────────┘       │
│                                                 │                │
│  Iteration 2+:                                  ▼                │
│  ┌──────────────┐    ┌──────────────┐    ┌──────────────┐       │
│  │ Validate Top │ -> │ Retrain NN   │ -> │ NN Search    │       │
│  │ NN with FEA  │    │ with new data│    │ (1000 trials)│       │
│  └──────────────┘    └──────────────┘    └──────────────┘       │
└─────────────────────────────────────────────────────────────────┘

Adaptive Configuration

{
  "adaptive_settings": {
    "enabled": true,
    "initial_fea_trials": 50,
    "nn_trials_per_iteration": 1000,
    "fea_validation_per_iteration": 5,
    "max_iterations": 10,
    "convergence_threshold": 0.01,
    "retrain_epochs": 100
  }
}

Convergence Criteria

Stop when any of the following holds:

No improvement for 2-3 consecutive iterations
FEA budget limit reached
Objective improvement falls below the 1% threshold
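The criteria above could be combined into a single check, as in this sketch. It is one possible interpretation: "no improvement" and "improvement below 1%" are merged into a single rule (relative gain below the threshold for `patience` consecutive iterations), and the objective is assumed to be minimized and nonzero. `should_stop` and its arguments are hypothetical names.

```python
def should_stop(best_per_iteration, fea_used, fea_budget,
                threshold=0.01, patience=2):
    """Return a reason string if a stop criterion holds, else None."""
    if fea_used >= fea_budget:
        return "fea_budget"
    if len(best_per_iteration) > patience:
        recent = best_per_iteration[-(patience + 1):]
        # Relative gain of each iteration over the one before it.
        gains = [(a - b) / abs(a) for a, b in zip(recent, recent[1:])]
        if all(g < threshold for g in gains):
            return "no_improvement"
    return None


# Best validated objective after each iteration (made-up values).
print(should_stop([100.0, 90.0, 89.5, 89.2], fea_used=60, fea_budget=100))
print(should_stop([100.0, 90.0, 80.0], fea_used=60, fea_budget=100))
```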

Output Files

studies/my_study/3_results/
├── adaptive_state.json      # Current iteration state
├── surrogate_model.pt       # Trained neural network
└── training_history.json    # NN training metrics

Loss Functions

Data Loss (MSE)

Standard prediction error:

data_loss = MSE(predicted, target)

Physics Loss

Enforce physical constraints:

physics_loss = (
    equilibrium_loss +      # Force balance
    boundary_loss +         # BC satisfaction
    compatibility_loss      # Strain compatibility
)

Combined Training

total_loss = data_loss + 0.3 * physics_loss

The physics loss weight is typically 0.1-0.5.
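A small worked example of the combined loss, using plain Python floats rather than tensors. The three physics terms are passed in as already-computed numbers, because their actual formulas live in physics_losses.py and are not reproduced here. All values are made up.

```python
def mse(predicted, target) -> float:
    """Mean squared error between two equal-length sequences."""
    return sum((p - t) ** 2 for p, t in zip(predicted, target)) / len(target)


def total_loss(predicted, target, physics_terms, physics_weight=0.3) -> float:
    """data_loss + physics_weight * (sum of physics penalty terms)."""
    physics_loss = sum(physics_terms.values())
    return mse(predicted, target) + physics_weight * physics_loss


loss = total_loss(
    predicted=[1.0, 2.0, 3.0],
    target=[1.0, 2.5, 2.5],
    physics_terms={"equilibrium": 0.10, "boundary": 0.05, "compatibility": 0.05},
)
print(f"{loss:.4f}")
```

The data term is (0 + 0.25 + 0.25) / 3 ≈ 0.1667 and the weighted physics term is 0.3 × 0.2 = 0.06, giving a total of about 0.2267.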

Uncertainty Quantification

Ensemble Method

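import numpy as np

# Run all N ensemble members on the same design x
# (assumes each model returns an array of objective values)
predictions = np.stack([np.asarray(model_i(x)) for model_i in ensemble])

# Statistics across ensemble members, per objective
mean_prediction = predictions.mean(axis=0)
uncertainty = predictions.std(axis=0)

# Decision: fall back to FEA if any objective is too uncertain
if np.any(uncertainty > threshold):
    result = run_fea(x)
else:
    result = mean_prediction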

Confidence Thresholds

Uncertainty	Action
< 5%	Use neural prediction
5-15%	Use neural, flag for validation
> 15%	Fall back to FEA
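The table can be expressed as a small lookup. The document does not say how the uncertainty percentage is normalized, so this sketch assumes it is the ensemble standard deviation divided by the absolute ensemble mean. The function names and action labels are hypothetical.

```python
from statistics import mean, pstdev


def relative_uncertainty(predictions) -> float:
    """Population std of ensemble predictions divided by |mean| (assumed definition)."""
    return pstdev(predictions) / abs(mean(predictions))


def action_for(uncertainty: float) -> str:
    """Map a relative uncertainty to the action listed in the table above."""
    if uncertainty < 0.05:
        return "use_neural"
    if uncertainty <= 0.15:
        return "use_neural_flag_for_validation"
    return "fallback_to_fea"


# Made-up mass predictions from a three-member ensemble.
ensemble_mass = [100.0, 102.0, 98.0]
u = relative_uncertainty(ensemble_mass)
print(f"{u:.2%} -> {action_for(u)}")
```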

Troubleshooting

Symptom	Cause	Solution
High prediction error	Insufficient training data	Collect more FEA samples
Out-of-distribution warnings	Design outside training range	Retrain with expanded range
Slow inference	Large mesh	Use parametric predictor instead
Physics violations	Low physics loss weight	Increase `physics_loss_weight`

Cross-References

Depends On: SYS_10_IMSO for optimization framework
Used By: OP_02_RUN_OPTIMIZATION, OP_05_EXPORT_TRAINING_DATA
See Also: modules/neural-acceleration.md

Implementation Files

atomizer-field/
├── neural_field_parser.py       # BDF/OP2 parsing
├── field_predictor.py           # Field GNN
├── parametric_predictor.py      # Parametric GNN
├── train.py                     # Field training
├── train_parametric.py          # Parametric training
├── validate.py                  # Model validation
├── physics_losses.py            # Physics-informed loss
└── batch_parser.py              # Batch data conversion

optimization_engine/
├── neural_surrogate.py          # Atomizer integration
└── runner_with_neural.py        # Neural runner

Version History

Version	Date	Changes
2.0	2025-12-06	Added MLP Surrogate with Turbo Mode
1.0	2025-12-05	Initial consolidation from neural docs
