feat: Major update with validators, skills, dashboard, and docs reorganization
- Add validation framework (config, model, results, study validators)
- Add Claude Code skills (create-study, run-optimization, generate-report, troubleshoot, analyze-model)
- Add Atomizer Dashboard (React frontend + FastAPI backend)
- Reorganize docs into structured directories (00-09)
- Add neural surrogate modules and training infrastructure
- Add multi-objective optimization support

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
docs/03_GETTING_STARTED.md (new file, 297 lines)
# How to Extend an Optimization Study

**Date**: November 20, 2025

When you want to run more iterations to get better results, you have three options:

---

## Option 1: Continue Existing Study (Recommended)

**Best for**: When you want to keep all previous trial data and just add more iterations

**Advantages**:
- Preserves all existing trials
- Continues from current best result
- Uses accumulated knowledge from previous trials
- More efficient (no wasted trials)

**Process**:

### Step 1: Wait for current optimization to finish
Check if the v2.1 test is still running:
```bash
# On Windows
tasklist | findstr python

# Check background job status
# Look for the running optimization process
```

### Step 2: Run the continuation script
```bash
cd studies/circular_plate_protocol10_v2_1_test
python continue_optimization.py
```

### Step 3: Configure number of additional trials
Edit [continue_optimization.py:29](../studies/circular_plate_protocol10_v2_1_test/continue_optimization.py#L29):
```python
# CONFIGURE THIS: Number of additional trials to run
ADDITIONAL_TRIALS = 50  # Change to 100 for a total of ~150 trials
```

**Example**: If you ran 50 trials initially and want 100 total:
- Set `ADDITIONAL_TRIALS = 50`
- Study will run trials #50-99 (continuing from where it left off)
- All 100 trials will be in the same study database

---

## Option 2: Modify Config and Restart

**Best for**: When you want a completely fresh start with more iterations

**Advantages**:
- Clean slate optimization
- Good for testing different configurations
- Simpler to understand (one continuous run)

**Disadvantages**:
- Loses all previous trial data
- Wastes computational budget if previous trials were good

**Process**:

### Step 1: Stop any running optimization
```bash
# Kill the running process if needed
# On Windows, find the PID and:
taskkill /PID <process_id> /F
```

### Step 2: Edit optimization config
Edit [studies/circular_plate_protocol10_v2_1_test/1_setup/optimization_config.json](../studies/circular_plate_protocol10_v2_1_test/1_setup/optimization_config.json):
```json
{
  "trials": {
    "n_trials": 100,
    "timeout_per_trial": 3600
  }
}
```
Change `n_trials` from 50 to 100; note that strict JSON does not allow inline comments, so keep the file free of `//` annotations.

### Step 3: Delete old results
```bash
cd studies/circular_plate_protocol10_v2_1_test

# Delete old database and history
del 2_results\study.db
del 2_results\optimization_history_incremental.json
del 2_results\intelligent_optimizer\*.*
```

### Step 4: Rerun optimization
```bash
python run_optimization.py
```

---

## Option 3: Wait and Evaluate First

**Best for**: When you're not sure if more iterations are needed

**Process**:

### Step 1: Wait for current test to finish
The v2.1 test is currently running with 50 trials. Let it complete first.
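
If you'd rather script the wait, a minimal sketch is to poll until a results file appears on disk. The path and polling interval here are assumptions, not part of the toolkit; point it at whichever file your run writes on completion.

```python
import os
import time

def wait_for_file(path, poll_seconds=60):
    """Block until the given results file appears on disk."""
    while not os.path.exists(path):
        time.sleep(poll_seconds)

# Hypothetical usage (adjust the path to your study layout):
# wait_for_file("2_results/test_summary.json")
```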

### Step 2: Check results
```bash
cd studies/circular_plate_protocol10_v2_1_test

# View optimization report
type 3_reports\OPTIMIZATION_REPORT.md

# Or check test summary
type 2_results\test_summary.json
```

### Step 3: Evaluate performance
Look at:
- **Best error**: Is it < 0.1 Hz? (target achieved)
- **Convergence**: Has it plateaued, or is it still improving?
- **Pruning rate**: < 5% is good
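
The convergence check can be sketched in a few lines of plain Python. `has_plateaued` and its thresholds are illustrative only, not part of the toolkit; feed it the best-so-far error after each trial.

```python
def has_plateaued(best_errors, window=20, min_improvement=1e-3):
    """True if the best error improved by less than min_improvement
    over the last `window` trials."""
    if len(best_errors) <= window:
        return False  # too little data to judge convergence
    return best_errors[-window - 1] - best_errors[-1] < min_improvement

# Error stuck near 0.25 Hz for the last 20+ trials -> plateaued
history = [1.0, 0.6, 0.4, 0.3] + [0.25] * 21
print(has_plateaued(history))  # True
```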

### Step 4: Decide next action
- **If target achieved**: Done! No need for more trials
- **If converging**: Add 20-30 more trials (Option 1)
- **If struggling**: May need algorithm adjustment, not more trials

---

## Comparison Table

| Feature | Option 1: Continue | Option 2: Restart | Option 3: Wait |
|---------|-------------------|-------------------|----------------|
| Preserves data | ✅ Yes | ❌ No | ✅ Yes |
| Efficient | ✅ Very | ❌ Wasteful | ✅ Most |
| Easy to set up | ✅ Simple | ⚠️ Moderate | ✅ Simplest |
| Best use case | Adding more trials | Testing new config | Evaluating first |

---

## Detailed Example: Extending to 100 Trials

Let's say the v2.1 test (50 trials) finishes with:
- Best error: 0.25 Hz (not at target yet)
- Convergence: Still improving
- Pruning rate: 4% (good)

**Recommendation**: Continue with 50 more trials (Option 1)

### Step-by-step:

1. **Check current status**:
   ```python
   import optuna

   storage = "sqlite:///studies/circular_plate_protocol10_v2_1_test/2_results/study.db"
   study = optuna.load_study(study_name="circular_plate_protocol10_v2_1_test", storage=storage)

   print(f"Current trials: {len(study.trials)}")
   print(f"Best error: {study.best_value:.4f} Hz")
   ```

2. **Edit continuation script**:
   ```python
   # In continue_optimization.py line 29
   ADDITIONAL_TRIALS = 50  # Will reach ~100 total
   ```

3. **Run continuation**:
   ```bash
   cd studies/circular_plate_protocol10_v2_1_test
   python continue_optimization.py
   ```

4. **Monitor progress**:
   - Watch console output for trial results
   - Check `optimization_history_incremental.json` for updates
   - Look for convergence (error decreasing)

5. **Verify results**:
   ```python
   # After completion
   study = optuna.load_study(...)
   print(f"Total trials: {len(study.trials)}")  # Should be ~100
   print(f"Final best error: {study.best_value:.4f} Hz")
   ```

---

## Understanding Trial Counts

**Important**: The "total trials" count includes both successful and pruned trials.

Example breakdown:
```
Total trials: 50
├── Successful: 47 (94%)
│   └── Used for optimization
└── Pruned: 3 (6%)
    └── Rejected (invalid parameters, simulation failures)
```

When you add 50 more trials:
```
Total trials: 100
├── Successful: ~94 (94%)
└── Pruned: ~6 (6%)
```

The optimization algorithm only learns from **successful trials**, so:
- 50 successful trials ≈ 53 total trials (with 6% pruning)
- 100 successful trials ≈ 106 total trials (with 6% pruning)
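
This bookkeeping is easy to get wrong by hand. A one-liner gives the total trials to schedule for a target number of successful ones; it rounds up, so it is slightly more conservative than the approximations above (`total_trials_needed` is just an illustration, not a toolkit function):

```python
import math

def total_trials_needed(successful_target, pruning_rate=0.06):
    """Total trials to schedule so that ~successful_target survive pruning."""
    return math.ceil(successful_target / (1 - pruning_rate))

print(total_trials_needed(50))   # 54
print(total_trials_needed(100))  # 107
```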

---

## Best Practices

### When to Add More Trials:
✅ Error still decreasing (not converged yet)
✅ Close to target but need refinement
✅ Exploring new parameter regions

### When NOT to Add More Trials:
❌ Error has plateaued for 20+ trials
❌ Already achieved target tolerance
❌ High pruning rate (>10%) - fix validation instead
❌ Wrong algorithm selected - fix strategy selector instead

### How Many to Add:
- **Close to target** (within 2x tolerance): Add 20-30 trials
- **Moderate distance** (2-5x tolerance): Add 50 trials
- **Far from target** (>5x tolerance): Investigate root cause first
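
Those rules of thumb can be encoded directly. The function name and return values here are purely an illustration of the guidelines above:

```python
def suggested_additional_trials(best_error, tolerance):
    """Map the error/tolerance ratio to a suggested batch of extra trials."""
    ratio = best_error / tolerance
    if ratio <= 2:
        return 30   # close to target: small refinement batch
    if ratio <= 5:
        return 50   # moderate distance
    return 0        # far from target: investigate root cause, don't add trials

print(suggested_additional_trials(0.25, 0.1))  # ratio 2.5 -> 50
```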

---

## Monitoring Long Runs

For runs with 100+ trials (several hours):

### Option A: Run in background (Windows)
```bash
# Start minimized
start /MIN python continue_optimization.py
```

### Option B: Use screen/tmux (if available)
```bash
# Not standard on Windows, but useful on Linux/Mac
tmux new -s optimization
python continue_optimization.py
# Detach: Ctrl+B, then D
# Reattach: tmux attach -t optimization
```

### Option C: Monitor progress file
```python
# Check progress without interrupting the run
import json

with open('2_results/optimization_history_incremental.json') as f:
    history = json.load(f)

print(f"Completed trials: {len(history)}")
best = min(history, key=lambda x: x['objective'])
print(f"Current best: {best['objective']:.4f} Hz")
```

---

## Troubleshooting

### Issue: "Study not found in database"
**Cause**: Initial optimization hasn't run yet, or the database is corrupted
**Fix**: Run `run_optimization.py` first to create the initial study

### Issue: Continuation starts from trial #0
**Cause**: Study database exists but is empty
**Fix**: Delete the database and run a fresh optimization
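
To tell an empty database apart from a populated one without starting a session, you can count rows directly with the standard-library `sqlite3` module. The `trials` table name matches Optuna's RDB storage schema, but treat it as an assumption and verify against your database:

```python
import sqlite3

def trial_count(db_path):
    """Count rows in the study database's trials table.

    Assumes Optuna's RDB storage layout (one row per trial in `trials`).
    """
    con = sqlite3.connect(db_path)
    try:
        (n,) = con.execute("SELECT COUNT(*) FROM trials").fetchone()
        return n
    finally:
        con.close()

# e.g. trial_count("2_results/study.db") == 0 means the study is empty
```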

### Issue: NX session conflicts
**Cause**: Multiple NX sessions accessing the same model
**Solution**: NX Session Manager handles this automatically, but verify:
```python
from optimization_engine.nx_session_manager import NXSessionManager

mgr = NXSessionManager()
print(mgr.get_status_report())
```

### Issue: High pruning rate in continuation
**Cause**: Optimization exploring extreme parameter regions
**Fix**: The simulation validator should prevent this, but verify its rules are active
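
One way to quantify the problem is to count pruned entries in the incremental history file. The `state` field used here is an assumption about the history schema; adjust the key and value to whatever each entry actually records:

```python
import json

def pruning_rate(history_path):
    """Fraction of trials marked pruned in the incremental history file."""
    with open(history_path) as f:
        history = json.load(f)
    if not history:
        return 0.0
    pruned = sum(1 for t in history if t.get("state") == "PRUNED")
    return pruned / len(history)

# A rate above 0.10 suggests the validation rules are not filtering
# bad parameter sets before they reach the simulator.
```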

---

**Summary**: For your case (wanting 100 iterations), use **Option 1** with the `continue_optimization.py` script. Set `ADDITIONAL_TRIALS = 50` and run it after the current test finishes.