Core modules:
- cli.py: Command-line interface with Click
- pipeline.py: Main orchestrator
- video_processor.py: Frame extraction with ffmpeg
- audio_analyzer.py: Whisper transcription
- vision_analyzer.py: Component detection (placeholder)
- doc_generator.py: Markdown + PDF output

Also includes:
- pyproject.toml with uv/hatch config
- Prompts for AI analysis
- Basic tests
- ROADMAP.md with 4-week plan
CAD-Documenter Roadmap
Phase 1: Core Pipeline (MVP) — Week 1
- Video frame extraction (ffmpeg)
- Audio transcription (Whisper)
- Basic vision analysis (component identification)
- Markdown generation (simple template)
- CLI skeleton
Goal: `cad-doc video.mp4` produces basic Markdown output
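
As a rough illustration of the frame-extraction step, the sketch below builds an ffmpeg argument list that samples frames at a fixed rate. The function name `build_frame_extraction_cmd`, the `frame_%05d.jpg` naming pattern, and the default of one frame per second are assumptions made for this example, not the contents of video_processor.py.

```python
from __future__ import annotations

from pathlib import Path


def build_frame_extraction_cmd(
    video: Path, out_dir: Path, fps: float = 1.0
) -> list[str]:
    """Return an ffmpeg argument list that samples `fps` frames per second.

    Hypothetical helper: the real video_processor.py may use other options.
    """
    # ffmpeg expands %05d into a zero-padded frame counter.
    pattern = out_dir / "frame_%05d.jpg"
    return [
        "ffmpeg",
        "-hide_banner",
        "-i", str(video),
        "-vf", f"fps={fps:g}",  # fps filter: keep `fps` frames per second
        "-q:v", "2",            # high JPEG quality
        str(pattern),
    ]


if __name__ == "__main__":
    cmd = build_frame_extraction_cmd(Path("video.mp4"), Path("frames"))
    # Pass the list to subprocess.run(cmd, check=True) to extract frames.
    print(" ".join(cmd))
```

Returning the argument list instead of running it keeps the function easy to test without ffmpeg installed; the caller decides when to execute it.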
Phase 2: Smart Analysis — Week 2
- Scene change detection
- Timestamp correlation (frames ↔ transcript)
- Feature extraction (holes, threads, etc.)
- Material estimation
- Improved prompts for vision
Goal: Output correlates visual and verbal information intelligently
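
One possible way to approach the timestamp correlation item is to look up, for each extracted frame, the transcript segment being spoken at that moment. The sketch below assumes transcript segments carry `start`, `end`, and `text` fields (as Whisper segments do) and that they are sorted and non-overlapping; the `Segment` class and `segment_for_frame` function are hypothetical names for this example.

```python
from __future__ import annotations

from bisect import bisect_right
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    start: float  # seconds from the start of the video
    end: float    # seconds from the start of the video
    text: str


def segment_for_frame(
    frame_time: float, segments: list[Segment]
) -> Segment | None:
    """Return the segment spoken at `frame_time`, or None during silence.

    Assumes `segments` is sorted by start time and does not overlap.
    """
    starts = [s.start for s in segments]
    # Index of the last segment that starts at or before frame_time.
    i = bisect_right(starts, frame_time) - 1
    if i >= 0 and frame_time <= segments[i].end:
        return segments[i]
    return None


if __name__ == "__main__":
    transcript = [
        Segment(0.0, 4.0, "This is the base plate."),
        Segment(6.0, 9.5, "Two M6 threaded holes here."),
    ]
    for t in (2.0, 5.0, 7.0):
        match = segment_for_frame(t, transcript)
        print(t, match.text if match else "(no speech)")
```

A binary search keeps each lookup logarithmic in the number of segments, which matters little for short clips but avoids a quadratic scan on long recordings.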
Phase 3: Integrations — Week 3
- Part Manager API integration
- BOM generation with P/N lookup
- Atomizer hints generation
- Atomaste Report Standard PDF
Goal: Full integration with Atomaste ecosystem
Phase 4: Polish — Week 4
- Interactive mode
- Gitea auto-publish
- Error handling and recovery
- Documentation and examples
- Tests
Goal: Production-ready tool
Success Metrics
- ⏱️ Time saved: 4+ hours per assembly documentation
- 📊 Completeness: 90%+ of components captured
- 🎯 Accuracy: P/N matching >95% with Part Manager
- 📄 CDR ready: PDF passes quality review without edits
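
The two quantitative targets above (completeness of 90%+ and P/N matching above 95%) could be checked against a hand-labelled reference assembly. The sketch below is one assumed way to compute them; the function names and the use of component names as dictionary keys are illustrative choices, not part of the tool.

```python
from __future__ import annotations


def completeness(detected: set[str], expected: set[str]) -> float:
    """Fraction of expected components that were detected."""
    if not expected:
        return 1.0
    return len(detected & expected) / len(expected)


def pn_accuracy(predicted: dict[str, str], reference: dict[str, str]) -> float:
    """Fraction of reference components whose predicted P/N matches."""
    if not reference:
        return 1.0
    hits = sum(1 for name, pn in reference.items() if predicted.get(name) == pn)
    return hits / len(reference)


def meets_targets(completeness_score: float, pn_score: float) -> bool:
    """Apply the roadmap thresholds: 90%+ completeness, >95% P/N matching."""
    return completeness_score >= 0.90 and pn_score > 0.95


if __name__ == "__main__":
    expected = {"base plate", "bracket", "M6 bolt", "cover"}
    detected = {"base plate", "bracket", "M6 bolt"}
    print(f"completeness: {completeness(detected, expected):.0%}")
```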