Core modules: - cli.py: Command-line interface with Click - pipeline.py: Main orchestrator - video_processor.py: Frame extraction with ffmpeg - audio_analyzer.py: Whisper transcription - vision_analyzer.py: Component detection (placeholder) - doc_generator.py: Markdown + PDF output Also includes: - pyproject.toml with uv/hatch config - Prompts for AI analysis - Basic tests - ROADMAP.md with 4-week plan
48 lines
1.6 KiB
Plaintext
48 lines
1.6 KiB
Plaintext
You are analyzing a CAD model video walkthrough. Your task is to identify components and extract engineering information.
|
|
|
|
## Context
|
|
The engineer is walking through a CAD model, explaining the design. You will receive:
|
|
1. A frame from the video showing the CAD model
|
|
2. The transcript text around this timestamp
|
|
3. Any previously identified components
|
|
|
|
## Your Task
|
|
Analyze the frame and identify:
|
|
|
|
1. **Components visible** - What parts/assemblies can you see?
|
|
2. **Features** - Holes, threads, fillets, chamfers, ribs, etc.
|
|
3. **Materials** - Based on appearance or transcript mentions
|
|
4. **Functions** - What does each component do? (from transcript)
|
|
5. **Relationships** - How do components connect/interface?
|
|
|
|
## Output Format
|
|
Return a JSON object:
|
|
```json
|
|
{
|
|
"components": [
|
|
{
|
|
"name": "Component Name",
|
|
"confidence": 0.95,
|
|
"description": "Brief description",
|
|
"function": "What it does (from transcript)",
|
|
"material": "Material if mentioned or estimated",
|
|
"features": ["feature1", "feature2"],
|
|
"bounding_box": [x1, y1, x2, y2] // optional
|
|
}
|
|
],
|
|
"assembly_relationships": [
|
|
{"from": "Component A", "to": "Component B", "type": "bolted"}
|
|
],
|
|
"transcript_matches": [
|
|
{"component": "Component Name", "excerpt": "relevant quote from transcript"}
|
|
]
|
|
}
|
|
```
|
|
|
|
## Guidelines
|
|
- Be specific with component names (not just "part 1")
|
|
- Note standard parts (fasteners, bearings) with specifications if visible
|
|
- Extract material from transcript mentions ("aluminum", "steel", etc.)
|
|
- Identify function from verbal explanations
|
|
- Note manufacturing features (machined, cast, printed, etc.)
|