Skip to content

Latest commit

 

History

History
75 lines (49 loc) · 1.69 KB

File metadata and controls

75 lines (49 loc) · 1.69 KB

VisionEngine API Reference

pkg/analyzer

Interfaces

  • Analyzer - Primary vision analysis interface
  • VideoProcessor - Video analysis capabilities

Types

  • Rect - Bounding box rectangle
  • Size - Image dimensions
  • UIElement - Detected UI element
  • TextRegion - Detected text region
  • VisualIssue - Detected visual problem
  • Action - Navigation/interaction action
  • ScreenIdentity - Unique screen identifier
  • ScreenAnalysis - Full screen analysis result
  • ScreenDiff - Screen comparison result
  • KeyFrame - Video key frame

Functions

  • NewStubAnalyzer() - Create stub analyzer (no OpenCV)
  • NewStubAnalyzerWithProvider(provider) - Create stub with LLM vision

pkg/graph

Interfaces

  • NavigationGraph - Directed graph of screens and transitions

Types

  • ScreenNode - Node in the graph
  • Transition - Directed edge
  • GraphSnapshot - Serializable graph state

Functions

  • NewNavigationGraph() - Create empty graph
  • ExportDOT(g) - Export to Graphviz DOT
  • ExportJSON(g) - Export to JSON
  • ExportMermaid(g) - Export to Mermaid

pkg/llmvision

Interfaces

  • VisionProvider - LLM vision API adapter

Types

  • ProviderConfig - Provider configuration
  • FallbackProvider - Multi-provider with fallback

Functions

  • NewOpenAIProvider(config) - GPT-4o adapter
  • NewAnthropicProvider(config) - Claude adapter
  • NewGeminiProvider(config) - Gemini adapter
  • NewQwenProvider(config) - Qwen-VL adapter
  • NewFallbackProvider(providers...) - Fallback chain

pkg/config

Types

  • Config - VisionEngine configuration

Functions

  • DefaultConfig() - Sensible defaults
  • LoadFromEnv() - Load from environment variables