Skip to content

Data in OWA: Complete Desktop Agent Data Pipeline

Desktop AI needs high-quality, synchronized multimodal data: screen captures, mouse/keyboard events, and window context. OWA provides the complete pipeline from recording to training.

🚀 Quick Start: Record → Train in 3 Steps

# 1. Record desktop interaction
$ ocap my-session.mcap

# 2. Process to training format
$ python scripts/01_raw_events_to_event_dataset.py --train-dir ./

# 3. Train your model
$ python train.py --dataset ./event-dataset

📖 Detailed Guide: Complete Quick Start Tutorial - Step-by-step walkthrough with examples and troubleshooting

The OWA Data Ecosystem

🎯 Getting Started

New to OWA data? Start here:

📚 Technical Reference

Deep dive into the format and pipeline:

🛠️ Tools & Ecosystem

🤗 Community Datasets

Browse Available Datasets: 🤗 datasets?other=OWA

  • Growing Collection: Hundreds of community-contributed datasets
  • Standardized Format: All use OWAMcap for seamless integration
  • Interactive Preview: Hugging Face Spaces Visualizer
  • Easy Sharing: Upload recordings directly with one command

🚀 Impact: OWA has democratized desktop agent data, growing from zero to hundreds of public datasets in the unified OWAMcap format.