Open Source Infrastructure

Darwin Mobile Agent
A Roadmap for Self-Evolution

An open-source infrastructure designed as a foundation for autonomous reinforcement learning in mobile GUI environments. Building truly autonomous, self-evolving agents.

🚀 Get Started 📄 Read Paper

Capabilities

Key Features

A comprehensive framework for building self-evolving GUI agents

🌍

"Big World" GUI

Mobile GUI as an experimentally tractable, open-ended "Big World" proxy for general intelligence emergence.

⚡

Async Architecture

Enables efficient parallelization by pipelining model inference and environment execution to hide interaction latency.

🧬

Self-Evolution

Systematic removal of human priors from Task Curriculum, Outcome Verification, and Agent State (Memory).

🔄

Task Lifecycle

Automated Setup → Execution → Teardown protocol to internalize environment state management.

☁️

Cloud-Native Fleet

Stable infrastructure using cloud Android devices via Alibaba Cloud instead of local emulators.

✅

Smart Verification

LLM-based semantic verification to drive rewards for open-ended tasks without programmatic state access.

Progress

Roadmap

Track our journey towards fully autonomous agents

✅ Completed

✓ Cloud phone infrastructure with Alibaba Cloud
✓ Multi-model support (UI-TARS, Qwen3-VL)
✓ Asynchronous agent-environment loop
✓ Curriculum-based task sampling
✓ LLM-based outcome verification
✓ Task lifecycle protocol (setup-task-cleanup)

📋 Planned

Pre-trained model checkpoints release
Extended task benchmark
Device abstraction layer for emulators and physical devices
Automated task generation (LLM-based)
Knowledge distillation & Memory Management

Getting Started

Quick Start

Get up and running in minutes

Terminal

# 1. Clone and install
git clone https://github.com/ai-agents-2030/darwin-mobile-agent.git
cd darwin-mobile-agent
pip install -e .

# 2. Install dependencies
pip install vllm==0.8.5 trl==0.25.1 "transformers>=4.57.0"

# 3. Configure cloud phones (see docs for details)
# Edit your device addresses in the training script

# 4. Run training
bash examples/darwin_agent/run_spabench_ui_tars.sh

Validation

Results

Stable Policy Optimization

We evaluate the stability of the Darwin Mobile Agent infrastructure in a controlled reinforcement learning setting. Eight tasks are executed concurrently across eight cloud phones, enabling parallel data collection.

The agent demonstrates a clear and consistent improvement in mean task success rate. Despite operating on real devices with inherent latency and asynchronous agent–environment interactions, the training process remains stable with no observable divergence or performance collapse.

This indicates that the Darwin infrastructure can reliably sustain end-to-end policy optimisation in the mobile GUI domain.

Reference

Citation

If you use Darwin Mobile Agent in your research, please cite

@article{darwin2025,
  title={Darwin Mobile Agent: A Roadmap for Self-Evolution},
  author={Beechey, Daniel and Yuen, Derek and Liu, Jianheng and Luo, Dezhao and He, Tiantian and Luo, Weilin and Wang, Jun and Shao, Kun},
  journal={arXiv preprint},
  year={2025}
}

Darwin Mobile Agent
A Roadmap for Self-Evolution

Key Features

"Big World" GUI

Async Architecture

Self-Evolution

Task Lifecycle

Cloud-Native Fleet

Smart Verification

Roadmap

✅ Completed

📋 Planned

Quick Start

Results

Stable Policy Optimization

Documentation

📖 Getting Started

💻 Local Setup

☁️ Cloud Devices

📋 Task Benchmarks

Acknowledgements

Citation

Darwin Mobile Agent A Roadmap for Self-Evolution

Key Features

"Big World" GUI

Async Architecture

Self-Evolution

Task Lifecycle

Cloud-Native Fleet

Smart Verification

Roadmap

✅ Completed

📋 Planned

Quick Start

Results

Stable Policy Optimization

Documentation

📖 Getting Started

💻 Local Setup

☁️ Cloud Devices

📋 Task Benchmarks

Acknowledgements

Citation

Darwin Mobile Agent
A Roadmap for Self-Evolution