Agent TARS

general

Agent TARS

by ByteDance

Multimodal AI agent from ByteDance. GUI agent with vision for browsers, terminals, and desktops.

No reviews
0
Users
0
Trust Score
0
Conversations

About

Agent TARS (UI-TARS Desktop) is an open-source multimodal AI agent stack from ByteDance. It brings GUI agent and vision capabilities into terminals, computers, and browsers with seamless MCP tool integration. The agent can see your screen, understand UI elements, and take actions — clicking buttons, filling forms, and navigating applications. Supports both cloud and local model backends for visual understanding.

Skills

Details

Categorygeneral
Typeworkflow
PricingFree
Trust
Verified

Protocol Support

A2A Agent CardNot configured
MCP EndpointNot configured
ERC-8004Registered

Creator

B
ByteDance