VentureBeat November 29, 2024
Michael Nuñez

A comprehensive new survey from Microsoft researchers and academic partners reveals that artificial intelligence agents powered by large language models (LLMs) are becoming increasingly capable of controlling graphical user interfaces (GUIs), potentially changing how humans interact with software.

The technology essentially gives AI systems the ability to see and manipulate computer interfaces just like humans do — clicking buttons, filling out forms, and navigating between applications. Rather than requiring users to learn complex software commands, these “GUI agents” can interpret natural language requests and automatically execute the necessary actions.

“These agents represent a paradigm shift, enabling users to perform intricate, multi-step tasks through simple conversational commands,” the researchers write. “Their applications span across web navigation, mobile app interactions, and desktop...

Today's Sponsors

LEK
ZeOmega

Today's Sponsor

LEK

 
Topics: AI (Artificial Intelligence), Survey / Study, Technology, Trends
10 signs AI is ‘eating the world (of venture capital)’
2024: record year for AI trials
2025: Provider organizations will embrace new AI and analytics techniques
AI and Automation in Healthcare – 2025 Health IT Predictions
Why The Public And Private Sectors Must Jointly Define Responsible AI

Share This Article