The AI agent accepts both text and images as input. To complete tasks, the CUA processes raw pixel data of the screen and uses a virtual keyboard and mouse to execute actions. OpenAI claims it can ...
IBL News | New York OpenAI yesterday introduced a research preview of its general-purpose AI agent, an operator that can take control of a web browser and independently perform some actions. It costs ...