Google has introduced a new AI model called Gemini 2.5 Computer Use. The model lets AI navigate web interfaces that were designed for human users rather than for automated access. It can perform tasks like filling out online forms by interpreting what appears on screen and reasoning about the next step.
The Gemini model can handle user interface testing and can work with websites that don’t have direct API access. Variants of this model have been used in different projects, like AI Mode and Project Mariner, which automates actions in a browser, such as adding items to online shopping carts based on user lists.
This announcement came right after OpenAI unveiled new functions for ChatGPT during its annual Dev Day. Anthropic released a similar “computer use” feature for its Claude AI model last year.
Google has shared demo videos showing how the model works, although they are sped up for demonstration purposes. According to Google, the model outperforms competing models on common benchmarks for web and mobile tasks. However, unlike ChatGPT Agent, it currently operates only within a browser environment and isn’t equipped for full desktop control. It supports 13 actions, including opening a web browser and dragging and dropping elements.
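To make the idea of a fixed action set more concrete, here is a minimal sketch of the observe-and-act loop that a browser agent of this kind might follow. It is an illustration only, not Google's API: the action names (`type_text`, `click`, `done`), the `FakePage` class, and the `propose_action` function are all hypothetical stand-ins, and the "model" is a simple rule rather than a real call to Gemini.

```python
from dataclasses import dataclass, field


@dataclass
class Action:
    name: str  # hypothetical action name, e.g. "type_text"
    args: dict = field(default_factory=dict)


@dataclass
class FakePage:
    """Stand-in for a browser page; a real client would drive an actual browser."""
    fields: dict = field(default_factory=dict)
    submitted: bool = False

    def observe(self) -> dict:
        # A real loop would capture a screenshot; here the observation is plain state.
        return {"fields": dict(self.fields), "submitted": self.submitted}


def propose_action(goal: dict, observation: dict) -> Action:
    """Placeholder for the model call: pick the next step toward the goal."""
    for key, value in goal.items():
        if observation["fields"].get(key) != value:
            return Action("type_text", {"field": key, "text": value})
    if not observation["submitted"]:
        return Action("click", {"target": "submit"})
    return Action("done")


def execute(page: FakePage, action: Action) -> None:
    """Apply one proposed action to the page; reject anything unsupported."""
    if action.name == "type_text":
        page.fields[action.args["field"]] = action.args["text"]
    elif action.name == "click" and action.args.get("target") == "submit":
        page.submitted = True
    else:
        raise ValueError(f"unsupported action: {action.name}")


def run_agent(goal: dict, page: FakePage, max_steps: int = 10) -> list:
    """Observe, ask for an action, execute it, and repeat until done."""
    history = []
    for _ in range(max_steps):
        action = propose_action(goal, page.observe())
        if action.name == "done":
            break
        execute(page, action)
        history.append(action.name)
    return history
```

Calling `run_agent({"name": "Ada"}, FakePage())` fills in the one field and then clicks submit. The step limit and the rejection of unknown actions reflect a common design choice for agents like this: the client, not the model, decides which actions are actually allowed to run.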
Developers can access Gemini 2.5 Computer Use through Google AI Studio and Vertex AI. There’s also a demo available on Browserbase, showing the model completing tasks like playing a game or browsing trending topics.
Reflecting on the advancements in AI, experts highlight that technologies like Gemini signify a growing trend toward more capable assistants. For instance, a recent survey showed that 70% of users are excited about AI tools that can handle more complex tasks. This suggests a shift in how we see the role of AI in our daily lives.
As AI technology continues to evolve, the conversation on its impact is crucial. Many users on social media share their experiences and ideas about how these tools can improve efficiency at work and in personal projects. These discussions underline the value of developing AI that understands and interacts effectively within human-designed systems.
For more information on Gemini’s capabilities, check out Google’s official announcement.

