← all stories otherannouncement 2 sources · 1h ago

Gemini 3.5 Flash Gets Computer Use for Automated PC Operations

By folding computer use into its fastest, cheapest model instead of keeping it a premium standalone product, Google is making autonomous PC-operation agents a default capability rather than a specialized add-on.

Reporting from 2 sources: GameBusiness.jp, GIGAZINE.

Gemini 3.5 Flash Gets Computer Use for Automated PC Operations

Google has integrated a "computer use" feature directly into its Gemini 3.5 Flash AI model, allowing it to recognize on-screen content and perform clicks and text inputs. Previously available only as a separate standalone model called Gemini 2.5 Computer Use, the feature is now built into the lightweight Flash series. This enables developers to build agents that can see, reason about, and act upon browser, mobile, and desktop environments through a single model. Google says use cases include automating multi-step workflows, enterprise application testing, and accessibility audits. On the OSWorld-Verified benchmark, which measures how accurately AI can perform operating system tasks, Gemini 3.5 Flash scored 78.4, up from Gemini 3 Flash's 65.1 and ahead of Gemini 3.1 Pro's 76.2. It tied with Sonnet 4.6 at 78.4, while Opus 4.8 led at 83.4. Google has also implemented safety measures, including targeted adversarial training against prompt injection, optional user confirmation for sensitive operations, and automatic task stoppage when indirect injection is detected. The model is available immediately through the Gemini API and the Gemini Enterprise Agent Platform.

Google also released a demo environment hosted by Browserbase, a reference implementation, and documentation alongside the model. The company says the integration means AI can not only return answers but also become easier to use as an agent that views and operates the screen.

Because Gemini 3.5 Flash outputs the intent of its operations, developers can more easily understand why the AI is trying to press a particular button. Google demonstrated the feature by having the model analyze the Gemini app and return a categorized list of features, and by having it automatically audit accessibility issues in its own documentation.

On the OSWorld-Verified benchmark, the new model scored 78.4, up from Gemini 3 Flash's 65.1 and ahead of Gemini 3.1 Pro's 76.2. It tied with Sonnet 4.6 at 78.4, while Opus 4.8 led at 83.4. GPT-5.4 mini scored 72.1, and GPT-5.5 scored 78.7.

Google says the computer use feature in Gemini 3.5 Flash underwent targeted adversarial training against prompt injection. Two optional enterprise safety features are available: one that requires explicit user confirmation for sensitive or irreversible operations, and one that automatically stops tasks when indirect prompt injection is detected. Google recommends developers combine these with secure sandbox environments, human-in-the-loop steps, and strict access controls.

Synthesized by Yomimono from the 2 cited sources below, including Japanese-language reporting where cited, then editorially reviewed before publishing.

Gemini 3.5 Flash Gets Computer Use for Automated PC Operations

More on this

Sources