James here from the team! Let us know if you have feedback on either our cloud or open source repo. We want to push the frontiers for computer-use so that people can do less repetitive work.
I tested a few agentic browsers such as Genspark, Fellou and Comet. I found the vision approach less effective than the DOM-based approach, and it seems quite a bit slower too. Does it really need a reasoning step to type a URL into the address bar?
This is pretty awesome. On the cloud env, though, I got this error: Error: AIProviderError: AI provider failed to generate text. Timeout while downloading https://playmatic-screenshots.s3.us-west-2.amazonaws.com
Also, for the task I gave it, this was the result:
I was unable to retrieve any live fare data because both airline sites became unworkable in the remote session (xxxx selectors would not stay open; xxxxsearch could not be completed before the session ended). Below is a blank comparison table you can fill in once you gather the prices manually:
Is that the current state of best-in-class computer-use agents? Or is it more a case of needing to modify it until it is good enough for our use case?
I'm just trying to provide helpful feedback out of honest curiosity. This is awesome work.
Nice job. It's exciting that the quality is approaching human level, but I still think we are spending way too many tokens, and the automation speed-up isn't really worth the total token price yet (unless you have very high-end GPUs and don't care about how quickly your tasks complete).
This is great. Will it solve the three biggest issues with ChatGPT agent?
1. Proxy support for sites that block the user
2. Browser extensions support for uBlock, password managers, etc.
3. CAPTCHA solving
Hi, great work, congrats!
Does it use OpenRouter for model selection? Which models did you achieve the WebArena result with? Are there any open-source models that are any good for this?
This is awesome. Is it the biggest open-source browser agent?
"* Full computer access: It's not sandboxed in a browser. Meka operates with OS-level controls, allowing it to handle system dialogues, file uploads, and other interactions that browser-only automation tools can't."
This seems pretty scary. Just recently an AI wiped a company database: https://fortune.com/2025/07/23/ai-coding-tool-replit-wiped-d...