IMO it’s a pretty verifiably correct and complex piece of software. There are all sorts of test suites but no greater one than comparing a page rendered to how it renders in chrome.
Also, and maybe more importantly, a browser is the kind of the perfect entry point for a lot of businesses.
If you’re for example hoping to build an agent that people rely on to manage their calendars, emails, book them vacations, etc.. a browser is probably the perfect form factor for that.
Or if you’re a AI lab and want to tap into training data behind logged in websites - a browser everyone uses is the ultimately what you need.
If you can build and maintain a browser from scratch with just agents then there’s no need to pay millions of dollars to hire away chromium engineers from Google.
Claude Code announced an update to how TODO's work on X.
TLDR; To make sessions collaborate on a single Task List, you can set the TaskList as an environment variable and start Claude like so:
CLAUDE_CODE_TASK_LIST_ID=groceries claude
Here's the full text:
Today, we're upgrading Todos in Claude Code to Tasks. Tasks are a new primitive that help Claude Code track and complete more complicated projects and collaborate on them across multiple sessions or subagents.
As model capabilities grow, one of the most important things we can do is “unhobble” Claude and allow it to use its new capabilities effectively. Compared to previous models, Opus 4.5 is able to run autonomously for longer and keep track of its state better. We found that the TodoWrite Tool was no longer necessary because Claude already knew what it needed to do for smaller tasks.
At the same time, we found ourselves using Claude Code to complete longer projects, sometimes across multiple subagents, context windows or sessions. But projects are more complex, tasks have dependencies and blockers and require coordination when using it across sessions.
It was clear we needed to evolve Todos to help Claude work on longer projects. This need was also emerging in the community and we took inspiration from projects like Beads by Steve Yegge.
Tasks are our new abstraction for coordinating many pieces of work across projects, Claude can create Tasks with dependencies on each other that are stored in the metadata, which mirrors more how projects work. Additionally, Tasks are stored in the file system so that multiple subagents or sessions can collaborate on them. When one session updates a Task, that is broadcasted to all sessions currently working on the same Task List.
You can ask Claude to create tasks right now, it’s especially useful when creating when spinning up subagents. Tasks are stored in ~/.claude/tasks, you can use this to build additional utilities on top of tasks as well.
To make sessions collaborate on a single Task List, you can set the TaskList as an environment variable and start Claude like so:
CLAUDE_CODE_TASK_LIST_ID=groceries claude
This also works for claude -p and the AgentSDK.
Tasks are a key building block for allowing Claude to build more complex projects. We’re looking forward to seeing how you use it.
Just tried to signup for tana - but their waitlist form (interestingly) didn't ask me for an email to contact me with followup steps and I can't seem to create an account :(. How did ya get access?
You can go through the back-door currently here: https://app.tana.inc/create-account (currently requires credit card to get in directly, but you can cancel it directly after).