Working Notes: a commonplace notebook for recording & exploring ideas.
— Kunal
Ego is an Extensible Agent Orchestrator, inspired by emacs' architecture. I always feel compelled to hack on codex and claude, and I wanted something that would let me do that really trivially.
Getting closer: as I think about this, it's going to be a very tiny amount of code that should be easy to reason about but really reusable and powerful. I think I've spent more time with pen and paper than actually typing it out in this case.
There's the core that handles the actual interaction loop with the model, and then there are different UIs I can attach to the core to drive the model behavior, like the REPL which I've defined.
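A tiny sketch of what that core/UI split could look like; all names here are illustrative, not ego's actual API:

```python
from typing import Protocol

class UI(Protocol):
    """Anything that can feed input to the core and display its output."""
    def read_input(self): ...
    def show_output(self, text: str) -> None: ...

class Core:
    """The model-interaction loop; knows nothing about any particular UI."""
    def __init__(self, model):
        self.model = model  # any callable: prompt -> reply

    def run(self, ui: UI) -> None:
        # Pull input from the attached UI until it signals the end (None).
        while (prompt := ui.read_input()) is not None:
            ui.show_output(self.model(prompt))

class ListUI:
    """A scripted 'UI' that drives the loop without a terminal."""
    def __init__(self, prompts):
        self.prompts = iter(prompts)
        self.outputs = []

    def read_input(self):
        return next(self.prompts, None)  # None ends the session

    def show_output(self, text):
        self.outputs.append(text)

ui = ListUI(["hello"])
Core(lambda p: p.upper()).run(ui)
# ui.outputs == ["HELLO"]
```

A REPL would just be another class satisfying the same protocol, reading from stdin instead of a list.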
Keeping tools and agents configurable means I should be trivially able to use a sandboxed python instance running in a carefully crafted container (or even remote lambdas if I so choose), and have agents converse with each other while giving them rules and jobs to do. Leaning into asyncio also gives me some flexibility.
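As a sketch of what "configurable tools" could mean under asyncio: tools are plain named async callables in a registry, so swapping in a sandboxed or remote implementation is just a config change (hypothetical names throughout):

```python
import asyncio

# Hypothetical tool registry: tools are async callables keyed by name.
TOOLS = {}

def tool(name):
    """Register a coroutine function under a name agents can call."""
    def register(fn):
        TOOLS[name] = fn
        return fn
    return register

@tool("add")
async def add(a: int, b: int) -> int:
    await asyncio.sleep(0)  # stand-in for real async work (subprocess, RPC, ...)
    return a + b

async def call(name, *args):
    # Agents resolve tools by name only, so the implementation behind a
    # name is free to live anywhere: a container, a lambda, another agent.
    return await TOOLS[name](*args)

result = asyncio.run(call("add", 2, 3))
# result == 5
```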
I'm going to need to maintain an explicit log and some form of state to easily serialize, deserialize and fork contexts; need to spend more time noodling on this. There are so many options open for building truly interesting agent harnesses; I'm sad all of Claude, Codex and Gemini converged to a "browser UI" in the terminal.
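One possible shape for that explicit log, purely as a sketch and not ego's real design: keep the context as plain data so serializing, deserializing and forking are trivial.

```python
import copy
import json
from dataclasses import dataclass, field, asdict

@dataclass
class Context:
    """A conversation context backed by an explicit, serializable log."""
    log: list = field(default_factory=list)

    def append(self, role, content):
        self.log.append({"role": role, "content": content})

    def serialize(self) -> str:
        return json.dumps(asdict(self))

    @classmethod
    def deserialize(cls, blob: str) -> "Context":
        return cls(**json.loads(blob))

    def fork(self) -> "Context":
        # Deep copy so the branch can diverge without touching the original.
        return copy.deepcopy(self)

ctx = Context()
ctx.append("user", "hi")
branch = ctx.fork()
branch.append("assistant", "hello!")
# ctx.log keeps 1 entry while branch.log has 2: the fork is independent,
# and round-tripping through serialize/deserialize preserves the log.
restored = Context.deserialize(branch.serialize())
```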
I'm still noodling on the design in my head and on paper to figure out how to structure the code. I came up with a couple of design decisions / principles to guide my choices:
ego : claude :: emacs : vscode in terms of design and extensibility; I'm leaning on python as the ~lisp of choice, with the implementation language shaping the design.
Another idea, after trying this out in a project yesterday: I'd really want this to be something I can embed into any live python project -- and make any tool AI-powered almost trivially. Lots of prototyping and exploration to do here!
(I think there's a lot that can be done here just by building the right MCP servers and giving agents access to them -- e.g. gdb integration -- but I'm really curious how far I can get by opening up a Python program's internals to a model.)
The proxy API router doesn't seem to be approved anymore, so I'll have to switch to explicit API keys. Switching to OpenRouter to figure this out better.
Finally, I realized that an embeddable version of ego is a fantastic way to build something that can easily inject intelligence into any python program: it can introspect and change behavior, and generate code with the right tools. Python is remarkably reflection-friendly, including access to documentation, so this should be really powerful.
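A taste of how far the standard library alone gets you -- `get_info` is a made-up helper, but everything it calls is plain `inspect`:

```python
import inspect

def get_info(obj):
    """Summarize a live object for a model: name, docstring, signature."""
    info = {"name": getattr(obj, "__name__", repr(obj)),
            "doc": inspect.getdoc(obj)}
    try:
        info["signature"] = str(inspect.signature(obj))
    except (TypeError, ValueError):
        info["signature"] = None  # some builtins don't expose one
    return info

def greet(name: str) -> str:
    """Say hello."""
    return f"hello {name}"

info = get_info(greet)
# info["signature"] == "(name: str) -> str" and info["doc"] == "Say hello."
```

The same approach extends to `inspect.getsource` and `inspect.getmembers` for walking whole modules.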
Oh, and I need to choose a name to ship to PyPI with. ego is already taken, sadly.
General options
The RLM paper has prompts in the appendix
I think next steps are to make a single process agent in python that can independently improve, and then figure out the human interaction model and instrumentation I'm planning to do.
The general idea is to have a very simple python interpreter loop at the heart of the program that maintains state, and can easily be customized, hooked into, and overridden: exactly like emacs. I frequently feel like hacking on Claude Code, and then it's ... tricky? impossible?
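The hook idea, sketched in a few lines -- a mutable hook table in the spirit of Emacs advice, with all names hypothetical:

```python
# Hypothetical hook table: behavior lives in mutable lists of functions,
# so anything can be advised or overridden at runtime, Emacs-style.
HOOKS = {"pre": [], "post": []}

def run_hooks(phase, value):
    """Thread a value through every hook registered for a phase."""
    for hook in HOOKS[phase]:
        value = hook(value)
    return value

def step(state: dict, command: str) -> dict:
    """One turn of the interpreter loop, kept trivial on purpose."""
    command = run_hooks("pre", command)
    state = dict(state, last=command)  # the 'real' work would go here
    return run_hooks("post", state)

# 'Hacking on the editor' at runtime: install a hook, no restart needed.
HOOKS["pre"].append(str.strip)
state = step({}, "  hello  ")
# state["last"] == "hello"
```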
The other bit is that I can have the AIs modify the orchestrator and save code in real time: and I generally need to make sure this versioning is explicitly maintained so that restarting the interpreter restores old state correctly. Particularly if the AIs want to make their own Python tools instead of installing shell commands.
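One way that versioning could work, purely as a sketch: write each AI-authored tool as a numbered file and track the current version in a manifest, so a restarted interpreter reloads exactly what was running (layout and names are hypothetical):

```python
import json
import pathlib
import tempfile

def save_tool(root: pathlib.Path, name: str, source: str) -> int:
    """Persist source as a new numbered version and record it in a manifest."""
    manifest_path = root / "manifest.json"
    manifest = json.loads(manifest_path.read_text()) if manifest_path.exists() else {}
    version = manifest.get(name, 0) + 1
    (root / f"{name}_v{version}.py").write_text(source)
    manifest[name] = version
    manifest_path.write_text(json.dumps(manifest))
    return version

def load_tool(root: pathlib.Path, name: str) -> str:
    """Load the source of the latest recorded version of a tool."""
    version = json.loads((root / "manifest.json").read_text())[name]
    return (root / f"{name}_v{version}.py").read_text()

root = pathlib.Path(tempfile.mkdtemp())
save_tool(root, "greet", "def greet(): return 'v1'")
save_tool(root, "greet", "def greet(): return 'v2'")
# load_tool(root, "greet") returns the v2 source
```

Old versions stay on disk, so a forked context that referenced v1 can still find it.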
I'd like to validate this with small experiments to see how far I can get, and also define an interface that works for this. Presumably users (& agents) will also want to install additional dependencies trivially into this application, so I need to figure that part out too.