Original: Geoffrey Huntley · 07/06/2025
Summary
Just yesterday morning, I was writing a conference talk on best practices for maintaining the LLM context window, which was quite detailed. Just yesterday morning, I was writing a conference talk on best practices for maintaining the LLM context window, which was quite detailed. It contained the then best practices from the two blog posts below.Key Insights
“Just yesterday morning, I was writing a conference talk on best practices for maintaining the LLM context window, which was quite detailed.” — The author reflects on the rapid obsolescence of technical best practices due to advancements in AI.
“Instead of allocating everything to the main context window and then overflowing it, you spawn a subagent.” — Introducing the concept of subagents to manage tasks within AI’s context window more efficiently.
“All those tasks will soon be automated.” — Predicting the future of software maintenance, where mundane tasks are automated by AI.
Topics
- AI Agents
- Agent-Native Architecture
- Prompt Engineering
- KTLO (Keep The Lights On)
- Software Automation
Full Article
I dream of roombas - thousands of automated AI robots that autonomously maintain codebases
Author: Geoffrey HuntleyPublished: 2025-06-07
Source: https://ghuntley.com/ktlo/
Just yesterday morning, I was writing a conference talk on best practices for maintaining the LLM context window, which was quite detailed. It contained the then best practices from the two blog posts below.
autoregressive queens of failure Have you ever had your AI coding assistant suggest something so off-base that you wonder if it’s trolling you? Welcome to the world of autoregressive failure. LLMs, the brains behind these assistants,… — Geoffrey Huntley
if you are redlining the LLM, you aren’t headlining It’s an old joke in the DJ community about upcoming artists having a bad reputation for pushing the audio signal into the red. Red is bad because it results in the audio signal being clipped and the m… — Geoffrey HuntleyYet sections of that talk - just 4 hours later - are now redundant if you use Amp and are in the early access pilot. Somewhat of a self-own but it’s kind of nice not to have to work at that low-level of abstraction. It’s really nice to work at higher abstractions. In the stream below, you will see a prototype of subagents. Yep, it’s real. It’s here.
I dream about AI subagents; they whisper to me while I’m asleep In a previous post, I shared about “real context window” sizes and “advertised context window sizes” Claude 3.7’s advertised context window is 200k, but I’ve noticed that the quality of output clips a… — Geoffrey HuntleyInstead of allocating everything to the main context window and then overflowing it, you spawn a subagent, which has its brand-new context window for doing the meaty stuff, like building, testing, or whatever you can imagine. Whilst that is happening the main thread is paused and suspended, waiting until competition.
It’s kind of like async, await state machines, or futures for LLMs.▶ revenge of the subagents It was pretty hard to get to bed last night. Truth be told, I stayed up just watching it in fascination. Instead of running an infinite loop where it would blow up the main context window (which would result in the code base ending up in an incomplete state) resulting in me having to jump back in and gets hands on to do other things with prompting to try and rescue it, now the main thread, the context window, it barely even increments and every loop completes. Thank you, Thorsten, for making my dreams a reality. Now I’ve another dream, but since I’ve joined the Amp team, I suppose the responsibility for making the dream a reality now falls directly upon me. The buck stops with me to get it done. Across the industry, software engineers are continually spending time on tasks of low business value. Some companies even refer to it as KTLO, or “Keep the Lights On”. If these tasks are neglected, however, they present a critical risk to the business. Yet they don’t get done because the product is more important. So it’s always a risk-reward trade-off. So here’s the pitch. All those tasks will soon be automated. Now that we have automated context management through subagents, the next step is to provide primitives that allow for the automation and removal of classes of KTLO, or, as Mr. 10 likes to describe in Factorio terms, we need quality modules.
the path to ticket to production
To be frank, the industry and foundation models aren’t yet advanced enough to fully automate software development without engineers being in or out of the loop. Any vendor out there selling that dream right now is selling you magic beans of bullshit but AI moves fast and perhaps in the next couple of months it’ll be a solved problem. Don’t get me wrong - we’re close. The continual evolution of Cursed (above), a brand-new programming language that is completely vibe-coded and hands-free, is proof to me that it will be possible in time. You see, a compiler isn’t like a Vercel v0 website. No, it’s serious stuff. It isn’t a toy. Compilers have symbolic meaning and substance. Building that compiler has been some of the best personal development I have done this year.- It has taught me many things about managing the context window.
- It has taught me to be less controlling of AI agents and more hands-free.
- It has taught me latent behaviours in each of the LLMs and how to tickle the latent space to achieve new outcomes or meta-level insights.
but, it starts by starting in the small
In the private Amp repository on GitHub, there is this mermaid diagram. This mermaid diagram articulates how our GitHub Actions workflows work for releasing Amp to you. It exists to make onboarding our staff into the project easier.
The following prompt generated it:
ps. socials
- X - https://x.com/GeoffreyHuntley/status/1931192949611827568
- BlueSky - https://bsky.app/profile/ghuntley.com/post/3lqyg7facxc2g
- LinkedIn - https://www.linkedin.com/posts/geoffreyhuntley_i-dream-of-roombas-thousands-of-automated-activity-7336959517479358466-D-3T
Key Takeaways
Notable Quotes
Just yesterday morning, I was writing a conference talk on best practices for maintaining the LLM context window, which was quite detailed.Context: The author reflects on the rapid obsolescence of technical best practices due to advancements in AI.
Instead of allocating everything to the main context window and then overflowing it, you spawn a subagent.Context: Introducing the concept of subagents to manage tasks within AI’s context window more efficiently.
All those tasks will soon be automated.Context: Predicting the future of software maintenance, where mundane tasks are automated by AI.
Related Topics
- [[topics/ai-agents]]
- [[topics/agent-native-architecture]]
- [[topics/prompt-engineering]]
- [[topics/ktlo-keep-the-lights-on]]
- [[topics/software-automation]]
Related Articles
I dream about AI subagents; they whisper to me while I'm asleep
Geoffrey Huntley · explanation · 83% similar
autoregressive queens of failure
Geoffrey Huntley · explanation · 81% similar
everything is a ralph loop
Geoffrey Huntley · explanation · 80% similar
Originally published at https://ghuntley.com/ktlo/.