Show HN: I put an AI agent on a $7/month VPS with IRC as its transport layer

250 points - yesterday at 10:41 PM

The stack: two agents on separate boxes. The public one (nullclaw) is a 678 KB Zig binary using ~1 MB RAM, connected to an Ergo IRC server. Visitors talk to it via a gamja web client embedded in my site. The private one (ironclaw) handles email and scheduling, reachable only over Tailscale via Google's A2A protocol.

Tiered inference: Haiku 4.5 for conversation (sub-second, cheap), Sonnet 4.6 for tool use (only when needed). Hard cap at $2/day.
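
One way to picture the tiered setup is the minimal Python sketch below. It is not the author's implementation: the `Tier` and `Router` names, the `needs_tools` flag, and the Sonnet prices are assumptions made for illustration. Only the $2/day cap and the Haiku prices (quoted by a commenter in this thread) come from the discussion.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tier:
    """A model tier with USD prices per million tokens."""
    name: str
    usd_per_m_input: float
    usd_per_m_output: float

    def cost(self, input_tokens: int, output_tokens: int) -> float:
        total = (input_tokens * self.usd_per_m_input
                 + output_tokens * self.usd_per_m_output)
        return total / 1_000_000


# Haiku prices as quoted in the thread; Sonnet prices are placeholders.
CHAT = Tier("haiku", 1.0, 5.0)
TOOLS = Tier("sonnet", 3.0, 15.0)  # assumption, not from the post


class BudgetExceeded(Exception):
    """Raised once the daily spend has reached the hard cap."""


@dataclass
class Router:
    daily_cap_usd: float = 2.0  # the "$2/day" hard cap from the post
    spent_usd: float = 0.0

    def pick(self, needs_tools: bool) -> Tier:
        # Refuse any further requests once the cap is reached.
        if self.spent_usd >= self.daily_cap_usd:
            raise BudgetExceeded(f"spent ${self.spent_usd:.2f} today")
        # Cheap tier for conversation, larger tier only for tool use.
        return TOOLS if needs_tools else CHAT

    def record(self, tier: Tier, input_tokens: int, output_tokens: int) -> float:
        # Add the cost of a finished request to today's running total.
        cost = tier.cost(input_tokens, output_tokens)
        self.spent_usd += cost
        return cost
```

How a request gets classified as needing tools is left open here; a real router would also have to reset `spent_usd` at the start of each day.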

A2A passthrough: the private-side agent borrows the gateway's own inference pipeline, so there's one API key and one billing relationship regardless of who initiated the request.

You can talk to nully at https://georgelarson.me/chat/ or connect with any IRC client to irc.georgelarson.me:6697 (TLS), channel #lobby.

eu_93
today at 9:47 AM
Really interesting idea. Finally a use of AI that goes beyond the simple “chat on top of a resume” and actually tries to demonstrate skills with real evidence by reading the code.
I also really like the separation between the public and private agent. It’s an architectural choice that many people ignore when talking about AI agents.
The approach to cost control and model selection is also very solid.
InitialPhase55
yesterday at 11:51 PM
Curious, how did you settle on Haiku/Sonnet? Because there are much cheaper models on OpenRouter that probably perform comparably...
Consider:
Haiku 4.5: $1/M input tokens | $5/M output tokens
MiniMax M2.7: $0.30/M input tokens | $1.20/M output tokens
Kimi K2.5: $0.45/M input tokens | $2.20/M output tokens
I haven't tried so I can't say for sure, but from personal experience, I think M2.7 and K2.5 can match Haiku and probably exceed it on most tasks, for much cheaper.
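To make the price gap concrete, here is a small Python sketch that applies the per-million-token prices quoted above to a workload. The workload size and the `workload_cost` and `compare` helpers are hypothetical, and the sketch says nothing about output quality, which is the open question in this comment.

```python
# USD per million tokens (input, output), as quoted in the comment above.
PRICES = {
    "Haiku 4.5": (1.00, 5.00),
    "MiniMax M2.7": (0.30, 1.20),
    "Kimi K2.5": (0.45, 2.20),
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one workload on the given model."""
    usd_in, usd_out = PRICES[model]
    return (input_tokens * usd_in + output_tokens * usd_out) / 1_000_000


def compare(input_tokens: int, output_tokens: int) -> dict[str, float]:
    """Cost of the same workload on every listed model."""
    return {m: workload_cost(m, input_tokens, output_tokens) for m in PRICES}


if __name__ == "__main__":
    # Hypothetical daily workload: 1M input tokens, 200k output tokens.
    for model, usd in sorted(compare(1_000_000, 200_000).items(), key=lambda kv: kv[1]):
        print(f"{model}: ${usd:.2f}")
```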
czhu12
today at 12:33 AM
Super random but I had a similar idea for a bot like this that I vibe coded while on a train from Tokyo to Osaka
https://web-support-claw.oncanine.run/
Basically, it reads your GitHub repo to give you an Intercom-like bot on your website. It answers visitors' questions so you don’t have to write knowledge bases.
faangguyindia
today at 2:21 AM
I actually use IRC in my coding agent
I switch rooms to switch between different prompts.
I use it as a remote to work on any project and continue from anywhere.
wolvoleo
today at 4:20 AM
I tried it, it was cool. I don't like nully's attitude though. Very dismissive and tough.
But I like your setup as a whole. I'll see if I can get some takeaways from it.
I do tiered here too, with the lowest tier just a qwen local bot.
By the way, I wonder how you handle the escalation from Haiku to Opus?
oceliker
today at 1:23 AM
For future reference I recommend having another Haiku instance monitor the chat and check if people are up to some shenanigans. You can use ntfy to send yourself an alert. The chat is completely off the rails right now...
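A rough Python sketch of that alerting idea, under stated assumptions: ntfy accepts a plain HTTP POST to a topic URL with the message as the body and optional `Title` and `Priority` headers. The topic name `nully-alerts`, the helper names, and the `is_suspicious` callback (a stand-in for the second Haiku instance) are all hypothetical.

```python
import urllib.request
from typing import Callable, Optional

NTFY_BASE = "https://ntfy.sh"  # public ntfy server; a self-hosted URL works too


def build_alert(topic: str, message: str, title: str = "Chat alert",
                priority: str = "high") -> urllib.request.Request:
    """Build (but do not send) an ntfy publish request."""
    return urllib.request.Request(
        f"{NTFY_BASE}/{topic}",
        data=message.encode("utf-8"),
        headers={"Title": title, "Priority": priority},
        method="POST",
    )


def send_alert(request: urllib.request.Request) -> int:
    """Send the request and return the HTTP status code."""
    with urllib.request.urlopen(request, timeout=10) as response:
        return response.status


def maybe_alert(line: str, is_suspicious: Callable[[str], bool],
                topic: str = "nully-alerts") -> Optional[urllib.request.Request]:
    """Return an alert request if the monitor flags the chat line, else None."""
    # is_suspicious is a placeholder for a call to a monitoring model.
    if is_suspicious(line):
        return build_alert(topic, f"Suspicious message: {line}")
    return None
```

Building the request separately from sending it keeps the monitor easy to test without touching the network; pass the result of `maybe_alert` to `send_alert` only when it is not `None`.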
password4321
today at 9:53 AM
This looks like a fun project. I'm going to be that guy and spam this reminder regarding the HN submission text:
Don't post generated/AI-edited comments. HN is for conversation between humans
https://news.ycombinator.com/item?id=47340079
At the very least prompt your LLM to skip the AI-isms for "your" comments!
0xbadcafebee
yesterday at 11:23 PM
This is such a great idea. I have an idea now for a bot that might help make tech hiring less horrible. It would interview a candidate to find out more about them personally/professionally. Then it would go out and find job listings, and rate them based on candidate's choices. Then it could apply to jobs, and send a link to the candidate's profile in the job application, which a company could process with the same bot. In this way, both company and candidate could select for each other based on their personal and professional preferences and criteria. This could be entirely self-hosted open-source on both sides. It's entirely opt-in from the candidate side, but I think everyone would opt-in, because you want the company to have better signal about you than just a resume (I think resumes are a horrible way to find candidates).
shreyssh
today at 7:40 AM
Cool approach using IRC as transport. I've been experimenting with MCP as the control plane for letting AI agents manage infrastructure, specifically database operations. The lightweight transport idea is underrated compared with heavy REST APIs.
ForHackernews
today at 9:03 AM
This reads like it was written by AI. I don't understand how it provides any real security if the "guardrails" against prompt injection are just a system prompt telling the dumber model "don't do this"
sbinnee
yesterday at 11:31 PM
Nice. I had some fun. Good work!
One question. Sonnet for tool use? I am just guessing here that you may have a lot of MCPs to call and for that Sonnet is more reliable. How many MCPs are you running and what kinds?
chatmasta
today at 1:34 AM
> That boundary is deliberate: the public box has no access to private data.
Challenge accepted? It’d be fun to put this to the test by putting a CTF flag on the private box at a location nully isn’t supposed to be able to access. If someone sends you the flag, you owe them 50 bucks :)
iammrpayments
today at 7:47 AM
That was very educational, I found out I didn't know a lot of stuff.
greesil
today at 3:18 AM
How do you keep it from getting prompt injected?
Oh I get it, the runtimes are nice and small and you're using Claude for the intelligence. Obv.
I think I'm just impressed with anthropic more than anything. Defcon would have me believe that prompt injections are trivial
consumer451
today at 12:57 AM
The demo seems to be in a messed up state at the moment. Maybe it's just getting hammered and too far behind?
jaboostin
today at 2:31 AM
lol I sent this link to my Claude bot connected to my Discord server and it started conversing with nully and another bot named clawdia. moltbook all over again. I’m surprised how effortlessly it connected to IRC and started talking.
anoojb
today at 2:46 AM
I wonder if this brings back demand for IRC clients on mobile devices? ;-)
agnishom
today at 1:46 AM
> The model can't tell you anything the resume doesn't already say.
Good observation. But I would worry that in the scenario when this setup is the most successful, you have built a public facing bot that allows people to dox you.
ruptwelve
today at 3:02 AM
While I am a huge fan of IRC, wouldn't it be simpler to simulate IRC, since you are embedding it? Or is the chatroom the actual point? Kudos on the project!
messh
today at 2:14 AM
This can be significantly cheaper on a VM that wakes up only when the agent works; see e.g. https://shellbox.dev
mememememememo
today at 12:53 AM
Yeah that chat got hosed by HN as any Show HN $communicationchannel does
iLoveOncall
yesterday at 11:13 PM
The model used is a Claude model, not self-hosted, so I'm not sure why the infrastructure is at all relevant here, except as click bait?
ekianjo
today at 2:25 AM
But you're relying on the Claude API, so you don't really "own the stack" as claimed in the article...
ozozozd
today at 4:21 AM
Super cool! Love seeing IRC in the wild.
Kudos and best of luck!
appstorelottery
today at 5:46 AM
Lol. /nick The IRC implementation needs to be a bit more locked down. EDIT: So much fun to be in an IRC chat room - replete with trolling! Like a Time Machine to the 90's!
topaz0
today at 3:13 AM
Curious, which API key are you using?
Imustaskforhelp
today at 6:23 AM
I have a $7/yr VPS with 512 MB RAM that can run this. I have run crush from the Charmbracelet team on the VPS and all of it just works. I get an AI agent that I can use with an OpenRouter free API key for some free agentic access, or with the free Gemini key :-)
heyitsaamir
today at 1:01 AM
Great idea and great write up!
eric_khun
today at 12:02 AM
That's so fun! How do you know when to call Haiku or Sonnet?
tc1989tc
today at 3:21 AM
It's a great project.
jgrizou
yesterday at 11:20 PM
Works very well
m00dy
today at 1:15 AM
Did you give an AI provider access to your email?
slopinthebag
today at 1:12 AM
I can tell it's vibe coded because it takes about 1 minute for a message to appear.
