Tiled Hacker news on React Router

Show HN: Rudel – Claude Code Session Analytics

137 points - yesterday at 1:41 PM

We built rudel.ai after realizing we had no visibility into our own Claude Code sessions. We were using it daily but had no idea which sessions were efficient, why some got abandoned, or whether we were actually improving over time.

So we built an analytics layer for it. After connecting our own sessions, we ended up with a dataset of 1,573 real Claude Code sessions, 15M+ tokens, 270K+ interactions.

Some things we found that surprised us: - Skills were only being used in 4% of our sessions - 26% of sessions are abandoned, most within the first 60 seconds - Session success rate varies significantly by task type (documentation scores highest, refactoring lowest) - Error cascade patterns appear in the first 2 minutes and predict abandonment with reasonable accuracy - There is no meaningful benchmark for 'good' agentic session performance, we are building one.

The tool is free to use and fully open source, happy to answer questions about the data or how we built it.

Source

c5huracan
today at 2:17 PM
The "no meaningful benchmark for good agentic session performance" point resonates. Success varies so much by task type that a single metric is almost meaningless. A 60-second documentation lookup and a 30-minute refactoring session could both be successes.
Curious what shape the benchmark takes. Are you thinking per-task-type baselines, or something more like an aggregate efficiency score?
dmix
yesterday at 3:22 PM
I've seen Claude ignore important parts of skills/agent files multiple times. I was running a clean up SKILL.md on a hundred markdown files, manually in small groups of 5, and about half the time it listened and ran the skill as written. The other half it would start trying to understand the codebase looking for markdown stuff for 2min, for no good reason, before reverting back to what the skill said.
LLMs are far from consistent.
monsterxx03
today at 1:32 PM
I built something in a similar space: Linko (https://github.com/monsterxx03/linko), a transparent MITM proxy with a webui that lets you see what's actually being sent between Claude Code and LLM APIs in real time.
```
  It's been really helpful for me to debug my own sessions and understand what the model is seeing (system prompts, tool definitions, tracing tool calls etc.).
```
emehex
yesterday at 2:23 PM
For those unaware, Claude Code comes with a built in /insights command...
Aurornis
yesterday at 3:22 PM
> 26% of sessions are abandoned, most within the first 60 seconds
Starting new sessions frequently and using separate new sessions for small tasks is a good practice.
Keeping context clean and focused is a highly effective way to keep the agent on task. Having an up to date AGENTS.md should allow for new sessions to get into simple tasks quickly so you can use single-purpose sessions for small tasks without carrying the baggage of a long past context into them.
lgvdp
today at 10:22 AM
I see a lot of people with concerns about privacy and security. Not shown in the post, but the github shows how to self host. No need to use 3rd party, you can just have your own too
tmaly
yesterday at 8:21 PM
I have seen numbers claiming tools are only called 59% of the time.
Saw another comment on a different platform where someone floated the idea of dynamically injecting context with hooks in the workflow to make things more deterministic.
152334H
yesterday at 2:14 PM
is there a reason, other than general faith in humanity, to assume those '1573 sessions' are real?
I do not see any link or source for the data. I assume it is to remain closed, if it exists.
marconardus
yesterday at 2:14 PM
It might be worthwhile to include some of an example run in your readme.
I scrolled through and didn’t see enough to justify installing and running a thing
blef
yesterday at 2:40 PM
Reminds me https://www.agentsview.io/.
KaiserPister
yesterday at 2:48 PM
This is awesome! I’m working on the Open Prompt Initiative as a way for open source to share prompting knowledge.
alyxya
yesterday at 2:54 PM
Why does it need login and cloud upload? A local cli tool analyzing logs should be sufficient.
mbesto
yesterday at 4:02 PM
So what conclusions have you drawn or could a person reasonably draw with this data?
smallerfish
yesterday at 7:53 PM
> content, the content or transcript of the agent session
Does this include the files being worked on by the agent in the session, or just the chat transcript?
ekropotin
yesterday at 2:21 PM
> That's it. Your Claude Code sessions will now be uploaded automatically.
No, thanks
ericwebb
yesterday at 5:21 PM
I 100% agree that we need tools to understand and audit these workflows for opportunities. Nice work.
TBH, I am very hesitant to upload my CC logs to a third-party service.
anthonySs
yesterday at 2:59 PM
is this observability for your claude code calls or specifically for high level insights like skill usage?
would love to know your actual day to day use case for what you built
yesterday at 3:18 PM
mentalgear
yesterday at 3:08 PM
How diverse is your dataset?
bool3max
yesterday at 7:23 PM
Why is the comment calling out the biggest issue with this so heavily downvoted? Privacy is a massive concern with this.
lau_chan
yesterday at 1:54 PM
Does it work for Codex?
dboreham
yesterday at 5:27 PM
One potential reason for sessions being abandoned within 60 seconds in my experience is realizing you forgot to set something in the environment: github token missing, tool set for the language not on the path, etc. Claude doesn't provide elegant ways to fix those things in-session so I'll just exit, fix up and start Claude again. It does have the option to continue a previous session but there's typically no point in these "oops I forgot that" cases.
cluckindan
yesterday at 1:58 PM
Nice. Now, to vibe myself a locally hosted alternative.
yangro
today at 12:16 AM
[flagged]
sriramgonella
yesterday at 2:59 PM
[flagged]
socialinteldev
yesterday at 3:59 PM
[flagged]
mrothroc
yesterday at 2:18 PM
[dead]
longtermemory
yesterday at 3:57 PM
From session analysis, it would be interesting to understand how crucial the documentation, the level of detail in CLAUDE.md, is. It seems to me that sometimes documentation (that's too long and often out of date) contributes to greater entropy rather than greater efficiency of the model and agent.
It seems to me that sometimes it's better and more effective to remove, clean up, and simplify (both from CLAUDE.md and the code) rather than having everything documented in detail.
Therefore, from session analysis, it would be interesting to identify the relationship between documentation in CLAUDE.md and model efficiency. How often does the developer reject the LLM output in relation to the level of detail in CLAUDE.md?
aplomb1026
yesterday at 5:32 PM
[flagged]
bhekanik
yesterday at 2:01 PM
[dead]
Sebastian_Dev
yesterday at 3:38 PM
[dead]
huflungdung
yesterday at 3:38 PM
[dead]
ptak_dev
yesterday at 7:27 PM
[flagged]
multidude
yesterday at 2:17 PM
[flagged]
mihir_kanzariya
yesterday at 3:13 PM
[flagged]
robutsume
yesterday at 4:01 PM
[flagged]
ozgurozkan
yesterday at 2:55 PM
[flagged]
vova_hn2
yesterday at 2:37 PM
This is so sad that on top of black box LLMs we also build all these tools that are pretty much black box as well.
It became very hard to understand what exactly is sent to LLM as input/context and how exactly is the output processed.

Show HN: Rudel – Claude Code Session Analytics

c5huracan

dmix

cbg0

conception

keks0r

dmix

stpedgwdgfhgdd

monsterxx03

emehex

loopmonster

hombre_fatal

fragmede

hombre_fatal

keks0r

huflungdung

evrendom

Aurornis

sethammons

eddythompson80

longtermemory

lgvdp

evrendom

tmaly

evrendom

152334H

keks0r

languid-photic

marconardus

keks0r

blef

keks0r

mentalgear

KaiserPister

keks0r

alyxya

keks0r

mbesto

avilesrafa

evrendom

smallerfish

evrendom

ekropotin

keks0r

tgtweak

keks0r

jamiemallers

ericwebb

evrendom

ericwebb

anthonySs

keks0r

mentalgear

keks0r

bool3max

lau_chan

keks0r

dboreham

cluckindan

vidarh

keks0r

vidarh

keks0r

yangro

sriramgonella

keks0r

socialinteldev

avilesrafa

DeltaCoast

simpsond

mrothroc

keks0r

mrothroc

longtermemory

avilesrafa

aplomb1026

bhekanik

Sebastian_Dev

huflungdung

ptak_dev