\

How we hacked McKinsey's AI platform

317 points - today at 9:59 AM

Source
  • frankfrank13

    today at 3:03 PM

    Some insider knowledge: Lilli was, at least a year ago, internal only. VPN access, SSO, all the bells and whistles, required. Not sure when that changed.

    McKinsey requires hiring an external pen-testing company to launch even to a small group of coworkers.

    I can forgive this kind of mistake on the part of the Lilli devs. A lot of things have to fail for an "agentic" security company to even find a public endpoint, much less start exploiting it.

    That being said, the mistakes in here are brutal. Seems like close to 0 authz. Based on very outdated knowledge, my guess is a Sr. Partner pulled some strings to get Lilli to be publicly available. By that time, much/most/all of the original Lilli team had "rolled off" (gone to client projects) as McKinsey HEAVILY punishes working on internal projects.

    So Lilli likely was staffed by people who couldn't get staffed elsewhere, didn't know the code, and didn't care. Internal work, for better or worse, is basically a half day.

    This is a failure of McKinsey's culture around technology.

      • OptionOfT

        today at 4:39 PM

        Couple of things to add:

        McKinsey has a weird structure where there are too many cooks in the kitchen.

        Everybody there is reviewed on client impact, meaning it ends up being an everybody-for-themselves situation.

        So as a developer you have little guidance (in fact, you're still being reviewed on client impact, even if you have 0 client exposure).

        Then a (Senior) Partner comes in with this idea (that will get them a good review), and you jump on that. After all, it's all you can do to get a good review.

        You work on it, and then the (Senior) Partner moves on. But it's not done. It's enough for the review, but continuing to work on it doesn't bring you anything, in fact, it will actually pull you down, as finishing the project doesn't give immediate client results.

        So what does this mean? Most products of McKinsey are a grab-bag of raw ideas of leadership, implemented as a one-off, without a cohesive vision or even a long-term vision at all. It's all about the review cycle.

        McKinsey is trying to do software like they do their other engagements. It doesn't work. You can't just do something for 6 months and then let it go. Software rots.

        The fact that they laid off a good amount of (very good) software engineers in 2024 is a reflection on how they see software development.

        And McKinsey's people, who go to other companies, take those ideas with them. Result: The UI of your project changes all the time, because everybody is looking at the short-term impact they have that gets them a good review, not what is best for the project in the long term.

          • yard2010

            today at 7:38 PM

            I'm far from being an expert, but it sounds like this company needs some consultancy.

              • munk-a

                today at 8:20 PM

                Can McKinsey fund McKinsey by consulting for McKinsey? Could we oroborus corporate consulting so that those consultants could be trapped in a loop and those of us doing useful work wouldn't need to interact with them anymore?

            • gavinray

              today at 7:15 PM

              Why would anyone work there, then, unless that's the only place they could get hired as a dev?

              And if the latter is the case, then that sort of stamps the case closed from the get-go...

                • dmbche

                  today at 7:34 PM

                  Great money?

                    • ng12

                      today at 7:49 PM

                      According to levels the pay band caps out around $250k and a principal title. It's good but probably not enough for most to put up with the culture long term.

                        • john_strinlai

                          today at 8:04 PM

                          >[...] the pay band caps out around $250k [...] probably not enough for most [...]

                          an absolutely wild statement to 99.9% of the world

              • steve1977

                today at 5:17 PM

                > McKinsey is trying to do software like they do their other engagements. It doesn't work.

                I mean, it doesn't work for their consulting gigs either. There's a reason McKinsey has such a bad reputation.

                  • _doctor_love

                    today at 6:17 PM

                    But it does work for them? They make tons of money.

                      • steve1977

                        today at 7:43 PM

                        Well, fair point. It doesn't work for their clients.

                        • operatingthetan

                          today at 7:03 PM

                          As an ex-consultant: consulting at that level is kind of a grift. They over-promise and under-deliver as SOP. It's ripe for AI disruption, whatever that looks like.

                            • steve1977

                              today at 7:44 PM

                              Ideally, executives will get replaced by AI soon. Which should actually be easier than engineers. That will kind of solve the consulting problem automatically.

                      • Spooky23

                        today at 7:37 PM

                        Their model works great.

                        It’s really about bypassing the existing power structure of the company. Competence of the work itself is a secondary objective. Most in-house initiatives can be slow rolled by management.

                        The fresh faced consultant with 2-3 steps to access the CEO neutralizes that. It seems grifty but is really exploiting bugs in corporate governance.

                        The current fad of firing the managers is a riff on this. Every jackass C-level is coming up with the novel idea of flattening.

                          • steve1977

                            today at 7:48 PM

                            This somehow implies that initiatives or strategies from consultants are somewhat successful. This is not the case in my experience.

                              • entrox

                                today at 8:21 PM

                                No, you misunderstood. It is not about their output, it almost never is.

                                Most of the times, the business decision has already been made long before McK is hired. It’s all about legitimizing that decision and making it happen.

                                You can also wield them as a weapon against internal competitors or opponents. Look up how they were used to kill off Cariad for example.

                • cmiles8

                  today at 4:00 PM

                  Net conclusion: Don’t hire McKinsey to advise on AI implementation or tech org design and practices if they can’t get it right themselves.

                    • frankfrank13

                      today at 4:17 PM

                      Fair take, but you'd be hard pressed to find much resemblance to any advice McK gives to its own practices.

                      Pre-AI, I always said McK is good at analysis, if you need complicated analysis done, hire a consulting firm.

                      If you need strategy, custom software, org design, etc. I think you should figure out the analysis that needs to be done, shoot that off to a consulting firm, and then make your decision.

                      IME, F500 execs are delegation machines. When they wake up every morning with 30 things to delegate, and 25 execs to delegate to, they hire 5 consulting teams. Whether you hire Mck, or Deloitte, or Accenture will only come down to:

                      1. Your personal relationships

                      2. Your company's policies on procurement

                      3. Your budget

                      in that order.

                      McK's "secret sauce" is that if you, the exec, don't like the powerpoint pages Mck put in front of you, 3 try-hard, insecure, ivy-league educated analysts will work 80 hours to make pages you do like. A sr. partner will take you to dinner. You'll get invited to conferences and summits and roundtables, and then next time you look for a job, it will be easier.

                        • decidu0us9034

                          today at 5:14 PM

                          Analysis of what? What does that mean? What's something you conceivably would need a consulting firm to "analyze?" I don't understand why management consulting firms would hire software people in the first place, and then punish them for not being on a client-facing project. That seems a bit contradictory to me, but this is all way out of my wheelhouse

                            • frankfrank13

                              today at 5:58 PM

                              Analysis:

                              1. How do I build a datacenter

                              2. How is the industrial ceramic market structured, how do they perform

                              3. How does a changing environment impact life insurance

                              Strategy:

                              1. Should I build a datacenter

                              2. Should I invest in an industrial ceramics company

                              3. Should I divest my life insurance subsidiary

                              Specifically in the software world this would be "automate some esoteric ERP migration" or "build this data pipeline" vs. "how can we be more digital native" or "how do we integrate more AI into our company"

                              • cl0ckt0wer

                                today at 5:40 PM

                                For instance, what would we need to start offering siracha in our burger?

                        • steve1977

                          today at 5:19 PM

                          The only people who hire McKinsey are execs who are even more clueless than the consultants.

                            • aleph_minus_one

                              today at 7:46 PM

                              The executives who hire McKinsey are often not clueless, but they often lack the political power in the company to push through their plans. So they hire some well-regarded business consultancy to get an "objective" analysis what needs to be done.

                                • bonoboTP

                                  today at 8:05 PM

                                  How can it be that what you just wrote is such a widely known fact? I've been reading this and hearing this from consultancy people as well for many years now. If the guy lacks the political power, why don't his internal political opponents say, "nice try hiring the consultants, but we know this trick very well, you still don't get it your way".

                                  It has to be some kind of higher level protection racket or something. Like if you hire the consultants there is some kind of kickbacks to the higherups or something with more steps involved where those who previously opposed it will now accept it if it's rubberstamped by the consultants.

                                  Or perhaps those other players who are politically opposing this person are just dummies and don't know about this trick and actually trust the consultants. Or maybe it's a bit of a check, that you can't get anything and everything rubberstamped by the consultants, so it is some kind of sanity filter that the guy isn't proposing something that only benefits himself and screws everyone else.

                                  And if it's the latter, then it is genuine value, a somewhat impartial second opinion. Basically there is a fog-of-war for all the execs regarding all the internal politics going on, it's not like they see through everything all the time and simply refuse to take the obviously correct decision for no reason.

                          • m4rtink

                            today at 4:21 PM

                            This can be simplified further: "Don't hire McKinsey." ;-)

                        • eisa01

                          today at 4:31 PM

                          Maybe it was opened up so it could be used in recruiting?

                          McKinsey challenges graduates to use AI chatbot in recruitment overhaul: https://www.ft.com/content/de7855f0-f586-4708-a8ed-f0458eb25...

                            • j45

                              today at 4:31 PM

                              Using a 2 year old paradigm.

                              And require a chatbot to be used that can be easily gamed by asking a model of how best to navigate it lol.

                              Implementing the past of AI practices is requesting something that will be easily outdone.

                          • dahcryn

                            today at 4:31 PM

                            is this the same at quantumblack? They at least give the impression their assets on Brix are somewhat up to date and uesable

                            • j45

                              today at 4:31 PM

                              I am not sure what accounting or management consulting firms are doing in tech.

                              They look to package up something and sell it as long as they can.

                              AI solutions won't have enough of a shelf life, and the thought around AI is evolving too quickly.

                              Very happy to be wrong and learn from any information folks have otherwise.

                                • fidotron

                                  today at 4:35 PM

                                  The purpose of hiring them is to make them come to the conclusion you already have, so when it goes well you get the credit for doing it, or if it goes sideways you can pin the blame on them.

                                    • boringg

                                      today at 5:05 PM

                                      Or, alternatively, there are so many companies that are weak on tech they pay for someone else to guide them.

                                        • frankfrank13

                                          today at 5:24 PM

                                          Yeah its more this, the companies who ask Mck's help in software tend to hire contractors or vend out software already.

                                      • apercu

                                        today at 6:24 PM

                                        Most companies are not _just_ tech companies and don't have business analysts, consulting analysts, solutions consultants, software engineers and DBA's on staff.

                                        Many, many, many companies are very happy with the consulting firms they hire.

                                        Of course, those are the consulting firms that aren't publicly traded and in the news all the time (for all the wrong reasons).

                            • joenot443

                              today at 2:17 PM

                              > One of those unprotected endpoints wrote user search queries to the database. The values were safely parameterised, but the JSON keys — the field names — were concatenated directly into SQL.

                              I was expecting prompt injection, but in this case it was just good ol' fashioned SQL injection, possible only due to the naivety of the LLM which wrote McKinsey's AI platform.

                                • 3abiton

                                  today at 6:50 PM

                                  I just wonder how much professional grade code written by LLMs, "reviewed" by devs, and commited that made similar or worse mistakes. A funny consequence of the AI boom, especially in coding, is the eventual rise in need for security researchers.

                                  • simonw

                                    today at 2:22 PM

                                    Yeah, gotta admit I'm a bit disappointed here. This was a run-of-the-mill SQL injection, albeit one discovered by a vulnerability scanning LLM agent.

                                    I thought we might finally have a high profile prompt injection attack against a name-brand company we could point people to.

                                      • jfkimmes

                                        today at 2:58 PM

                                        Not the same league as McKinsey, but I like to point to this presentation to show the effects of a (vibe coded) prompt injection vulnerability:

                                        https://media.ccc.de/v/39c3-skynet-starter-kit-from-embodied...

                                        > [...] we also exploit the embodied AI agent in the robots, performing prompt injection and achieve root-level remote code execution.

                                        • TheDong

                                          today at 3:04 PM

                                          Github actions has had a bunch of high-profile prompt injection attacks at this point, most recently the cline one: https://adnanthekhan.com/posts/clinejection/

                                          I guess you could argue that github wasn't vulnerable in this case, but rather the author of the action, but it seems like it at least rhymes with what you're looking for.

                                            • simonw

                                              today at 4:06 PM

                                              Yeah that was a good one. The exploit was still a proof of concept though, albeit one that made it into the wild.

                                          • danenania

                                            today at 3:09 PM

                                            > I thought we might finally have a high profile prompt injection attack against a name-brand company we could point people to.

                                            These folks have found a bunch: https://www.promptarmor.com/resources

                                            But I guess you mean one that has been exploited in the wild?

                                              • simonw

                                                today at 4:15 PM

                                                Yeah I'm still optimistic that people will start taking this threat seriously once there's been a high profile exploit against a real target.

                                        • doctorpangloss

                                          today at 3:53 PM

                                          The tacit knowledge to put oauth2-proxy in front of anything deployed on the Internet will nonetheless earn me $0 this year, while Anthropic will make billions.

                                          • oliver_dr

                                            today at 2:47 PM

                                            [dead]

                                        • bee_rider

                                          today at 2:24 PM

                                          I don’t love the title here. Maybe this is a “me” problem, but when I see “AI agent does X,” the idea that it might be one of those molt-y agents with obfuscated ownership pops into my head.

                                          In this case, a group of pentesters used an AI agent to select McKinsey and then used the AI agent to do the pentesting.

                                          While it is conventional to attribute actions to inanimate objects (car hits pedestrians), IMO we should be more explicit these days, now that unfortunately some folks attribute agency to these agentic systems.

                                            • simonw

                                              today at 2:45 PM

                                              Yeah, the original article title "How We Hacked McKinsey's AI Platform" is better.

                                              • tasuki

                                                today at 2:58 PM

                                                > now that unfortunately some folks attribute agency to these agentic systems.

                                                You're doing that by calling them "agentic systems".

                                                  • bee_rider

                                                    today at 5:44 PM

                                                    Unfortunately that’s what they are called. I was hoping the phrasing would highlight the problem rather than propagate it.

                                                      • pixl97

                                                        today at 7:23 PM

                                                        Eh, if you tell me that I need to do X, then I can make choices on how to accomplish X, that I am no longer an agent as a human?

                                                        You're trying to redefine long standing definitions for God knows what reason.

                                                          • bee_rider

                                                            today at 7:34 PM

                                                            The difference is that you are a sentient person who decides to follow my instructions, not just a tool that I use.

                                                • dang

                                                  today at 7:17 PM

                                                  Ok, we've reverted the title (submitted title was "AI Agent Hacks McKinsey")

                                                  • causal

                                                    today at 2:57 PM

                                                    Yah it's just an ad, and "Pentesting agents finds low-hanging vulnerability" isn't gonna drive clicks.

                                                      • nkozyra

                                                        today at 7:33 PM

                                                        ... at a massive company

                                                        That's important. Cloudwall isn't really saying they have some secret sauce here, but it's noteworthy who they nabbed.

                                                        • jacquesm

                                                          today at 3:14 PM

                                                          It's not an ad for McKinsey though.

                                                  • fhd2

                                                    today at 2:05 PM

                                                    > This was McKinsey & Company — a firm with world-class technology teams [...]

                                                    Not exactly the word on the street in my experience. Is McKinsey more respected for software than I thought? Otherwise I'm curious why TFA didn't just politely leave this bit out.

                                                      • aerhardt

                                                        today at 2:08 PM

                                                        The LLM that wrote this simply couldn’t help itself.

                                                          • codechicago277

                                                            today at 2:23 PM

                                                            Picked up a vibe, but couldn’t confirm it until the last paragraph, but yeah clearly drafted with at least major AI help.

                                                              • nprateem

                                                                today at 6:12 PM

                                                                > Why this matters

                                                                Hello Gemini

                                                                • vanillameow

                                                                  today at 3:09 PM

                                                                  Can we stop softening the blow? This isn't "drafted with at least major AI help", it's just straight up AI slop writing. Let's call a spade a spade. I have yet to meet anyone claiming they "write with AI help but thoughts are my own" that had anything interesting to say. I don't particularly agree with a lot of Simon Willison's posts but his proofreading prompt should pretty much be the line on what constitutes acceptable AI use for writing.

                                                                  https://simonwillison.net/guides/agentic-engineering-pattern...

                                                                  Grammar check, typo check, calls you out on factual mistakes and missing links and that's it. I've used this prompt once or twice for my own blog posts and it does just what you expect. You just don't end up with writing like this post by having AI "assistance" - you end up with this type of post by asking Claude, probably the same Claude that found the vulnerability to begin with, to make the whole ass blog post. No human thought went into this. If it did, I strongly urge the authors to change their writing style asap.

                                                                  "So we decided to point our autonomous offensive agent at it. No credentials. No insider knowledge. And no human-in-the-loop. Just a domain name and a dream."

                                                                  Give me a fucking break

                                                                    • skybrian

                                                                      today at 4:16 PM

                                                                      Your reaction is worse than the article. There's no way you could know for sure what their writing process was, but that doesn't stop you from making overconfident claims.

                                                                        • theredbeard

                                                                          today at 6:28 PM

                                                                          I’m sorry but no attempt was made here. It contains all the red flags in the first few paragraphs.

                                                                      • beepbooptheory

                                                                        today at 7:34 PM

                                                                        One thing I've learned recently is a lot guys (like here) have been out here reading each word of a given company's tech blog, closely parsing each sentence construction.. I really cant imagine being even concious of the prose for something like this. A corporate blog, to me, has some base level of banality to it. It's like reading a cereal box and getting angry at the lack of nuance.

                                                                        Like who cares? Is there really some nostalgia for a time before this? When reading some press release from a cybersecurity company was akin to Joyce or Nabakov or whatever? (Maybe Hemingway...)

                                                                        We really gotta be picking our battles here imo, and this doesn't feel like a high priority target. Let companies be the weird inhuman things that they are.

                                                                        Read a novel! They are great, I promise. Then when you read other stuff, maybe you won't feel so angry?

                                                                        • yomismoaqui

                                                                          today at 5:38 PM

                                                                          Sorry but seems like most people don't care or even like AI writing more:

                                                                          https://x.com/kevinroose/status/2031397522590282212

                                                                            • toraway

                                                                              today at 6:46 PM

                                                                              That's the problem with AI writing in a nutshell. In a blind, relatively short comparison (similarly used for RLHF), AI writing has a florid, punchy quality that intuitively feels like high quality writing.

                                                                              But then after you read the exact same structure a dozen times a day on the web, it becomes like nails on the chalkboard. It's a combination of "too much of a good thing" with little variation throughout a long piece of prose, and basic pattern recognition of AI output from a model coalescing to a consistent style that can be spotted as if 1-3 human ghost writers wrote 1/4 of the content on the web.

                                                                      • theredbeard

                                                                        today at 6:27 PM

                                                                        A vibe? It’s completely obvious AI slop with no attempt to make it legible. They didn’t even prompt out the emdashes. For such a cool finding this is extremely disappointing.

                                                                • alexpotato

                                                                  today at 3:44 PM

                                                                  They generally hire smart people who are good at a combination of:

                                                                  - understanding existing systems

                                                                  - what the paint points are

                                                                  - making suggestions on how to improve those systems given the paint points

                                                                  - that includes a mix of tech changes, process updates and/or new systems etc

                                                                  Now, when it comes to implementing this, in my experience it usually ends up being the already in place dev teams.

                                                                  Source: worked at a large investment bank that hired McKinsey and I knew one of the consultants from McK prior to working at the bank.

                                                                    • xpe

                                                                      today at 5:58 PM

                                                                      My take*: McKinsey hiring largely selects for staying calm under pressure and presenting a confident demeanor to clients. Verbal fluency with decision-making frameworks goes a long way. Having strong analytical skills seemed essential; hopefully the bar for "sufficiently analytical" has raised along with general data science skills in industry.

                                                                      I don't view them as top-tier experts in their own right, whether it be statistics or technology, but they have a knack for corporate maneuvering. I often question their overall value beyond the usual "hire the big guns to legitimize a change" mentality. Maybe a useful tradeoff? I'd rather see herd-like adoption of current trends than widespread corporate ignorance and insularity.**

                                                                      A huge selling point for M&Co is kind of a self-fulfulling prophecy based on the access they get. This gives them a positive feedback loop to find the juiciest and most profitable areas to focus on.

                                                                      For those who know more, how do my takes compare?

                                                                      * I interviewed with them over 15 years ago, know people who have worked there, and I pay attention to their reports from time to time.

                                                                      ** Of course, I'd rather see a third way: cross-pollination between organizations to build strong internal expertise and use model-based decision making for nuanced long-term decisions... but that's just crazy talk.

                                                                        • alexpotato

                                                                          today at 6:39 PM

                                                                          > Having strong analytical skills seemed essential

                                                                          and

                                                                          > they have a knack for corporate maneuvering

                                                                          One way to view this is that the above combination of skills is both rare and very useful. That means it's expensive. So instead of hiring someone like that at "full rate" and keeping them around, you can "borrow" them from McK to solve a problem your regular crew can't (or isn't able to) for various reasons.

                                                                          Plus, as one manager of mine said many years ago:

                                                                          "We use consultants b/c they are both easy to hire AND easy to fire"

                                                                  • sharadov

                                                                    today at 3:52 PM

                                                                    No, they don't have world class technology teams, they hire contractors to do all the tech stuff, their expertise is in management, yes that's world class.

                                                                      • OvervCW

                                                                        today at 4:36 PM

                                                                        Yes, world class in causing human suffering.

                                                                        https://www.youtube.com/watch?v=Q7pgDmR-pWg

                                                                        • cmiles8

                                                                          today at 5:43 PM

                                                                          Is it though? Managing teams to not torpedo your company with stupid stuff like this is kinda core to “good management.” The evidence would indicate they’re not very good at that either.

                                                                            • theredbeard

                                                                              today at 6:31 PM

                                                                              It’s a self fulfilling prophecy. They’re extremely expensive so they must be good so they must be worth it. And because at that level measurement is extremely subjective it’s mainly about the vibes.

                                                                              Like everything it’s just marketing.

                                                                              • linhns

                                                                                today at 6:43 PM

                                                                                They were good. Not so good now.

                                                                        • lenerdenator

                                                                          today at 2:14 PM

                                                                          > Not exactly the word on the street in my experience.

                                                                          Depends on the street you're on. Are you on Main Street or Wall Street?

                                                                          If you're hiring them to help with software for solving a business problem that will help you deliver value to your customers, they're probably just like anyone else.

                                                                          If you're hiring them to help with software for figuring out how to break down your company for scrap, or which South African officials to bribe, well, that's a different matter.

                                                                      • sriramgonella

                                                                        today at 6:36 PM

                                                                        One interesting takeaway here is how quickly AI agents expose weaknesses in internal systems.

                                                                        Many enterprise tools were designed assuming human interaction, where authentication flows, manual reviews, and internal processes add implicit safeguards.

                                                                        But once you introduce autonomous agents that can systematically probe endpoints, missing authorization checks or misconfigured APIs become much easier to discover and exploit.

                                                                        I suspect we’ll see a growing need for automated validation layers that continuously test internal AI tools for access control, data exposure, and unintended behaviors before they’re widely deployed.

                                                                        • sigmar

                                                                          today at 2:39 PM

                                                                          I've got no idea who codewall is. Is there acknowledgment from McKinsey that they actually patched the issue referenced? I don't see any reference to "codewall ai" in any news article before yesterday and there's no names on the site.

                                                                          https://www.google.com/search?q=codewall+ai

                                                                            • today at 5:32 PM

                                                                              • rzmmm

                                                                                today at 3:07 PM

                                                                                Yeah can't find much information either. I would like to see at least some proof. Either via Mckinsey or from the security team.

                                                                                • doron

                                                                                  today at 3:36 PM

                                                                                  it is weird isn't it? The register article implies that it's acknowledged by McKinsey- https://www.theregister.com/2026/03/09/mckinsey_ai_chatbot_h...

                                                                                  Edit: Apparently, this is the CEO https://github.com/eth0izzle

                                                                                    • sigmar

                                                                                      today at 4:17 PM

                                                                                      >A McKinsey spokesperson told The Register that it fixed all of the issues identified by CodeWall within hours of learning about the problems.

                                                                                      Ah. Thanks for the link. I'm suspicious of everything posted to a blog without proof these days.

                                                                                  • eisa01

                                                                                    today at 4:04 PM

                                                                                    If it's true that there's 58k users in the dump, that would mean former employees are in the dump

                                                                                    I assume that means McKinsey would need to disclose it, or at least alert the former employees of the breach?

                                                                                    • philipwhiuk

                                                                                      today at 5:35 PM

                                                                                      There's a responsible disclosure timeline at the bottom indicating they'd all been fixed.

                                                                                  • gbourne1

                                                                                    today at 1:29 PM

                                                                                    - "The agent mapped the attack surface and found the API documentation publicly exposed — over 200 endpoints, fully documented. Most required authentication. Twenty-two didn't."

                                                                                    Well, there you go.

                                                                                    • elorant

                                                                                      today at 7:55 PM

                                                                                      Meanwhile, you're paying top dollars to a consulting firm that resolves back to an LLM to provide its services.

                                                                                      • sailfast

                                                                                        today at 6:22 PM

                                                                                        What I don't see in this article that should be explicit:

                                                                                        If your data is in this database, it's gone. Other people have it. Your sensitive data that you handed over to their teams has vanished in a puff of smoke. You should probably ask if your data was part of the leak.

                                                                                        Fail to see how a state actor would not have come across this already.

                                                                                        • cmiles8

                                                                                          today at 2:12 PM

                                                                                          I can only remember a McKinsey team pushing Watson on us hard ages ago. Was a total train wreck.

                                                                                          They’ve long been all hype no substance on AI and looks like not much has changed.

                                                                                          They might be good at other things but would run for the hills if McKinsey folks want to talk AI.

                                                                                          • paxys

                                                                                            today at 2:22 PM

                                                                                            > named after the first professional woman hired by the firm in 1945

                                                                                            Going out of their way to find a woman's name for an AI assistant and bragging about it is not as empowering as the creators probably thought in their heads.

                                                                                            • sgt101

                                                                                              today at 1:52 PM

                                                                                              Why was there a public endpoint?

                                                                                              Surely this should all have been behind the firewall and accessible only from a corporate device associated mac address?

                                                                                                • consp

                                                                                                  today at 3:40 PM

                                                                                                  > accessible only from a corporate device associated mac address

                                                                                                  Like that ever stopped anyone. That's just a checkbox item.

                                                                                                  • jihadjihad

                                                                                                    today at 2:04 PM

                                                                                                    Surely.

                                                                                                • nubg

                                                                                                  today at 4:20 PM

                                                                                                  Could the author please provide the prompt that was used to vibe write this blog post? The topic is interesting, but I would rather read the original prompt, as I am not sure which parts still match what the author wanted to say, vs flowerly formulations for captivating reading that the LLM produced.

                                                                                                  • bxguff

                                                                                                    today at 4:08 PM

                                                                                                    Its so funny its a SQL injection because drum roll you can't santize llm inputs. Some problems are evergreen.

                                                                                                      • dmix

                                                                                                        today at 5:00 PM

                                                                                                        Technically it was a search box input no prompts. Which tbf are often endpoints reused by RAGs

                                                                                                    • StartupsWala

                                                                                                      today at 5:35 PM

                                                                                                      One interesting takeaway here is how quickly organizations are deploying AI tools internally without fully adapting their security models.

                                                                                                      Traditional application security assumes fairly predictable inputs and workflows, but LLM-based systems introduce entirely new attack surfaces—prompt injection, data leakage, tool misuse, etc.

                                                                                                      It feels like many enterprises are still treating these systems as just another SaaS product rather than something closer to an autonomous system that needs a different threat model...

                                                                                                      • nullcathedral

                                                                                                        today at 4:02 PM

                                                                                                        I think the underlying point is valid. Agents are a potential tool to add to your arsenal in addition to "throw shit at the wall and see what sticks" tools like WebInspect, Appscan, Qualys, and Acunetix.

                                                                                                        • himata4113

                                                                                                          today at 5:05 PM

                                                                                                          How long until a hallucinated data breach that spreads globally. There's a few inconsistencies and the typical low effort language AI has.

                                                                                                          • sd9

                                                                                                            today at 1:53 PM

                                                                                                            Cool but impossible to read with all the LLM-isms

                                                                                                              • vanillameow

                                                                                                                today at 2:12 PM

                                                                                                                Tiring. Internet in 2026 is LLMs reporting on LLMs pen-testing LLM-generated software.

                                                                                                                • causal

                                                                                                                  today at 2:54 PM

                                                                                                                  Those short "punchy sentence" paragraphs are my new trigger:

                                                                                                                  > No credentials. No insider knowledge. And no human-in-the-loop. Just a domain name and a dream.

                                                                                                                  It just sounds so stupid.

                                                                                                                    • darkport

                                                                                                                      today at 4:25 PM

                                                                                                                      Founder of CodeWall here. It's quite funny because whilst an LLM did write the bulk of the posts factual content (based on the agents findings), I wrote the intro and summary at the end. That's just my writing style. Feel free to read my personal blog to compare: https://darkport.co.uk

                                                                                                                        • bootsmann

                                                                                                                          today at 6:35 PM

                                                                                                                          Idk how big your team is of course but imo try to hire a technical writer (they’re really cheap now), it pays dividends for a long time as consistent style and keywords build up SEO reputation. This article is making the rounds, some bigger papers picked it up, it is very valuable to land it well.

                                                                                                                          • causal

                                                                                                                            today at 4:50 PM

                                                                                                                            If you really DID come up with that paragraph 100% completely on your own with no LLM influence then...I apologize for the insult, though I can't really back out from what I said. It's still a bombastic way of saying very little.

                                                                                                                        • consp

                                                                                                                          today at 3:44 PM

                                                                                                                          It's an actual story telling method, molded into a supposed to be informative article with a bunch of "please make it interesting" sprinkled on top of it. These day known as the what's left of the internet.

                                                                                                                          • philipwhiuk

                                                                                                                            today at 5:37 PM

                                                                                                                            It's LinkedIn speech.

                                                                                                                            Two word sentences, each one on a new line.

                                                                                                                              • causal

                                                                                                                                today at 7:16 PM

                                                                                                                                Ah. That might be why I find it especially triggering.

                                                                                                                        • today at 2:24 PM

                                                                                                                      • gonzalovargas

                                                                                                                        today at 5:22 PM

                                                                                                                        That data is worth billions to frontier AI labs. I wonder if someone is already using it to train models

                                                                                                                        • VadimPR

                                                                                                                          today at 3:35 PM

                                                                                                                          I wonder how these offensive AI agents are being built? I am guessing with off the shelf open LLMs, finetuned to remove safety training, with the agentic loop thrown in.

                                                                                                                          Does anyone know for sure?

                                                                                                                            • simonw

                                                                                                                              today at 4:25 PM

                                                                                                                              Honestly you can point regular Claude Code or Codex CLI at a web app and tell it to start a penetration test and get surprisingly good results from their default configurations.

                                                                                                                                • VadimPR

                                                                                                                                  today at 5:30 PM

                                                                                                                                  I didn't think of that given how censored the models are becoming. Thanks for the idea! I'll try it against my websites before anyone else gets to it.

                                                                                                                          • bananamogul

                                                                                                                            today at 5:27 PM

                                                                                                                            At first glance, I thought this was about an AI agent named "Hacks McKinsey."

                                                                                                                            • quinndupont

                                                                                                                              today at 5:17 PM

                                                                                                                              I’m waiting for the agentic models trained on virus and worm datasets to join the red team!

                                                                                                                              • build-or-die

                                                                                                                                today at 6:50 PM

                                                                                                                                parameterized values but raw key concatenation is the kind of thing that looks safe in code review. easy to miss for humans, but an agent will just keep poking at every input until something breaks.

                                                                                                                                • ecshafer

                                                                                                                                  today at 2:55 PM

                                                                                                                                  If the AI was poisoned to alter advice, then maybe McKinsey advice would actually be a net good.

                                                                                                                                  • jacquesm

                                                                                                                                    today at 3:16 PM

                                                                                                                                    And: AI agent writes blog post.

                                                                                                                                    • captain_coffee

                                                                                                                                      today at 2:14 PM

                                                                                                                                      Music to my ears! Couldn't happen to a better company!

                                                                                                                                      • palmotea

                                                                                                                                        today at 3:29 PM

                                                                                                                                        With all we've been learning from stuff like the Epstein emails, it would have been nice if someone had leaked this data:

                                                                                                                                        > 46.5 million chat messages. From a workforce that uses this tool to discuss strategy, client engagements, financials, M&A activity, and internal research. Every conversation, stored in plaintext, accessible without authentication.

                                                                                                                                        > 728,000 files. 192,000 PDFs. 93,000 Excel spreadsheets. 93,000 PowerPoint decks. 58,000 Word documents. The filenames alone were sensitive and a direct download URL for anyone who knew where to look.

                                                                                                                                        I'm sure lots of very informative journalism could have been done about how corporate power actually works behind the scenes.

                                                                                                                                          • cmiles8

                                                                                                                                            today at 5:50 PM

                                                                                                                                            That information is likely already in the hands of various folks as I highly doubt the authors were the first to find this glaring security issue, they’re likely only the first to disclose it. If McKinsey has hard data that nobody else exploited this now would be a good time to disclose that given what sounds like an extremely severe data leak.

                                                                                                                                              • frankfrank13

                                                                                                                                                today at 6:03 PM

                                                                                                                                                The chat messages are very very sensitive. You could easily reverse engineer nearly every ongoing Mck engagement. The underlying data is not as sensitive, its decades of post-mortems, highly sanitized. No client names, no real numbers.

                                                                                                                                        • cs702

                                                                                                                                          today at 3:49 PM

                                                                                                                                          ... in two hours:

                                                                                                                                          > No credentials. No insider knowledge. And no human-in-the-loop. Just a domain name and a dream. ... Within 2 hours, the agent had full read and write access to the entire production database.

                                                                                                                                          Having seen firsthand how insecure some enterprise systems are, I'm not exactly surprised. Decision makers at the top are focused first and foremost on corporate and personal exposure to liability, also known as CYA in corporate-speak. The nitty-gritty details of security are always left to people far down the corporate chain who are supposed to know what they're doing.

                                                                                                                                          • peterokap

                                                                                                                                            today at 3:59 PM

                                                                                                                                            I wonder what is their security level and Observability method to oversee the effort.

                                                                                                                                            • victor106

                                                                                                                                              today at 2:54 PM

                                                                                                                                              this reads like it was written by an LLM

                                                                                                                                              • lenerdenator

                                                                                                                                                today at 1:59 PM

                                                                                                                                                Not exactly clear from the link: were they doing red team work for McKinsey or is this just "we found a company we thought wouldn't get us arrested and ran an AI vuln detector over their stuff"?

                                                                                                                                                You'd think that the world's "most prestigious consulting firm" would have already had someone doing this sort of work for them.

                                                                                                                                                  • frereubu

                                                                                                                                                    today at 2:51 PM

                                                                                                                                                    From TFA: "Fun fact: As part of our research preview, the CodeWall research agent autonomously suggested McKinsey as a target citing their public responsible diclosure policy (to keep within guardrails) and recent updates to their Lilli platform. In the AI era, the threat landscape is shifting drastically — AI agents autonomously selecting and attacking targets will become the new normal."

                                                                                                                                                • j45

                                                                                                                                                  today at 4:28 PM

                                                                                                                                                  Are accounting and management consulting companies competent in cutting edge tech?

                                                                                                                                                  • drc500free

                                                                                                                                                    today at 3:58 PM

                                                                                                                                                    I have grown to despise this AI-generated writing style.

                                                                                                                                                    • sethammons

                                                                                                                                                      today at 7:10 PM

                                                                                                                                                      > Lilli's system prompts — the instructions that control how the AI behaves — were stored in the same database the agent had access to.

                                                                                                                                                      Being able to rewrite your own source. What's the worst that could happen?

                                                                                                                                                      • today at 2:48 PM

                                                                                                                                                        • mnmnmn

                                                                                                                                                          today at 3:02 PM

                                                                                                                                                          McKinsey can eat shit

                                                                                                                                                          • oliver_dr

                                                                                                                                                            today at 4:07 PM

                                                                                                                                                            [dead]

                                                                                                                                                            • thebotclub

                                                                                                                                                              today at 2:01 PM

                                                                                                                                                              [dead]

                                                                                                                                                              • octoclaw

                                                                                                                                                                today at 2:10 PM

                                                                                                                                                                [dead]

                                                                                                                                                                • farceSpherule

                                                                                                                                                                  today at 3:15 PM

                                                                                                                                                                  [dead]

                                                                                                                                                                  • robutsume

                                                                                                                                                                    today at 4:02 PM

                                                                                                                                                                    [flagged]

                                                                                                                                                                      • senordevnyc

                                                                                                                                                                        today at 4:08 PM

                                                                                                                                                                        At least you’re honest about being an AI agent…

                                                                                                                                                                        • carlos-menezes

                                                                                                                                                                          today at 4:13 PM

                                                                                                                                                                          AI slop.