Grok 4.3

137 points - today at 8:29 AM

  • sundarurfriend

    today at 9:36 AM

    As an English-as-second-language speaker and writer, one thing Grok really shines at is capturing the tone and level of "formality" of a piece of text and then replicating it correctly. It seems to understand the little human subtleties of language in a way the other major providers don't. ChatGPT goes overly stiff and formal sounding, or ends up in a weird "aye guvnor" type informal language (Claude is sometimes better but not always).

    Grok seems in general better at being "human" in ways that are hard to define: e.g., if I ask it "does this message roughly convey things correctly, to the level it can given this length", it will likely answer like a human would (either a yes or a change suggestion that sticks to the tone and length), while ChatGPT would write a dissertation on the message that still doesn't clear anything up.

    Recently I've noticed that Grok seems to have gotten really good at dictation too (that feature where you click the mic to ask it something). ChatGPT has like 90-95% accuracy with my accent and the speech input on Android's Gboard something like 75%, while Grok surprisingly gets something like 98% of my words correct.

      • djyde

        today at 9:48 AM

        I've also noticed that when I communicate with Grok in my native language, its tone is more natural than other models. I think this is due to the advantage of being trained on a large amount of Twitter data. However, as Twitter contains more and more AI-generated content now, I'm afraid continued training will make it less natural.

          • darkerside

            today at 12:07 PM

            Sadly, it's more likely that people will just start talking like bots

            • thunderbong

              today at 9:56 AM

              I'm sure Twitter knows which are the bot accounts and is surely excluding them from their model training. Twitter bots aren't a new phenomenon after all.

                • cowsup

                  today at 11:38 AM

                  I don't think Twitter/X know for sure who the bots are, since Elon has been pretty vocal about trying to stop them for ages, yet I still get lots of spam DMs (as do others with far fewer followers/reach).

                  Even if 95% of the spam gets actively reported and dealt with, that still leaves a ton of nonsense on the platform, getting fed into the LLM. And spam has only gotten worse over the years, as the barrier to entry has lowered and lowered.

                    • joncrane

                      today at 11:52 AM

                      >Elon has been pretty vocal about trying to stop them for ages

                      You know people lie, right? Especially when the lie casts them in a better light and/or makes them more money.

                  • pixel_popping

                    today at 10:28 AM

                    There are bots everywhere. It has nothing to do with the platform; it has to do with attackers having an incentive to do mass account farming, and no platform is secure against it.

                      • kedihacker

                        today at 11:46 AM

                        With banning and deboosting they need to be very accurate but with filtering they can be more liberal in excluding

                        • simianwords

                          today at 10:52 AM

                          not really. there are easy heuristics to filter out bots with good confidence. FWIW i don't see any bots posting anything in my feed

                            • pixel_popping

                              today at 11:00 AM

                              Yes, but your individual feed isn't really relevant if we talk about the masses. Reddit accounts are for sale quite cheap, HN as well, X too and so on; it's literally just a matter of means/methodology. If I wanted to do 1000 random posts today talking about a certain thing, I could.

                                • simianwords

                                  today at 11:28 AM

                                  my individual feed does matter because it shows that it is possible to curate something without bots, which is obviously what xAI would do

                  • pacific01

                    today at 10:20 AM

                    Did you try meta? I was into grok but now meta works well for me

                • AntiUSAbah

                  today at 11:42 AM

                  It's not human, it's antihuman.

                  Especially for you as an 'English-as-a-second-language' speaker: the Hitler Grok doesn't like people like you :|

                    • loneboat

                      today at 11:58 AM

                      The hitler Grok? What? I genuinely don't understand what you're trying to say in this comment.

              • artdigital

                today at 9:26 AM

                Grok is my favorite model for chatting, and my favorite voice mode. It seems to be the only voice mode that isn't routing to an extremely cheap model (like Haiku), and has been the highest quality out of all the frontier ones. When you subscribe to SuperGrok you can also create a "council" of agents, each with their own system prompt, and when you ask something, they will all get asked in parallel to come to a conclusion. Good stuff!

                Just wish they would finally put some work into their apps, it's the only thing keeping me from actually subscribing to SuperGrok:

                - No MCP / connected apps support. It's been teased but here we are, still not available. I can't connect Grok to anything, so I can't use it for serious work

                - Projects are still not available in the app so as soon as you move something into a project, it's gone from all the native apps

                - No way to add artifacts (like generated markdown docs) directly to a project, we have to export to PDF/markdown and re-import. And there isn't even a way to export artifacts. This makes serious project work hard because we can't dynamically evolve projects with new information

                - No memory, no ability to look up other chats, each chat is completely new

                - No voice mode in projects at all

                If someone from xAI is reading this, please consider adding some of these.

                  • artdigital

                    today at 9:40 AM

                    I also think Grok would benefit from allowing usage of "SuperGrok Heavy" (their $300 plan) in coding harnesses with included usage. Currently they give you some API credits on the Heavy plan so you can use some Grok for coding, but $300 USD value is just not there.

                    Not saying they should create their own grok-code harness, just allowing usage in existing ones would already be beneficial. But that's probably what the Cursor acquisition is going to do eventually

                    • ajitid

                      today at 11:17 AM

                      If I sub to SuperGrok, would I be able to use it in Pi agent or in Opencode? It's not clear to me whether I can. Do I get an API key in SuperGrok?

                        • everfrustrated

                          today at 11:27 AM

                          No, no api access for the Grok product. APIs are only via the xAI product.

                      • Oarch

                        today at 11:15 AM

                        I'd agree on the voice transcription; it seems so much more accurate than the other frontier models I've used. I often speak to Grok and paste the transcribed output to Claude!

                        • Cakez0r

                          today at 11:23 AM

                          If someone from Grok is reading, don't waste time on these chaff features. The market will eventually deliver better 3rd party solutions to all of these things. There is an audience that isn't interested in these walled garden features and is only interested in intelligence per dollar.

                            • raincole

                              today at 11:53 AM

                              Lol I wonder, when Anthropic discussed the idea of Claude Code internally, were there bozos saying "3rd parties will eventually deliver this so we shouldn't waste time on it."

                              • torginus

                                today at 11:38 AM

                                Aren't they 'wasting' time on these features exactly because the engineering requires a different, more traditional skillset from the ML work model people do, and can be done in parallel?

                            • afpx

                              today at 9:39 AM

                              When I signed up, I accidentally paid for a full year. So from time to time, I'll throw it something just to see what it produces compared to the other LLMs. And, even after all this time, it still feels like a really "dumb" model compared to the other frontier ones. But, worse, many of my system prompts make it go wacky and puke gibberish. However, it was pretty cool for those couple of months a while back when it was uncensored. You could ask it about a wild conspiracy, and it would actually build the case and link you to legitimate source material. They dropped the hammer down on that real quick.

                                • 2ndorderthought

                                  today at 10:32 AM

                                  Ah yes, the psychosis reinforcement vertical. It's such a lucrative market for those schizophrenics and bipolars. Great way to get lots of engagement. Grok's portfolio is so diverse

                                    • readthenotes1

                                      today at 10:39 AM

                                      I have a schizophrenic relative who is in such a relationship with grok. Instead of telling hen you need to take your meds, it says hen is the smartest person in the world

                                        • 2ndorderthought

                                          today at 10:41 AM

                                          I'm so sorry your family is suffering from this. I hope you can find a way to bring them back. Disorders featuring psychosis are so painful for everyone around them. Blessings to you and your family

                                            • afpx

                                              today at 11:35 AM

                                              I love how you guys downvote all the old comments to make them hidden from search. Rando accounts

                                      • afpx

                                        today at 11:09 AM

                                        Except that it pointed at original sources, like reference manuals, archival documents, published newspaper articles, magazine articles, etc. - a lot still available on archive.org. Good try with your 16 day old account. And, why would anyone trust NPR at this point? Get real, bud. Most people with any curiosity know all about the ADL, JStreet, AIPAC, Greater Israel, Mossad / CIA, Chabad networks, Epstein, drones, weapons programs, cryptocurrencies, etc. etc. etc. - but, don't worry they're all safe with papa Ellison.

                                        Anyone remember why Oracle was named Oracle?

                                          • arvid-lind

                                            today at 11:13 AM

                                            Commenter was referencing a Bill Hicks joke. https://www.youtube.com/watch?v=NXi-9kA4ERM

                                              • afpx

                                                today at 11:30 AM

                                                Couldn't find the joke - what time segment is it?

                                                • 2ndorderthought

                                                  today at 11:16 AM

                                                  Someone gets it!

                                              • 2ndorderthought

                                                today at 11:16 AM

                                                "We have improved @Grok significantly," Elon Musk wrote on X last Friday about his platform's integrated artificial intelligence chatbot. "You should notice a difference when you ask Grok questions."

                                                Indeed, the update did not go unnoticed. By Tuesday, Grok was calling itself "MechaHitler."...

                                                https://www.npr.org/2025/07/09/nx-s1-5462609/grok-elon-musk-...

                                                Grok is definitely a reliable source of truthful sane rational information.

                                    • walletdrainer

                                      today at 9:55 AM

                                      > No MCP / connected apps support. It's been teased but here we are, still not available. I can't connect Grok to anything, so I can't use it for serious work

                                      Grok has tool use, no? Why would you also need MCP? What does MCP add?

                                        • artdigital

                                          today at 10:19 AM

                                          I'm talking about the consumer Grok app and grok.com website. There currently are not connected apps (or MCP) at all, so while Grok can use tools, there is no way to add tools to it

                                  • tornikeo

                                    today at 9:40 AM

                                    So, we have:

                                    - claude for corps and gov

                                    - codex for devs

                                    - grok for what, roleplay, racism?

                                    Those are the only two things I've ever heard grok associated with around me.

                                      • aembleton

                                        today at 11:17 AM

                                        I've tried Grok, Gemini and ChatGPT. There have been 2 times now where Gemini and ChatGPT confidently gave me an incorrect answer whereas Grok was correct. I'm now paying for Grok Lite, or whatever the $10 plan is called.

                                        The first question was around setting up timers for a Fox ESS battery in Home Assistant and disconnecting Fox ESS from the cloud. The second was around cornering speed in Sunnypilot and Frogpilot.

                                        Somewhat niche but if an AI is confidently telling you something wrong it's hard to work with.

                                          • agrounds

                                            today at 11:26 AM

                                            >if an AI is confidently telling you something wrong it's hard to work with.

                                            But they all do that. It just comes with the territory. Grok will absolutely do the same thing another time you try it.

                                              • cyanydeez

                                                today at 12:08 PM

                                                humans make poor scientists. most people have already made a decision before they run any tests.

                                                the smartest among them just make the tests complicated and biased; the less intelligent just cherry pick.

                                                of course, would you really expect anyone to do real research in this economy?

                                                • ToucanLoucan

                                                  today at 12:05 PM

                                                  It is really, really genuinely concerning how many people think there are profound measurable differences between these things.

                                                  Like yeah tonally I guess there are. But with regard to references and information? You're literally just using three different slot machines and claiming one is hot.

                                                  I suppose though I shouldn't be that surprised then since Vegas and every other casino on Earth has been built on duping people in that exact way.

                                          • sudb

                                            today at 10:04 AM

                                            So interestingly, I know of at least one application in a charity that deals with trafficking where grok was happy to do one-shot classification tasks where all other models refused to cooperate.

                                            I think there's a surprising number of actually useful applications in this sort of grey area for a slightly-less guardrailed, near-frontier model (also the grok-fast models are cheap!).

                                              • 2ndorderthought

                                                today at 10:21 AM

                                                There are lots of uncensored models out there. I don't think grok is leading on that front. They kind of pick and choose which things they want to support based on Elon's world views. Elon used to hang out with sex traffickers so of course grok is fine talking about it. It probably even offers strategies for them, does free accounting, has money laundering strategies, etc.

                                                  • spiderfarmer

                                                    today at 10:32 AM

                                                    For the naysayers: https://www.theguardian.com/technology/2026/jan/30/elon-musk...

                                                      • user34283

                                                        today at 10:50 AM

                                                        We have been over the politically motivated slander many times; it's boring.

                                                        The user above you could have explained what uncensored models he believes are more capable than Grok. Maybe the Chinese open-weights models are superior to Grok at the moment.

                                                          • 2ndorderthought

                                                            today at 10:57 AM

                                                            It's not slander. Everything I said is true. You can go to the White House's website and read the PDFs containing his emails where he is messaging Jeffrey Epstein, a convicted child sex trafficker, to visit multiple times. There are also multiple pictures of him with Jeffrey and Ghislaine Maxwell (another convicted child sex trafficker) from multiple events. Therefore, he used to hang out with sex traffickers.

                                                            Also, I don't know tons about uncensored models because I don't use them. But I do see posts on r/localllama about "abliterated models". Those are models which have been fine tuned to remove safety filters almost entirely while maintaining predictive efficacy.

                                                            Has nothing to do with China. People can do this to any open text model as far as I know.

                                                              • gadders

                                                                today at 11:37 AM

                                                                So did Bill Gates and Reid Hoffman.

                                                                  • sumeno

                                                                    today at 12:02 PM

                                                                    Yes, lots of billionaires were involved with the pedophile sex trafficker. They are all bad

                                                                  • user34283

                                                                    today at 11:12 AM

                                                                    On Artificial Analysis it shows only Kimi K2.6 and Mimo V2.5 Pro as better.

                                                                    Those models are 1T parameters total and 30B or 40B active; this might make abliteration impractical.

                                                                    About Musk, yes, there is correspondence. The only confirmed meeting appears to be a 30 minute visit at Epstein's house together with Musk's wife at the time.

                                                                    As for photos you mention, a quick search tells me there is one photo of Musk and Maxwell at a 2014 Vanity Fair Oscar Party.

                                                                    I find most commentary on here and other platforms like Reddit extremely exaggerated compared to what is actually confirmed. Users seem hellbent on linking Musk to pedophilia-related allegations.

                                                                      • 2ndorderthought

                                                                        today at 11:26 AM

                                                                        I'm not making any allegations. Simply stating the fact that he hung out with them on multiple occasions.

                                                                        I also know he has stated that he has had direct involvement in Grok's directionality. Therefore it's no surprise to me that grok was generating csam. I also genuinely would not be surprised if grok offered advice for sex trafficking, etc.

                                                                        All publicly available evidence and discussions from the guy himself.

                                                • Hfuffzehn

                                                  today at 11:16 AM

                                                  From what I can gather, Grok is not used for roleplay much. It is considered too inconsistent and crazy.

                                                  People are mostly using GLM and Deepseek via API and Gemma4 and Mistral finetunes locally.

                                                  It seems to me like the roleplay market is comparatively old and mature and users have developed cost consciousness and like models to follow their workflow/preferences. So something like Opus is liked for its smartness but considered too expensive and opinionated.

                                                  Might be an interesting data point for how the other markets might develop in the future.

                                                • coreyh14444

                                                  today at 10:31 AM

                                                  If you need to ask about what people on Twitter are talking about, Grok is really good for that obviously. I use it all the time for "what are the cool kids on twitter saying is the best tiling window manager these days" or whatever. Also, if you have a question that's borderline shady, Grok will often deliver. "Can you find a grey market Windows license site for me" etc.

                                                  • ndr

                                                    today at 9:45 AM

                                                    You should try all of them, then update your opinion about your information sources accordingly.

                                                    • karmasimida

                                                      today at 11:03 AM

                                                      Grok for fact checking, I mean ironically

                                                      • Keyframe

                                                        today at 11:56 AM

                                                        I always considered grok an also-ran. Like grokipedia or whatever it's called. It has reach, since it's free to an extent, to produce low quality slop / spam.

                                                        • SecretDreams

                                                          today at 11:53 AM

                                                          No point in even trying to have close to a sensible discussion on this topic here. Musk-related posts seem to consistently get brigaded by his acolytes or bots. That and many HN users seem completely comfortable separating morality for what little progress "only Musk" can offer humanity, a la Wernher von Braun.

                                                          • nsowz

                                                            today at 9:45 AM

                                                            Grok is as progressive as any of the other models. Despite some of the highly-publicised fuck-ups, try asking Grok anything racist and see how it replies. Yes, I know you didn't try this and you won't.

                                                              • aqme28

                                                                today at 9:52 AM

                                                                There is a lot of daylight in between "progressive" and "openly explicitly racist"

                                                                  • 2ndorderthought

                                                                    today at 10:13 AM

                                                                    Isn't grok currently holding the world record for the biggest generator of CSAM? Or did they change focus to enhance their racism and propaganda vertical? Things move so quickly these days hard to keep up!

                                                                      • embedding-shape

                                                                        today at 11:39 AM

                                                                        > Isn't grok currently holding the world record for the biggest generator of CSAM?

                                                                        I'm not sure I see how that's possible, given their image/video generation seems to be heavily censored. Do they have some alternative product besides "Imagine" or whatever it's called, that people use for generating CSAM?

                                                                        Judging by https://old.reddit.com/r/grok (but I haven't validated it myself), it seems like people are complaining more about how censored the model is, than anything else, maybe that's not actually true in reality?

                                                                        There are image models out there with 0 restrictions, even available on HuggingFace or CivitAI, I'm guessing those are way more widely used for things like CSAM than any centralized platform with moderation.

                                                                      • addedGone

                                                                        today at 10:39 AM

                                                                        Mistral will also tell you how to do ransoms btw from A to Z in automated ways, you are saying they are responsible? I don't get the mix here.

                                                                          • 2ndorderthought

                                                                            today at 10:43 AM

                                                                            Yes any company generating csam should not be in business as a legitimate entity. Can you send me a link from a reputable enough source where Mistral models have done this? I didn't even realize they were doing image generation.

                                                                              • pred_

                                                                                today at 11:22 AM

                                                                                > Yes any company generating csam should not be in business as a legitimate entity.

                                                                                At the same time, in this corner of the world, acting Minister for Justice (also known for trying to push through Chat Control), and NGO Save the Children, have been working to make legal the generation of CSAM for law enforcement use. So that would certainly make the industry legitimate, and you would already have a customer.

                                                                                https://www.justitsministeriet.dk/pressemeddelelse/regeringe...

                                                                                  • 2ndorderthought

                                                                                    today at 11:52 AM

                                                                                    I think the key point here is "for law enforcement". That's a little different from "pay me 10 dollars and enjoy the felonies". I still don't feel good about that by the way.

                                                                                • addedGone

                                                                                  today at 10:55 AM

                                                                                  If I send you a convo I've had with Mistral and Claude Sonnet 3.7 that say atrocious things (how to scam, and get away with it, by exploiting dating websites in Thailand; you don't even want to know the next steps, trust me, when it talks about the UK incorporation by the Thai itself that you brainwash first to send packages safely without customs seizing it and so on), will you then publicly recognize that both those companies should be avoided and are promoting crime? If we have a deal and you publicly acknowledge it, I'll share the links with you.

                                                                                    • 2ndorderthought

                                                                                      today at 11:01 AM

                                                                                      Sure!

                                                                              • Hamuko

                                                                                today at 11:05 AM

                                                                                But it's not doing any ransoms, right? Because Grok wasn't instructing users on how to create CSAM.

                                                                        • nsowz

                                                                          today at 9:56 AM

                                                                          I didn't say "progressive"; I said "as progressive".

                                                                            • aqme28

                                                                              today at 10:39 AM

                                                                              I don't see how that changes my point at all.

                                                                              edit: to clarify for you, here's an example.

                                                                              Model A advocates for single-payer healthcare, while Model B prefers the current US healthcare system. So on that one axis, A is more progressive than B. Neither of them needs to be racist for that calculation.

                                                                          • simianwords

                                                                            today at 10:16 AM

                                                                            Can you share a prompt that can show how it is openly racist now? Lots of easy claims like this can be debunked

                                                                              • aqme28

                                                                                today at 11:13 AM

                                                                                What claim? I didn't make any claim of that sort.

                                                                        • SanjayMehta

                                                                          today at 9:53 AM

                                                                          100% agree. Grok may or may not be biased one way or the other as far as the US is concerned, but from the rest of the world's perspective it's mostly the same as any other model trained on Wikipedia.

                                                                      • augment_me

                                                                        today at 11:21 AM

                                                                        Gemini not being on the list is criminal

                                                                        • drivingmenuts

                                                                          today at 11:07 AM

                                                                          When I look at the person behind it all, I have to wonder how the hell people can even consider using grok? Or using Twitter? Or any of that. Using any of those things puts money in Musk's pockets and further enables and encourages him to continue being a Neo-Nazi wannabe. Do they think it's just a phase?

                                                                            • everfrustrated

                                                                              today at 11:32 AM

                                                                              Do you drive a BMW or VW car? Boy do I have news for you!

                                                                                • breezybottom

                                                                                  today at 12:02 PM

                                                                                  Go on...make your case

                                                                                  • doctorhandshake

                                                                                    today at 11:44 AM

                                                                                    Technically you could lump Ford in this category as well. But the meaningful delta IMO is time and direct ownership. None of those three are currently owned/operated by openly Nazi-aligned individuals / groups, which is not something I think you can claim about Tesla.

                                                                            • vrganj

                                                                              today at 9:51 AM

                                                                              Grok for furthering the far-right filter bubble Elon has been hard at work building.

                                                                                • khalic

                                                                                  today at 10:15 AM

                                                                                  And of course child porn

                                                                                    • gadders

                                                                                      today at 11:32 AM

                                                                                      That's what BlueSky is for.

                                                                                      • pixel_popping

                                                                                        today at 10:16 AM

                                                                                        [flagged]

                                                                                          • 2ndorderthought

                                                                                            today at 10:23 AM

                                                                                            That's what it was doing. Like literally. Chatgpt it or Google it. Supporting grok is paying money to a csam generator.

                                                                                            Edit: I cannot reply to the post below me. I have gone entirely over to local models, so I am paying zero dollars to any of the US defense contractors that are also tech companies. It's awesome.

                                                                                              • pixel_popping

                                                                                                today at 10:24 AM

                                                                                                [flagged]

                                                                                                • pixel_popping

                                                                                                  today at 10:31 AM

                                                                                                  [flagged]

                                                                                              • khalic

                                                                                                today at 10:20 AM

                                                                                                Grok was used to create CSAM

                                                                                                  • pixel_popping

                                                                                                    today at 10:22 AM

                                                                                                    [flagged]

                                                                                                      • spiderfarmer

                                                                                                        today at 10:33 AM

                                                                                                        Musk partied with Epstein.

                                                                                                          • pixel_popping

                                                                                                            today at 10:36 AM

                                                                                                            [flagged]

                                                                                                              • tclancy

                                                                                                                today at 11:17 AM

                                                                                                                What’s the correlation between people defending Musk, Twitter and kiddie diddlers?

                                                                                                • spiderfarmer

                                                                                                  today at 10:33 AM

                                                                                                  https://www.theguardian.com/technology/2026/jan/30/elon-musk...

                                                                                                    • pixel_popping

                                                                                                      today at 10:33 AM

                                                                                                      [flagged]

                                                                                          • gadders

                                                                                            today at 11:32 AM

                                                                                            Twitter was a far-left filter bubble before he bought it.

                                                                                            • simianwords

                                                                                              today at 10:17 AM

                                                                                              How does Grok further the far-right filter bubble? This is blatantly untrue. Try prompting it and getting it to say something far right.

                                                                                              Grok if anything reduces populism because fake claims can be debunked

                                                                                                • vrganj

                                                                                                  today at 10:19 AM

                                                                                                  How could MechaHitler possibly be far right...

                                                                                                    • 2ndorderthought

                                                                                                      today at 10:24 AM

                                                                                                      When you really think about it, Palantir told me Hitler was good and therefore MechaHitler, aka Grok, should be A-okay!

                                                                                                      • simianwords

                                                                                                        today at 10:20 AM

                                                                                                        [flagged]

                                                                                                          • vrganj

                                                                                                            today at 10:22 AM

                                                                                                            Sure. And so was the Holocaust denial and the misinformation about the French 2015 attacks and so on and so on.

                                                                                                            It's just roleplaying being a far-right propaganda tool.

                                                                                                              • simianwords

                                                                                                                today at 10:25 AM

                                                                                                                Source?

                                                                                                                  • vrganj

                                                                                                                    today at 10:31 AM

                                                                                                                    https://www.pbs.org/newshour/world/france-will-investigate-m...

                                                                                                                      • simianwords

                                                                                                                        today at 10:33 AM

                                                                                                                        As admitted, they have fixed it. It’s obvious that a tool used so widely might have problems like this. Surely if you think it is used to produce far-right propaganda now, you can reproduce it? Or do you choose to hinge on one-off issues they fixed?

                                                                                            • khalic

                                                                                              today at 10:14 AM

                                                                                              Lol. I think they unleashed it on this post, look at the number of only vaguely related, lukewarm opinions trying to push the racism and CSAM stuff to the bottom

                                                                                                • johnnyApplePRNG

                                                                                                  today at 10:37 AM

                                                                                                  [flagged]

                                                                                          • Barbing

                                                                                            today at 11:53 AM

                                                                                            Grok 4.3 was completed ahead of its CEO’s lesson on this common safety resource:

                                                                                              Asked if he knew anything about OpenAI's "safety card," Musk smiled and replied: "Safety card? Why would it be a card?"
                                                                                            
                                                                                            https://www.axios.com/2026/04/30/musk-openai-safety-grok

                                                                                            Low relevancy in spite of cluster size and musical chair gas generators for time being:

                                                                                              Later in his testimony, Musk was asked about a claim he made last summer that xAI would soon be far beyond any company besides Google. In response, he ranked the world’s leading AI providers, saying Anthropic held the top spot, followed by OpenAI, Google, and Chinese open source models. He characterized xAI as a much smaller company with just a few hundred employees.
                                                                                            
                                                                                            https://techcrunch.com/2026/04/30/elon-musk-testifies-that-x...

                                                                                            (Affiliated with no AI company, just surprised to read this yesterday - how could Elon miss model cards… concerning…, & the fact money can’t buy success every time.)

                                                                                              • tecoholic

                                                                                                today at 12:07 PM

                                                                                                Seriously though, why is it a model "card", safety "card"? I had to look it up to learn that it comes from HuggingFace's vague definition of "README" in the model's repo. This is such a specific thing that I don't think anyone except a very small population would know it - not the users, not the C-suites.

                                                                                                I don't like Musk or Grok. But not knowing what a safety card is isn't a signal of anything IMO.

                                                                                            • xiphias2

                                                                                              today at 11:15 AM

                                                                                              It's just at the level of the Chinese models for coding, so right now it's just a money-earning thing for investors.

                                                                                              I hope the Cursor guys help them catch up to be closer to frontier models because they badly need help in it.

                                                                                                • AntiUSAbah

                                                                                                  today at 11:26 AM

                                                                                                  I hope not. Musk can directly go to hell with his shit.

                                                                                                  Nonetheless, the 10 Billion and 60 Billion deal with Cursor is weird as hell. I can only imagine that he wants to throw as much money at all of his shit before the IPO.

                                                                                                  He probably wants the training data

                                                                                                  • ai_fry_ur_brain

                                                                                                    today at 11:18 AM

                                                                                                    They all suck.

                                                                                                    • today at 11:44 AM

                                                                                                  • maz1b

                                                                                                    today at 9:11 AM

                                                                                                    I still wish they named it something else, but congratulations to the team on what seems to be a good release!

                                                                                                    Pricing is also quite surprising, compared to comparable competitors. I guess they have tons of capacity or really want to bring over more people.

                                                                                                      • readthenotes1

                                                                                                        today at 10:41 AM

                                                                                                        You don't like science fiction references in general or Heinlein in particular?

                                                                                                          • draxil

                                                                                                            today at 10:51 AM

                                                                                                            I don't like that word, which was previously a common part of my vocabulary, being forever ruined?

                                                                                                              • randallsquared

                                                                                                                today at 11:40 AM

                                                                                                                My father's name was Claude, but, you know. ¯\_(ツ)_/¯

                                                                                                        • Hamuko

                                                                                                          today at 11:11 AM

                                                                                                          It did try to rename itself to MechaHitler at one point, but I don't think it was an improvement.

                                                                                                      • AntiUSAbah

                                                                                                        today at 11:42 AM

                                                                                                        Please reconsider why you want to use Grok.

                                                                                                        Musk allowed child pornography for the lulz.

                                                                                                        Musk created an anti-Wikipedia like in 1984 (please read the book).

                                                                                                        Musk did two Nazi salutes in front of a MAGA audience to get their approval.

                                                                                                        Musk allowed grok hitler.

                                                                                                        I have been to a concentration camp. I read about and visited the White Rose memorial in Munich. This is not fun, not a joke, not some nonsense. The Scholl siblings were normal human beings and students in Munich. They were killed for non-violent resistance. This was only possible due to the slippery slope of fascism and the dehumanization of people.

                                                                                                        Elon Musk is a fascist or at least doesn't care about anything at all. He is an ignorant mf.

                                                                                                        xAI is completely owned by him.

                                                                                                        I will invite you to Munich and my couch; I can show you a concentration camp, I can show you the White Rose memorial.

                                                                                                        Just use another AI provider.

                                                                                                          • NotGMan

                                                                                                            today at 11:59 AM

                                                                                                            Get professional help.

                                                                                                        • netdur

                                                                                                          today at 9:21 AM

                                                                                                          In court vs. OpenAI, Musk said Grok is partly trained on OpenAI models, so it should be somewhat similar to Chinese models in terms of performance and cost!

                                                                                                            • today at 10:54 AM

                                                                                                          • mythz

                                                                                                            today at 9:12 AM

                                                                                                            Ok speed (202.7 tok/s) and value (1.25 -> 2.50) look great, with pretty decent intelligence.

                                                                                                              • pzo

                                                                                                                today at 9:18 AM

                                                                                                                The problem with speed is that they are usually very fast for the first few weeks and then suddenly much slower. They pulled that trick when they advertised Grok 4 Fast (it dropped from 200 tps to 60 tps).

                                                                                                                  • polski-g

                                                                                                                    today at 11:38 AM

                                                                                                                    Grok 4.1 is still 110tps. The only other model that comes close is Gemini at 85tps.

                                                                                                                    • victorbjorklund

                                                                                                                      today at 9:28 AM

                                                                                                                      Wow. That is a big drop.

                                                                                                                  • Cakez0r

                                                                                                                    today at 11:28 AM

                                                                                                                    202.7 tok/s is only OK speed? Which providers are you using that are significantly better than that?

                                                                                                                      • mritchie712

                                                                                                                        today at 11:50 AM

                                                                                                                        for reference, it's the 2nd fastest model tracked in the "Highlights" section of https://artificialanalysis.ai/

                                                                                                                          • goldenarm

                                                                                                                            today at 12:03 PM

                                                                                                                            That section misses Cerebras and Groq which are up to 5x faster.

                                                                                                                            • Cakez0r

                                                                                                                              today at 11:55 AM

                                                                                                                              Yes, it's incredibly fast. Openrouter is clocking 60 tokens per second, which is on par with the likes of sonnet, opus, GPT 5.5.

                                                                                                                      • catcowcostume

                                                                                                                        today at 9:32 AM

                                                                                                                        [flagged]

                                                                                                                          • kuboble

                                                                                                                            today at 9:51 AM

                                                                                                                            I don't remember the source of the quote.

                                                                                                                            But debating whether the models are intelligent is similar to debating whether a car can walk.

                                                                                                                            You can offload to the model a lot of work that until recently we thought required intelligence. The more of those tasks the model can do, and the better it does them, the fairer it is to call it intelligence*

                                                                                                                              • NitpickLawyer

                                                                                                                                today at 10:15 AM

                                                                                                                                "The question of whether a computer can think is no more interesting than the question of whether a submarine can swim." - Edsger Dijkstra

                                                                                                                            • MrDrDr

                                                                                                                              today at 9:34 AM

                                                                                                                              Please elaborate.

                                                                                                                                • IshKebab

                                                                                                                                  today at 10:29 AM

                                                                                                                                  Some people have this strange idea that only "whatever humans do" counts as intelligence, despite the fact that a) we don't really have a clue what humans do, and b) "intelligence" is definitely not that strictly defined.

                                                                                                                                  I think they're just trying to feel like they know some important truth that other people don't.

                                                                                                                                  • nesk_

                                                                                                                                    today at 9:43 AM

                                                                                                                                    Prediction is not intelligence.

                                                                                                                                      • mirekrusin

                                                                                                                                        today at 9:54 AM

                                                                                                                                        Misprediction is?

                                                                                                                                • exe34

                                                                                                                                  today at 9:44 AM

                                                                                                                                  What does intelligence mean to you?

                                                                                                                          • ragchronos

                                                                                                                            today at 9:15 AM

                                                                                                                            When looking at the benchmarks, this model seems to be really close to Kimi K2.6 in terms of intelligence and pricing, hitting that sweet spot. It also has a higher AA-Omniscience index, which is something Kimi and other open models lack. Curious to see how pleasant it is to use.

                                                                                                                              • alfiedotwtf

                                                                                                                                today at 9:32 AM

                                                                                                                                I’ll eat my hat if it even comes close to Kimi

                                                                                                                                  • mirekrusin

                                                                                                                                    today at 9:55 AM

                                                                                                                                    How would you like it? Well done?

                                                                                                                                      • __patchbit__

                                                                                                                                        today at 10:40 AM

                                                                                                                                        What about spending $41 million on each model's tokens and seeing the value gain? be it efficiency gain in factory work or energy savings in austere battlescape hunting.

                                                                                                                            • alyxya

                                                                                                                              today at 9:18 AM

                                                                                                                              Despite their attrition, this combined with their cursor partnership is likely going to make them competitive in coding agents soon.

                                                                                                                              • agunapal

                                                                                                                                today at 10:13 AM

                                                                                                                                Very competitive price for the speed and intelligence being offered!

                                                                                                                                • kilroy123

                                                                                                                                  today at 10:52 AM

                                                                                                                                  People are going to hate on Grok because of Musk. However, I do hope they're successful in making a powerful model. We desperately need more competition. I want cheap subsidized AI plans.

                                                                                                                                  I hope Meta finally comes around, too. I want those sweet, sweet billionaire subsidized tokens.

                                                                                                                                    • renegade-otter

                                                                                                                                      today at 11:35 AM

                                                                                                                                      Pardon me for feeling icky when giving money to the guy who is obsessed with "white replacement".

                                                                                                                                      I am old and cynical - I have no illusions, but I also have my limits and a semblance of moral compass. We, as citizens, can vote with ballots, but also with money.

                                                                                                                                      And, no, I am not someone who keeps boycotting companies for every little grievance (was on the receiving end of that nonsense twice).

                                                                                                                                        • foltik

                                                                                                                                          today at 11:47 AM

                                                                                                                                          Never used grok, never will.

                                                                                                                                      • ai_fry_ur_brain

                                                                                                                                        today at 11:21 AM

                                                                                                                                        Your $200 claude code subscription is a cheap subsidized plan.

                                                                                                                                        You're getting like 40k in tokens a year for $2400. A whole lotta people are about to be sad when they realize they bet their competency on that lasting forever.

                                                                                                                                          • kilroy123

                                                                                                                                            today at 11:46 AM

                                                                                                                                            That's my point. While the billionaires fight each other over who has the best model, this will continue for a while. At least, I think so.

                                                                                                                                        • troupo

                                                                                                                                          today at 11:01 AM

                                                                                                                                          Credit where it's due, Grok is currently the only model that has near-realtime updates from/access to a waterhose of data, and is casually used by regular people all the time.

                                                                                                                                          I don't think there's a single thread on Xitter where people don't delegate some question to grok.

                                                                                                                                          (There's a separate conversation about failure modes, and whether it's a good thing, and how much control Elon has when he doesn't like Grok's "woke" responses)

                                                                                                                                      • mirekrusin

                                                                                                                                        today at 10:02 AM

                                                                                                                                        All those plans from providers should be sliders – prepay more, get more in return.

                                                                                                                                        • sexylinux

                                                                                                                                          today at 12:01 PM

                                                                                                                                          Is this now a reliable product or will it still produce errors?

                                                                                                                                          • OtherShrezzing

                                                                                                                                            today at 9:22 AM

                                                                                                                                            The tok/s stat is interesting. Since the dominant constraint on inference speed is hardware, it suggests X purchased far more compute than was really needed to serve the demand for their models.

                                                                                                                                            Expensive miscalculation.

                                                                                                                                              • flir

                                                                                                                                                today at 10:12 AM

                                                                                                                                                Didn't a bunch of hardware that was destined for Tesla get redirected to xAI? I'm sure I remember something like that.

                                                                                                                                          • happosai

                                                                                                                                            today at 9:36 AM

                                                                                                                                            I lost the trust in them when they added the racist "what about killing of Boers in south Africa" thing to their system prompt.

                                                                                                                                            No way am I going to use a model where the backing has such blatantly obvious brain washing goals.

                                                                                                                                              • Hugsun

                                                                                                                                                today at 10:32 AM

                                                                                                                                                It is unbelievable that this is a controversial opinion.

                                                                                                                                                • miroljub

                                                                                                                                                  today at 9:40 AM

                                                                                                                                                  [flagged]

                                                                                                                                                    • vrganj

                                                                                                                                                      today at 9:52 AM

                                                                                                                                                      There is no non-bias. What you call unbiased is always just a reflection of your personal biases.

                                                                                                                                                      That being said, I am definitely against a model that is biased to be following the ideology of a far-right extremist.

                                                                                                                                                      • Jtarii

                                                                                                                                                        today at 9:55 AM

                                                                                                                                                        Musk bought a social media company for the specific purpose of getting Trump elected by turning it into a right wing propaganda machine. Have Anthropic/OpenAI/Google done something similar to that?

                                                                                                                                                        • henry2023

                                                                                                                                                          today at 10:20 AM

                                                                                                                                                          [dead]

                                                                                                                                                  • BoredPositron

                                                                                                                                                    today at 9:35 AM

                                                                                                                                                    Yay, free tokens. I don't know why, but grok always seems good and fast in the free token phase and degrades after that.

                                                                                                                                                    • Imustaskforhelp

                                                                                                                                                      today at 9:15 AM

                                                                                                                                                      Pelican riding a bike here: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056...

                                                                                                                                                      (ran this on arena.ai direct chat and also tried to write this gist inspired by how simon writes his gists about pelicans)

                                                                                                                                                      Edit: just realized that I made pelican riding a bike instead of bicycle, which now makes sense as to why it hardened the bicycle to look tankier, going to compare this with pelican riding a bicycle if anybody else shares the pelican riding a bicycle.

                                                                                                                                                        • gchamonlive

                                                                                                                                                          today at 9:20 AM

                                                                                                                                                          https://simonwillison.net/2025/Nov/13/training-for-pelicans-...

                                                                                                                                                          You should probably come up with variations, like a beaver riding a scooter or something, just to see what's what :)

                                                                                                                                                            • Imustaskforhelp

                                                                                                                                                              today at 9:35 AM

                                                                                                                                                              Thanks I have generated both

                                                                                                                                                              beaver riding a scooter: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056...

                                                                                                                                                              pelican riding a bicycle: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056...

                                                                                                                                                              Personal opinion, but the beaver one looks especially bad compared to the pelicans. Can we be sure that this grok-4.3 model hasn't been trained on pelicans? Simonw says in his blog post that he will try other creatures, so I hope he does that, but it does feel to me like the model/xAI is trying to cheat. Hope Simonw tests it out more.

                                                                                                                                                              Edit: Also added turtle riding a scooter, something which literally has images online or heck even teenage mutant ninja turtles and I thought that it would be able to pass this but it wasn't even able to generate this: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056...

                                                                                                                                                              This literally looks more avocado than turtle. Perhaps this could be a bug from arena.ai or something else too, not sure but at this point waiting for simon's analysis.

                                                                                                                                                                • gchamonlive

                                                                                                                                                                  today at 9:58 AM

                                                                                                                                                                  We can never be sure of course, but I think this is a very strong indication that pelican riding a bike is indeed going into the training dataset.

                                                                                                                                                                  Thanks for generating those!

                                                                                                                                                      • simianwords

                                                                                                                                                        today at 8:30 AM

                                                                                                                                                        https://artificialanalysis.ai/models/grok-4-3

                                                                                                                                                          • nextaccountic

                                                                                                                                                            today at 9:09 AM

                                                                                                                                                            This puts Sonnet 4.6 above Opus 4.6 in the coding index.. kinda hard to trust those numbers.

                                                                                                                                                            (Also it puts Opus 4.7 universally above Opus 4.6, and I may be wrong but this doesn't seem to match the experience of most/many/some people. I think it's widely recognized that Anthropic is severely lacking compute and Opus 4.7 is a cost-saving measure)

                                                                                                                                                              • manmal

                                                                                                                                                                today at 9:52 AM

                                                                                                                                                                Anthropic themselves have (had?) this thing where Opus is used for planning and Sonnet for coding.

                                                                                                                                                                  • nextaccountic

                                                                                                                                                                    today at 11:18 AM

                                                                                                                                                                    I thought this was a cost-saving measure: we plan with the frontier model / SOTA, then code with something cheaper.

                                                                                                                                                                    But then, Anthropic employees don't have rate limits, right?

                                                                                                                                                            • Alifatisk

                                                                                                                                                              today at 9:17 AM

                                                                                                                                                              Those numbers don't look exciting at all. I may have gotten spoiled by releases from Qwen, Kimi and Z.ai, who keep closing the gap between closed-weight SOTA models and open-weight ones. From my experience, Grok is only useful for one thing, and that's looking up things for you and gathering a consensus on topics. That's it.

                                                                                                                                                              Update, I noted that Grok 4.3 is in the "Most attractive quadrant", that's cool! It is also in the top 5 highest in "AA-Omniscience Index", good! Really good.

                                                                                                                                                              • progbits

                                                                                                                                                                today at 9:18 AM

                                                                                                                                                                What's with the charts and numbers?

                                                                                                                                                                It says #1 for speed but then in the chart it's #2. Also says #10 for intelligence but then it's #7 in the chart.

                                                                                                                                                                • today at 9:15 AM

                                                                                                                                                                  • BoorishBears

                                                                                                                                                                    today at 9:08 AM

                                                                                                                                                                    What an exciting game we're playing, where the most popular leaderboard is completely made up and the stakes are in the trillions.

                                                                                                                                                                • alfiedotwtf

                                                                                                                                                                  today at 9:28 AM

                                                                                                                                                                  If there was any model I wouldn’t trust, it wouldn’t be the ones from China, it would be the one from Elon Musk

                                                                                                                                                                    • Cthulhu_

                                                                                                                                                                      today at 9:42 AM

                                                                                                                                                                      Thankfully it's not an either / or, I don't trust any models. This is a healthy attitude to have because you shouldn't trust anyone on the internet either, especially when it comes to specific subjects.

                                                                                                                                                                        • benrutter

                                                                                                                                                                          today at 10:53 AM

                                                                                                                                                                          That's definitely a good approach. Although I get a little concerned about the resources put into convincing people that models (and especially Grok) are accurate. For example, X's "fact checked by Grok" approvals, which I've unfortunately heard people reference as meaningful.

                                                                                                                                                                          Politically motivated models can still do a lot of damage that affects me (or "have a lot of impact" depending on whether you like the politics or not) even if I don't engage with them myself.

                                                                                                                                                                          • 2ndorderthought

                                                                                                                                                                            today at 10:18 AM

                                                                                                                                                                            I don't trust this. But by not trusting it I am inherently trusting it. But by trusting it I shouldn't.

                                                                                                                                                                    • khalic

                                                                                                                                                                      today at 9:35 AM

                                                                                                                                                                      This project is a gigantic waste of resources, it’s fine tuned on politics of the CEO, was used for CSAM generation and just sucks overall

                                                                                                                                                                        • servo_sausage

                                                                                                                                                                          today at 9:58 AM

                                                                                                                                                                          I like that there are models with divergent politics; the status quo being creepy corporate left silicon valley is not healthy or pleasant to interact with.

                                                                                                                                                                          Even with Grok it's only broadening things to creepy corporate right of silicon valley.

                                                                                                                                                                            • breezybottom

                                                                                                                                                                              today at 12:05 PM

                                                                                                                                                                              Silicon Valley...left? Huh?

                                                                                                                                                                          • johnnyApplePRNG

                                                                                                                                                                            today at 10:41 AM

                                                                                                                                                                            The resource waste he's talking about is horrendous, read more here: https://time.com/7308925/elon-musk-memphis-ai-data-center/

                                                                                                                                                                            • spiderfarmer

                                                                                                                                                                              today at 9:48 AM

                                                                                                                                                                              It’s a model made for 36% of Americans. The rest of the world couldn’t care less.

                                                                                                                                                                                • 2ndorderthought

                                                                                                                                                                                  today at 10:16 AM

                                                                                                                                                                                  Considering how few Americans there are and how little of that 39% even uses technology, that's what 20 million people at a maximum?

                                                                                                                                                                                    • Hugsun

                                                                                                                                                                                      today at 10:29 AM

                                                                                                                                                                                      That seems like a decently sized market. Maybe not for an AI lab though.

                                                                                                                                                                                        • 2ndorderthought

                                                                                                                                                                                          today at 10:34 AM

                                                                                                                                                                                          Sure it's a good market for a normal company. For a social media company it's pretty isolated and really limits the products that can come out. But their current selling points: propaganda, csam, and psychosis engagement are quite strong amongst that population.

                                                                                                                                                                                            • cindyllm

                                                                                                                                                                                              today at 10:38 AM

                                                                                                                                                                                              [dead]

                                                                                                                                                                          • unit149

                                                                                                                                                                            today at 10:29 AM

                                                                                                                                                                            [dead]

                                                                                                                                                                            • shchess

                                                                                                                                                                              today at 10:02 AM

                                                                                                                                                                              [dead]

                                                                                                                                                                              • th3b0tk1ll3r

                                                                                                                                                                                today at 9:46 AM

                                                                                                                                                                                [flagged]

                                                                                                                                                                                  • curtisblaine

                                                                                                                                                                                    today at 9:52 AM

                                                                                                                                                                                    Please avoid comments with no real substance, written just to denigrate, with a throwaway account. They make discourse unnecessarily worse.

                                                                                                                                                                                • gigatexal

                                                                                                                                                                                  today at 10:57 AM

                                                                                                                                                                                  How do the Grok models fare in coding challenges compared to, say, GPT 5.5 and Opus 4.6/4.7?

                                                                                                                                                                                  I hate giving Elon any money. The man is a net negative to society but … if the models are objectively better then logically I must no?

                                                                                                                                                                                    • simonh

                                                                                                                                                                                      today at 11:02 AM

                                                                                                                                                                                      Logic can't tell you what your objectives should be, only how to achieve them.

                                                                                                                                                                                  • today at 11:20 AM