\

Show HN: GitMCP is an automatic MCP server for every GitHub repo

144 points - yesterday at 6:28 PM

Source
  • kiitos

    yesterday at 9:39 PM

    > Simply change the domain from github.com or github.io to gitmcp.io and get instant AI context for any GitHub repository.

    What does this mean? How does it work? How can I understand how it works? The requirements, limitations, constraints? The landing page tells me nothing! Worse, it doesn't have any links or suggestions as to how I could possibly learn how it works.

    > Congratulations! The chosen GitHub project is now fully accessible to your AI.

    What does this mean??

    > GitMCP serves as a bridge between your GitHub repository's documentation and AI assistants by implementing the Model Context Protocol (MCP). When an AI assistant requires information from your repository, it sends a request to GitMCP. GitMCP retrieves the relevant content and provides semantic search capabilities, ensuring efficient and accurate information delivery.

    MCP is a protocol that defines a number of concrete resource types (tools, prompts, etc.) -- each of which have very specific behaviors, semantics, etc. -- and none of which are identified by this project's documentation as what it actually implements!

    Specifically what aspects of the MCP are you proxying here? Specifically how do you parse a repo's data and transform it into whatever MCP resources you're supporting? I looked for this information and found it nowhere?

      • broodbucket

        yesterday at 11:01 PM

        As someone who is obviously not the target audience, I feel like literally anything on this page that could lead me to explain what MCP is would be nice, while we're talking about what the landing page doesn't tell you. Even just one of the MCP mentions being a link to modelcontextprotocol.io would be fine.

        Or maybe I'm so out of the loop it's as obvious as "git" is, I dunno.

          • fragmede

            today at 11:25 AM

            Itā€™s fair to be curious, but at some point itā€™s also reasonable to expect people are capable of using Google to look up unfamiliar terms. I'm not gatekeepingā€”just, like, put in a bit of effort?

            Threads like this work better when they can go deeper without rehashing the basics every time.

              • johannes1234321

                today at 2:34 PM

                Having a Link to the mcp website won't be "rehashing" but how the web once was supposed to be.

        • T3RMINATED

          today at 12:04 PM

          [dead]

          • sdesol

            today at 7:26 AM

            [flagged]

              • john2x

                today at 9:29 AM

                Is this the new LMGTFY?

                • kiitos

                  today at 11:48 AM

                  I appreciate that! Now maybe they could update the readme accordingly! ;)

          • ianpurton

            today at 6:12 AM

            Some context.

            1. Some LLMs support function calling. That means they are given a list of tools with descriptions of those tools.

            2. Rather than answering your question in one go, the LLM can say it wants to call a function.

            3. Your client (developer tool etc) will call that function and pass the results to the LLM.

            4. The LLM will continue and either complete the conversation or call more tools (functions)

            5. MCP is gaining traction as a standard way of adding tools/functions to LLMs.

            GitMCP

            I haven't looked too deeply but I can guess.

            1. Will have a bunch of API endpoints that the LLM can call to look at your code. probably stuff like, get_file, get_folder etc.

            2. When you ask the LLM for example "Tell me how to add observability to the code", the LLM can make calls to get the code and start to look at it.

            3. The LLM can keep on making calls to GitMCP until it has enough context to answer the question.

            Hope this helps.

            • liadyo

              yesterday at 6:28 PM

              We built an open source remote MCP server that can automatically serve documentation from every Github project. Simply replace github.com with gitmcp.io in the repo URL - and you get a remote MCP server that serves and searches the documentation from this repo (llms.txt, llms-full.txt, readme.md, etc). Works with github.io as well. Repo here: https://github.com/idosal/git-mcp

                • nlawalker

                  yesterday at 10:15 PM

                  >searches the documentation from this repo (llms.txt, llms-full.txt, readme.md, etc)

                  What does etc include? Does this operate on a single content file from the specified GitHub repo?

              • the_arun

                yesterday at 6:40 PM

                But why would we need an MCP server for a github repo? Sorry, I am unable to understand the use case.

                  • liadyo

                    yesterday at 6:46 PM

                    It's very helpful when working with a specific technology/library, and you want to access the project's llms.txt, readme, search the docs, etc from within the IDE using the MCP client. Check it out, for exmaple, with the langgraph docs: https://gitmcp.io/#github-pages-demo It really improves the development experience.

                      • yesterday at 8:22 PM

                    • scosman

                      today at 11:15 AM

                      Itā€™s one of my favourite MCP use cases. I have cloned projects and used the file browser MCP for this, but this looks great.

                      It allows you to ask questions about how an entire system works. For example the other day ā€œthis GitHub action requires the binary X. Is it in the repo, downloading it, or building it on deploy, or something else.ā€ Or ā€œwhat tools does this repo used to implement full text search? Give me an overviewā€

                      • qainsights

                        yesterday at 7:50 PM

                        Same here. Can't we just give the repo URL in Cursor/Windsurf to use the search tool to get the context? :thinking:

                          • adusal

                            yesterday at 8:44 PM

                            As an example, some repositories have huge documents (in some cases a few MBs) that agents won't process today. GitMCP offers semantic search out of the box.

                            • jwblackwell

                              yesterday at 9:14 PM

                              Yeah this is one fundamental reason I don't see MCP taking off. The only real use cases there are will just be built in natively to the tools.

                                • hobofan

                                  today at 7:15 AM

                                  Yes, they could be, but then you 100% rely on the client tools doing a good job doing that, which they aren't always good at, and they also have to reinvent the wheel on what are becoming essentially commodity features.

                                  E.g. one of the biggest annoyances for me with cursor was external documentation indexing, where you hand it the website of a specific libarary and then it crawls and indexes that. That feature has been completely broken for me (always aborting with a crawl error). Now with a MCP server, I can just use one that is specialized in this kind of documentation indexing, where I also have the ability to tinker with it if it breaks, and then can use that in all my agentic coding tools that need it (which also allows me to transfer more work to background/non-IDE workflows).

                              • cruffle_duffle

                                today at 1:20 AM

                                MCP servers present a structured interface for accessing something and (often) a structured result.

                                You tell the LLM to visit your GitHub repository via http and it gets backā€¦ unstructured, unfocused content not designed with an LLMā€™s context window in mind.

                                With the MCP server the LLM can initiate a structured interface request and get back structured repliesā€¦ so instead of HTML (or text extracted from HTML) it gets JSON or something more useful.

                                  • cgio

                                    today at 5:29 AM

                                    Is html less structured than json? I thought with LLMs the schematic of structure is less relevant than the structure itself.

                            • SkyPuncher

                              yesterday at 11:43 PM

                              Once case Iā€™ve found valuable is dropping a reference to a PR thatā€™s relevant to my work.

                              Iā€™ll tell it to look at that PR to gain context about what was previously changed.

                              • ramoz

                                yesterday at 11:46 PM

                                Right, because agents can utilize git natively.

                                If this is for navigating/searching github in a fine-grained way, then totally cool and useful.

                            • pcwelder

                              yesterday at 7:18 PM

                              Why not have a single mcp server that takes in the repo path or url in the tool call args? Changing config in claude desktop is painful everytime.

                            • yesterday at 11:01 PM

                              • qwertox

                                yesterday at 6:43 PM

                                That is a complex webserver. https://github.com/idosal/git-mcp/tree/main/api

                                What about private repos in, let's say GitLab or Bitbucket instances, or something simpler?

                                A Dockerfile could be helpful to get it running locally.

                                  • liadyo

                                    yesterday at 6:48 PM

                                    Yes, this is a fully remote MCP server, so the need for an SSE support makes the implementation quite complex. The MCP spec updated to use HTTP streaming, but clients do not support it yet.

                                      • TechDebtDevin

                                        yesterday at 6:54 PM

                                        Gemini does I believe. On my list of todos is to add this to my fork of mcp-go.

                                          • vessenes

                                            yesterday at 7:31 PM

                                            +1 for this, I'm so so tired of writing my MCP code in python.

                                              • prophesi

                                                yesterday at 7:44 PM

                                                If you're in Elixir land, Hermes MCP[0] is a fantastic library for building out MCP clients/servers with both SSE/HTTP support. And will be quite robust given the scalability and fault tolerance of the BEAM.

                                                [0] https://github.com/cloudwalk/hermes-mcp

                                                  • vessenes

                                                    yesterday at 7:50 PM

                                                    ooh cool. Sadly I am far from Elixir land. MCP starting out as largely STDIO definitely has made things harder for server-side engineers. I expect this will sort itself out this year though.

                                • fzysingularity

                                  today at 3:50 AM

                                  While I like the seamless integration with GitHub, Iā€™d imagine this doesnā€™t fully take advantage of the stateful nature of MCP.

                                  A really powerful git repo x MCP integration would be to automatically setup the GitHub repo library / environment and be able to interact with that library, making it more stateful and significantly more powerful.

                                  • xena

                                    today at 2:09 AM

                                    How do I opt-out for my repos?

                                      • scosman

                                        today at 11:32 AM

                                        Do you think that should be an option? I totally get opting out of crawlers, search or training but this is different.

                                        But should the author be able to opt out of a tool used for manually initiated queries? I canā€™t say ā€œdonā€™t use grepā€ on my repo.

                                          • xena

                                            today at 11:40 AM

                                            Grep is a tool. This is a service.

                                              • scosman

                                                today at 1:23 PM

                                                Yes. But is that the line?

                                                Crawling makes sense (automated traffic) but this isnā€™t automated, itā€™s user initiated. Search indexing makes sense (this isnā€™t that). Training makes sense (this isnā€™t that).

                                                It should have a honest user agent so server can filter, for sure.

                                                If Iā€™m allowed ā€˜git clone X && grep -rā€™ against a service, why canā€™t I do the same with MCP.

                                    • creddit

                                      yesterday at 10:23 PM

                                      How does this differ from the reference Github MCP server?

                                      https://github.com/modelcontextprotocol/servers/tree/main/sr...

                                      EDIT: Oh wait, lol, I looked closer and it seems that the difference is that the server runs on your server instead which is like the single most insane thing I can think of someone choosing to do when the reference Github MCP server exists.

                                        • adusal

                                          today at 2:21 PM

                                          Just to be clear, GitMCP isn't a repository management tool. Its sole purpose is to make documentation accessible to AI in ways the current tools do not (e.g., semantic search, not necessarily limited to a repository), with minimal overhead for users. GitMCP itself is a free, public, open-source repository. The tool doesn't have access to PII and doesn't store agent queries.

                                          • creddit

                                            yesterday at 10:27 PM

                                            This literally looks like spyware to me. Crazy.

                                        • eagleinparadise

                                          yesterday at 10:05 PM

                                          Getting "@ SSE error: undefined" in Cursor for a repo I added. Is there also not a way to force a MCP server to be used? Haiku doesn't pick it up in Cursor.

                                            • adusal

                                              today at 2:04 PM

                                              Update: We've upgraded our resources to accommodate the growing traffic!

                                              • adusal

                                                yesterday at 10:53 PM

                                                The error usually isn't an issue since the agent can use the tools regardless. It's a by-product of the current implementation's serverless nature and SSE's limitations. We are looking into alternative solutions.

                                            • lukew3

                                              yesterday at 6:38 PM

                                              Cool project! I would probably call it an MCP server for every Github repo though since project could be confused for Github Projects which is their work planning/tracking tool.

                                                • liadyo

                                                  yesterday at 6:54 PM

                                                  Thanks!

                                              • fallat

                                                today at 12:31 AM

                                                Ok, wow.

                                                MCP is REALLY taking off FAST D:

                                                • thomasfromcdnjs

                                                  yesterday at 7:49 PM

                                                  This is awesome, well done!

                                                  • pfista

                                                    yesterday at 7:14 PM

                                                    Do you auto-generate specific MCP tools for the repo? Curious what the queries you would use with an AI agent to get a response back.

                                                    I'm building my own hosted MCP solution (https://skeet.build) and have been deliberately choosing which tools to expose depending on the use case- since there are tool limits due to the context window for apps like Cursor.

                                                    • T3RMINATED

                                                      today at 12:03 PM

                                                      [dead]