\

Show HN: A WYSIWYG word processor in Python

28 points - today at 6:40 PM


Hi all,

Finding a good data structure for a word processor is a difficult problem. My notebook diaries on the problem go back 25 years when I was frustrated with using Word for my diploma thesis - it was slow and unstable at that time. I ended up getting pretty hooked on the problem.

Right now I’m taking a professional break and decided to finally use the time to push these ideas further, and build MiniWord — a WYSIWYG word processor in Python.

My goal is to have a native, non-HTML-based editor that stays simple, fast, and is hackable. So far I am focusing on getting the fundamentals right. What is working yet is:

- Real WYSIWYG editing (no HTML layer, no embedded browser) with styles, images and tables.

- Clean, simple file format (human-readable, diff-friendly, git-friendly, AI-friendly)

- Markdown support

- Support for Python-plugins

Things that I found:

- B-tree structures are perfect for holding rich text data

- A simple text-based file format is incredibly useful — you can diff documents, version them, and even process them with AI tools quite naturally

What I’d love feedback on:

- Where do you see real use cases for something like this?

- What would be missing for you to take it seriously as a tool or platform?

- What kinds of plugins or extensions would actually be worth building?

Happy about any thoughts — positive or critical. Greetings

Source
  • __d

    today at 9:02 PM

    This is great!

    Curious about the choice of toolkit: what led you to wxPython?

    • vishnuharidas

      today at 8:32 PM

      This took me down the nostalgic memory lane of the planet-source-code days. There were hundreds of such projects in Visual Basic, Delphi, C/C++/MFC etc., and text editors and paint clones were the most popular projects.

      • analogpixel

        today at 8:33 PM

        at this point, a WYSIWYG just seems like a huge step backwards from just using markdown. I love having access to my files in a standard text format this is super easy to parse, and not being locked into whatever weird format that WYSIWYG decides to store it in.

        I still don't understand why people still use ~~Microsoft Word~~Copilot document writer , I think they have gotten into some weird mindset that their documents require all this weird unnecessary formatting to look "official"

          • sakesun

            today at 9:11 PM

            Yes. These days, with plain text, pasrsers, Internet, mobile devices and LLM, we really get more than what we see. Only few case where paper print out is still more useful.

            • httpsterio

              today at 8:59 PM

              Markdown without formatting isn't usually the nicest to read imo. I actually appreciate a well laid out and formatted document myself.

              Also wysiwyg doesn't mean it can't be back and forwards compatible with markdown, it might just mean that it's a markdown editor gui with a preview.

          • chjail-11

            today at 8:03 PM

            I adore anything that avoids using a browser. <3

            • kubb

              today at 8:26 PM

              I thought the data structure part is solved:

              https://en.wikipedia.org/wiki/Rope_(data_structure)

                • chrisecker

                  today at 8:43 PM

                  Ropes are for strings. In a word processor you need text with formatting, and structures as tables, images and math.

              • mttpgn

                today at 8:00 PM

                On MacOS, I'm seeing `ModuleNotFoundError: No module named 'miniword.core.utils'` whether I run `python3 -m miniword` from src/miniword/ or from src/miniword/miniword/.

                  • chrisecker

                    today at 8:13 PM

                    My mistake. Now it works (on linux).

                • fractallyte

                  today at 8:31 PM

                  One feature missing from almost every mainstream word processor: REVEAL CODES! (https://kb.corel.com/en/127364)

                  This is a famous "killer" feature from WordPerfect: the ability to view and edit the low-level formatting for a document. It's invaluable for fixing weird bugs.

                  However, it works only because WP uses the "text-stream" paradigm, where a document comprises a linear stream of text with formatting codes (Bold, Font, Hard Return, etc.) embedded directly at the point at which they're applied.

                  In contrast, Word uses the "nested containers" model (characters inside words, words inside paragraphs, paragraphs inside sections, etc.), where this feature can't be replicated.

                  I didn't look closely at your code, but just thought to mention this feature.

                  • avryhof

                    today at 8:01 PM

                    Looks like a nice project.

                    Looks like you missed a file, though.

                    ModuleNotFoundError: No module named 'miniword.core.utils'

                    I don't see it in my local clone of your repo, nor the repo iteslf.

                      • chrisecker

                        today at 8:10 PM

                        My apologies. I added the missing file.

                          • avryhof

                            today at 8:35 PM

                            Thanks. I got it to run on my work laptop that runs Windows. Selections don't work, and cairo spits out a bunch of errors during the screen redraws.

                            I'll give it a shot on my own Ubuntu laptop.

                    • LoganDark

                      today at 7:21 PM

                      Love to see wxPython!

                      • johnwhitman

                        today at 9:00 PM

                        [dead]