Redleaf Engine Update

I’ve always been struck by the fact that while our computers can store files, they don’t really understand them. That’s the idea behind Redleaf—and I’m excited to announce that version 2.0 is now ready.

For anyone new to the project, Redleaf is a local-first knowledge engine. The goal is to take a folder of documents (PDFs, transcripts, HTML, etc.) and turn it into a private, interconnected knowledge graph that you actually own and control.

The main dashboard view in Redleaf.

This new version is a big step up in stability and usability. Here are some of the main things I’ve been working on.

Better Document Intelligence

Redleaf 1.0 already used spaCy to extract people, places, and organizations. In 2.0, browsing that information is much faster. On the side of every document there is now an integrated indexing tab showing every detected person, place, and thing—making it much easier to navigate documents at a glance.

The side panel showing extracted entities.

Implied Relationships

Beyond just listing entities, the engine now analyzes their proximity and context to automatically suggest implied relationships. This helps uncover connections you might have missed.

Automatically detecting relationships between detected elements.

A More Practical Writing Setup

The new dual-pane editor is designed for actually getting work done. It has citation tools built-in that generate Chicago-style citations and export to .odt without messing up the formatting.

The dual-pane write-up editor.

Interactive Media Viewer

I put a lot of work into the media tools. You can now pair .srt files with podcast .xml feeds to get all the metadata, and the SRT viewer now syncs the audio/video with an animated transcript. The PDF viewer has also been overhauled for a smoother experience.

The interactive SRT viewer in action.

A PDF loaded up in Synthetist to cite.

Portable Knowledge Graphs

You can now export an entire graph as a single .rklf file. A teammate can import it, point it to their copy of the source documents, and the whole graph links back up without any reprocessing.

Under the Hood

I also spent a lot of time on stability. I redesigned the multiprocessing core to make it more robust, and the PDF viewer got an overhaul with in-document search, which makes it much nicer to use.


What’s Next on the Roadmap

I’m already planning the next steps for the engine. Here are two of the big ideas I’m exploring:


Redleaf Engine 2.0 is ready. I’d love for you to try it, break it, and let me know what you think.

— Nathaniel Westveer