Skip to main content

2 posts tagged with "lessons-learned"

View All Tags

We Spent Five Weeks Making Docling Work. Then We Deleted It.

· 6 min read · updated
Danish Javed
Software Engineer

This is a post-mortem on five weeks of infrastructure work that ended with git rm and 1,452 lines deleted from the lockfile alone.

The library in question is Docling. It's a capable open-source document parser from IBM Research — handles PDFs, tables, figures, DOCX, the lot. On paper it looked like exactly what we needed. In practice it turned out to be a small ML platform hiding inside a Python package, and we didn't fully appreciate that distinction until we were already three acts deep.