It's hopefully not too late, so Happy New Year!

It's been quite some time since our last update. Rest assured we did not stay idle. Since releasing our major PDF parsing upgrade, we've worked non-stop tuning and improving our new engine to cater to more and more use cases.

We're starting 2023 with a few small improvements that we hope will make your parsing workflow better.

New: Export last document data only

A few customers asked for a simple way to download the data of the last document they had parsed. The typical use case is that you are receiving a daily report containing updates. As soon as you receive a new report, the previous one becomes obsolete, and hence you only want the data from the freshest document.

We added a "Last document only" option to the Download and Google Sheets exports for this purpose.

The new "Last document only" option is available in the Export section of your mailbox

New: find labels starting from the bottom of the document

Labels power our flagship Dynamic OCR feature, which lets you extract data fields that move horizontally or vertically in documents.

When creating a label in an OCR Template, Parseur automatically computes the occurrence and total number of occurrences of that label in the document. Parseur will then use this information to compute the position of the label if there is more than one occurrence.

Label occurrence is calculated from the top of the document by default. However, sometimes you want Parseur to locate the label from the bottom of the document instead. For example, you want to always take the last occurrence of "Total" in a document even if the total number of occurrences varies from one document to the next.

We added the option to count occurrences from the bottom instead of the top on the label option screen.

In this example, we set the label as the first occurrence of all "Total:" labels counted from the bottom of the document, effectively asking Parseur to always take the last one

Other improvements and bug fixes

We made many updates behind the scenes to correctly handle the strangest and weirdest types of PDFs (PDFs come in all shapes and flavors).
The field usage page in your mailbox now includes fields used in OCR templates as well.

That's all for this month! As usual, please don't hesitate to share your use cases and feature requests on the chat or on our feedback page directly.

Sylvestre Dupont LinkedIn

Co-Founder and CEO at Parseur

Sylvestre is the Co-Founder and CEO of Parseur, where he leads a fully distributed team spanning Asia, Europe, the US, Africa, and the Middle East. A software engineer by training, he honed his skills managing multimillion-dollar projects at a leading consulting firm before launching Parseur. Passionate about elegantly solving complex problems, Sylvestre thrives on building products that enable companies to automate and delegate tedious operational tasks to AI. When he's not steering Parseur, he's racing co-founder Sylvain to see who can visit the most countries. He's losing.

Last updated on January 24th, 2023

Ready to automate your
document data extraction?

Start free in minutes and see how Parseur fits into your workflow.

No model training required

Automates data entry from any document

Scales from point-and-click to API

Export last document, and more

New: Export last document data only

New: find labels starting from the bottom of the document

Other improvements and bug fixes

Ready to automate yourdocument data extraction?

Ready to automate your
document data extraction?