Use "extra" to access meta fields in post processing

Portrait of Sylvestre Dupont
by Sylvestre Dupont
1 min read
last updated on

Based on feedback received, we are improving our Post Processing feature. You now have access to an extra dict variable when writing your post processing code. This dictionary contains all extra fields, even if they are not enabled in your mailbox.

This allows you, for example, to extract a field using a regular expression from the HtmlDocument extra field without having the full document content as part of your parsed result.

For example, we are now using this internally to improve the accuracy and data quality of our Doordash managed template. Doordash PDFs can have duplicated line items and the PDF footer gets in the way on multi-page orders. We are using post processing and the extra["HtmlDocument"] data to extract items and clean totals from the PDF.

Learn more about writing Python code to post process your data.

AI-based data extraction software. Start using Parseur today.

Automate text extraction from emails, PDFs, and spreadsheets.
Save hundreds of hours of manual work.
Embrace work automation with AI.

Sign up for free
Parseur rated 5/5 on Capterra
Parseur.com has the highest adoption on G2
Parseur.com has the happiest users badge on Crozdesk
Parseur rated 5/5 on GetApp
Parseur rated 4.5/5 on Trustpilot