How to parse a webpage from a link in an email?

At Parseur, we started with extracting data from emails. But Parseur's ultimate mission is to extract data from the Internet as a whole. In this article we are going to describe how to use Parseur to parse a webpage from a link in an email.

How to parse a webpage cover image

Step 1: Create a Parseur mailbox

If you haven't already done so, create your Parseur account and you’ll get started on our free forever plan.

Create your first mailbox and forward an email containing a link you want to extract data from.

Sample email we want to extract the link from

Sample email we want to extract the link from

Note: Parseur works best with machine generated emails.

Step 3: Create a template for the fetched web page

Now create a template for the web page by clicking on the plus button.

Create as many fields in the new fetched web page as you need to capture data

Create as many fields in the new fetched web page as you need to capture data

Click Create and...

Step 4: Watch Parseur parse a web page and profit!

Creating the template will parse your document and extract the relevant data from it.

See the data extracted by Parseur on the right-hand side

See the data extracted by Parseur on the right-hand side

Now, every time you send a similar email with a link, the web page will be fetched and if it matches one of your existing templates, data from the web page will be parsed and extracted automatically.

Some closing remarks

Parseur is not limited to extracting links from emails. Any field in a template with the format "Linked Document" will be used to download documents and extract data. That means that you can fetch web pages from email attachments as well as from other web pages!

Parseur charges you on the number of successfully processed documents. Which means that fetching a web page from an email link and parsing that web page will count as 2 credits.

Infinite loop warning

Since the address of the document appears in the subject, try not to create a template that extract and fetch the subject as a link, it will create a new document from the same link again and again...

Last updated: