🧡 Skip to main content🔍 Skip to search

The Use web content feature enables you to use any file or webpage from the internet as a flexible input parameter in any Action. A static input value is substituted with a dynamic value at Task runtime.

Using web content gives a workflow ability to automatically react to HTML, text, JSON, or XML file changes. Arbitrary fields from a text based file can be extracted using powerful Text Actions to branch your workflow later using IF–ELSE logic.

Automation Workshop allows converting HTML files on the web to elegantly formatted plain text files fully automatically. When requirements are to extract only one or couple tags from an XML or HTML file, use a very flexible Enclosed Text Action.

VariableDetails
Webpage or file URL A fully qualified URL (path to the web resource)—webpage or file address.
Examples:
  • example.org/files/invoices.txt
  • www.example.net/dir/web-resource/
  • https://www.example.com/products/page.html
Retrieve as plain text Automatically converts a HTML file to plain text file. The built-in converting engine uses the standard Unicode text encoding · Preview
Include metadata Choose to include additional metadata if the plain text option is selected:
  • URL · a fully qualified URL of the specified web address after all redirections (301, 302, etc.) are made.
  • Title · value of the <title> tag in the plain text format.
  • Description · description of the specified web resource that is automatically extracted from the first <meta> tag that contains the name="description" attribute.
Adjust*Streamline creation of Task workflows even further—instantly make quick variable adjustments such as in-place replacement (all, first, or last occurrence), trimming (whitespaces, quotes, etc.), or changing capitalization. Optionally, set a different display name.

Plain text…

Automation Workshop enables you to extract plain text from any webpage in a no-code manner. The whole process is fully automated and unattended. The converted text is nicely formatted using the standard ASCII or UTF-8 text encoding.

The HTML to plain text converting engine treats ordered lists, unordered lists, tables, and forms as formatted elements, and tries to reproduce them in the text format using special spacing and tabbing.

Automatically convert webpage to plain text document
Use web content · Convert webpage to plain text

Details…

The extracted text is usually multiple times smaller than the original web page, as it doesn't contain any HTML markup. The converting engine automatically skips all <script> tags and as a result—the text that is generated by JavaScript doesn't appear in the plain text file.

On the other hand, the text that is not visible or is hidden when a page rendered in a regular web browser, may appear in the plain text version, as Automation Workshop doesn't parse CSS files. It also ignores any <style> elements. All comments and images are also skipped.

To finetune the final text, simply use the built-in text Actions, such as the Find & Replace Text Action. Small adjustments can be done "inline" using the Variable Wizard adjust feature (replace special symbols or change text case).

Just ask…

If you have any questions, please do not hesitate to contact our support team.