Restricts the text of the main document based on the text value of an Xpath expression (e.g.
Dumps the DOM representation of a document into a file
Adds domain (or host) to metadata - can be used later on for indexing
Extracts data from JSON-LD representation (https://json-ld.org/)
ParseFilter to extract additional links with Xpath can be configured with e.g.
Computes a signature for a page, based on the binary content or text.
Simple ParseFilter to illustrate and test the interface.
Copyright © 2018 DigitalPebble Ltd. All rights reserved.