Assigns one or more tags to the metadata of a document based on its URL matching patterns defined in a JSON resource file.
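The URL-to-tag matching described above can be sketched as follows. This is an illustration only, not the filter's actual implementation: the class `UrlTagger` and its methods are hypothetical, the rules are held in memory rather than loaded from the JSON resource file (whose format is not shown here), and each tag is assumed to have a single regular expression.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical helper: maps tags to URL patterns and reports which tags apply.
class UrlTagger {
    // Insertion order is kept so tags are returned in the order the rules were added.
    private final Map<String, Pattern> patternsByTag = new LinkedHashMap<>();

    /** Registers a rule; assumption: one regular expression per tag. */
    void addRule(String tag, String regex) {
        patternsByTag.put(tag, Pattern.compile(regex));
    }

    /** Returns every tag whose pattern is found somewhere in the URL. */
    List<String> tagsFor(String url) {
        List<String> tags = new ArrayList<>();
        for (Map.Entry<String, Pattern> rule : patternsByTag.entrySet()) {
            if (rule.getValue().matcher(url).find()) {
                tags.add(rule.getKey());
            }
        }
        return tags;
    }
}
```

A URL can match several rules, in which case all matching tags are returned.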
Rewrites a single metadata value containing comma-separated values as multiple values for the same key; useful, for instance, for keyword tags.
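A minimal sketch of this kind of rewrite, assuming the metadata is a plain `Map<String, List<String>>`. The class `CommaSeparatedSplitter` is hypothetical, and the trimming and dropping of empty tokens are assumptions rather than documented behaviour.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not the filter's actual implementation.
class CommaSeparatedSplitter {

    /** Replaces the values stored under key with their comma-separated parts. */
    static void split(Map<String, List<String>> metadata, String key) {
        List<String> current = metadata.get(key);
        if (current == null) {
            return; // nothing stored under this key
        }
        List<String> expanded = new ArrayList<>();
        for (String value : current) {
            for (String token : value.split(",")) {
                String trimmed = token.trim();
                if (!trimmed.isEmpty()) { // assumption: empty tokens are dropped
                    expanded.add(trimmed);
                }
            }
        }
        metadata.put(key, expanded);
    }
}
```

With this sketch, a `keywords` entry holding the single value `storm, crawler ,java,` ends up holding the three values `storm`, `crawler` and `java`.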
Restricts the text of the main document based on the text value of an XPath expression (e.g.
Dumps the DOM representation of a document into a file.
Adds the domain (or host) to the metadata so that it can be used later for indexing.
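As a rough illustration of the host case, the sketch below uses only `java.net.URI` from the standard library. The class `HostMetadata`, the flat `Map<String, String>` metadata and the lowercasing step are assumptions made for the example. Deriving the registered domain rather than the host would additionally need a public suffix list, which is not shown.

```java
import java.net.URI;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper, not the filter's actual implementation.
class HostMetadata {

    /**
     * Stores the host of the URL under the given metadata key.
     * URI.create throws IllegalArgumentException if the URL is malformed.
     */
    static void addHost(String url, Map<String, String> metadata, String key) {
        String host = URI.create(url).getHost();
        if (host != null) { // opaque URIs such as mailto: have no host
            metadata.put(key, host.toLowerCase(Locale.ROOT));
        }
    }
}
```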
Extracts data from the JSON-LD representation (https://json-ld.org/).
ParseFilter to extract additional links with XPath; can be configured with e.g.
Computes a signature for a page, based on the binary content or text.
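One way to compute such a signature is to hash the content with `java.security.MessageDigest`. The sketch below is an assumption-laden example: the class `PageSignature` is hypothetical, and SHA-256 with hex encoding is chosen for illustration, since the digest algorithm actually used is not stated here.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical helper, not the filter's actual implementation.
class PageSignature {

    /** Hex-encoded SHA-256 digest of the raw bytes (algorithm is an assumption). */
    static String ofBytes(byte[] content) {
        try {
            MessageDigest digest = MessageDigest.getInstance("SHA-256");
            byte[] hash = digest.digest(content);
            StringBuilder hex = new StringBuilder(hash.length * 2);
            for (byte b : hash) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 is required on every Java platform, so this should not happen.
            throw new IllegalStateException("SHA-256 unavailable", e);
        }
    }

    /** Signature of the extracted text, encoded as UTF-8 before hashing. */
    static String ofText(String text) {
        return ofBytes(text.getBytes(StandardCharsets.UTF_8));
    }
}
```

Hashing the text instead of the binary content makes the signature insensitive to markup changes that leave the visible text untouched, which is usually what deduplication needs.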
Simple ParseFilter to illustrate and test the interface.
Copyright © 2018 DigitalPebble Ltd. All rights reserved.