Extracts URLs from a sitemap file. The parsing is triggered by sniffing the
content and can also be forced by 'isSitemap=true' in the metadata, otherwise
the tuple are passed on to the default stream, whereas any URLs extracted
from the sitemaps are sent to the 'status' field with a 'DISCOVERED' status.