document.sitemap Module

Sitemap.xml

class wpull.document.sitemap.SitemapReader(html_parser)[source]

Bases: wpull.document.base.BaseDocumentDetector, wpull.document.base.BaseExtractiveReader

Sitemap XML reader.

MAX_ROBOTS_FILE_SIZE = 4096
classmethod is_file(file)[source]

Return whether the file is likely a Sitemap.

classmethod is_request(request)[source]

Return whether the document is likely to be a Sitemap.

classmethod is_response(response)[source]

Return whether the document is likely to be a Sitemap.

classmethod is_url(url_info)[source]

Return whether the document is likely to be a Sitemap.