Sitemap generator
Kamo Petrosyan 6484c78352 From knocker 4 years ago
pysitemap From knocker 4 years ago
pysitemap_old From knocker 4 years ago
tests mechanize removed 6 years ago
.gitignore Version 0.5.0 with Python 3 Compatibility 6 years ago
CANGELOG new file: CANGELOG 6 years ago
LICENSE Update LICENSE 6 years ago
NOTICE modified: .gitignore 6 years ago
README.md Update README.md 6 years ago
composer.json modified: .gitignore 6 years ago
requirements.txt modified: .gitignore 6 years ago
run.py From knocker 4 years ago
setup.py version 0.5.2 6 years ago
sitemap.xml Python 3 compatible 6 years ago

README.md

pysitemap

Sitemap generator

installing

pip install sitemap-generator

Gevent

Sitemap-generator uses gevent to crawl pages concurrently (gevent provides cooperative greenlets rather than separate OS processes). Install gevent:

pip install gevent

example

import pysitemap

if __name__ == '__main__':
    url = 'http://www.example.com/'  # URL to start crawling from
    logfile = 'errlog.log'  # path to the error log file
    oformat = 'xml'  # output format
    crawl = pysitemap.Crawler(url=url, logfile=logfile, oformat=oformat)
    crawl.crawl()
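
For readers unfamiliar with the `xml` output format, the following is a minimal sketch of what a sitemap file contains, built with the standard library only. It is not pysitemap's own code: `build_sitemap` is a hypothetical helper, and the only thing taken from outside this README is the namespace defined by the sitemaps.org protocol.

```python
import xml.etree.ElementTree as ET

# Namespace required by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a sitemap XML string listing the given URLs (hypothetical helper)."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for u in sorted(set(urls)):  # de-duplicate and keep the output order stable
        url_el = ET.SubElement(urlset, "url")
        ET.SubElement(url_el, "loc").text = u
    return ET.tostring(urlset, encoding="unicode")


if __name__ == "__main__":
    print(build_sitemap(["http://www.example.com/", "http://www.example.com/about"]))
```

Each crawled page becomes one `url` element with a `loc` child. Optional fields from the protocol, such as `lastmod`, are left out to keep the sketch short.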

multiprocessing example

import pysitemap

if __name__ == '__main__':
    url = 'http://www.example.com/'  # URL to start crawling from
    logfile = 'errlog.log'  # path to the error log file
    oformat = 'xml'  # output format
    crawl = pysitemap.Crawler(url=url, logfile=logfile, oformat=oformat)
    crawl.crawl(pool_size=10)  # crawl with a pool of 10 concurrent workers
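
To illustrate what a worker pool does for a crawler, here is a rough sketch that fetches each level of a site concurrently. It makes several assumptions: it swaps in the standard library's `ThreadPoolExecutor` for gevent, it crawls a hypothetical in-memory link graph (`FAKE_SITE`) instead of the network, and `fetch_links` and `crawl` are illustrative names, not part of pysitemap.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory "site": each page maps to the links found on it.
FAKE_SITE = {
    "/": ["/a", "/b"],
    "/a": ["/b", "/c"],
    "/b": ["/"],
    "/c": [],
}


def fetch_links(page):
    """Stand-in for downloading and parsing one page."""
    return FAKE_SITE.get(page, [])


def crawl(start, pool_size=10):
    """Breadth-first crawl; the pages of each level are fetched concurrently."""
    seen = {start}
    frontier = [start]
    with ThreadPoolExecutor(max_workers=pool_size) as pool:
        while frontier:
            next_frontier = []
            # map() runs fetch_links on up to pool_size pages at a time.
            for links in pool.map(fetch_links, frontier):
                for link in links:
                    if link not in seen:  # skip pages already queued or visited
                        seen.add(link)
                        next_frontier.append(link)
            frontier = next_frontier
    return sorted(seen)


if __name__ == "__main__":
    print(crawl("/", pool_size=2))
```

The `seen` set is only updated in the main thread, so no locking is needed. A larger `pool_size` mostly helps when fetching is dominated by network waits, which is the usual case for a crawler.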