Kamo Petrosyan 6484c78352 | 4 years ago | |
---|---|---|
pysitemap | 4 years ago | |
pysitemap_old | 4 years ago | |
tests | 6 years ago | |
.gitignore | 6 years ago | |
CANGELOG | 6 years ago | |
LICENSE | 6 years ago | |
NOTICE | 6 years ago | |
README.md | 6 years ago | |
composer.json | 6 years ago | |
requirements.txt | 6 years ago | |
run.py | 4 years ago | |
setup.py | 6 years ago | |
sitemap.xml | 6 years ago |
Sitemap generator
pip install sitemap-generator
Sitemap-generator uses gevent to implement multiprocessing. Install gevent:
pip install gevent
import pysitemap
if __name__ == '__main__':
url = 'http://www.example.com/' # url from to crawl
logfile = 'errlog.log' # path to logfile
oformat = 'xml' # output format
crawl = pysitemap.Crawler(url=url, logfile=logfile, oformat=oformat)
crawl.crawl()
import pysitemap
if __name__ == '__main__':
url = 'http://www.example.com/' # url from to crawl
logfile = 'errlog.log' # path to logfile
oformat = 'xml' # output format
crawl = pysitemap.Crawler(url=url, logfile=logfile, oformat=oformat)
crawl.crawl(pool_size=10) # 10 parsing processes