ReactorNotRestartable error in while loop with scrapy
By default, `CrawlerProcess.start()` will stop the Twisted reactor it creates when all crawlers have finished.
You should call `process.start(stop_after_crawl=False)` if you create a new `process` in each iteration of the loop.
Another option is to handle the Twisted reactor yourself and use CrawlerRunner
. The docs have an example on doing that.
I was able to solve this problem like this: `process.start()` should be called only once, after all crawls have been scheduled.
from time import sleepfrom scrapy import signalsfrom scrapy.crawler import CrawlerProcessfrom scrapy.utils.project import get_project_settingsfrom scrapy.xlib.pydispatch import dispatcherresult = Nonedef set_result(item): result = itemwhile True: process = CrawlerProcess(get_project_settings()) dispatcher.connect(set_result, signals.item_scraped) process.crawl('my_spider')process.start()
Ref http://crawl.blog/scrapy-loop/
import scrapy from scrapy.crawler import CrawlerProcess from scrapy.utils.project import get_project_settings from twisted.internet import reactor from twisted.internet.task import deferLater def sleep(self, *args, seconds): """Non blocking sleep callback""" return deferLater(reactor, seconds, lambda: None) process = CrawlerProcess(get_project_settings()) def _crawl(result, spider): deferred = process.crawl(spider) deferred.addCallback(lambda results: print('waiting 100 seconds before restart...')) deferred.addCallback(sleep, seconds=100) deferred.addCallback(_crawl, spider) return deferred_crawl(None, MySpider)process.start()