scraping allrecipes website response errors #13

@schnapi

Why am I getting a lot of errors like the one below when I scrape allrecipes.com?

Thanks!

2017-10-27 13:31:38 [allrecipes] DEBUG: No item received for http://allrecipes.com/recipe/16348/baked-pork-chops-i/
2017-10-27 13:31:38 [scrapy.core.scraper] ERROR: Spider error processing <GET http://allrecipes.com/recipe/16348/baked-pork-chops-i/> (referer: http://allrecipes.com/recipes/?page=2)
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/scrapy/utils/defer.py", line 102, in iter_errback
    yield next(it)
  File "/usr/local/lib/python2.7/dist-packages/scrapy/spidermiddlewares/offsite.py", line 29, in process_spider_output
    for x in result:
  File "/usr/local/lib/python2.7/dist-packages/scrapy/spidermiddlewares/referer.py", line 339, in <genexpr>
    return (_set_referer(r) for r in result or ())
  File "/usr/local/lib/python2.7/dist-packages/scrapy/spidermiddlewares/urllength.py", line 37, in <genexpr>
    return (r for r in result or () if _filter(r))
  File "/usr/local/lib/python2.7/dist-packages/scrapy/spidermiddlewares/depth.py", line 58, in <genexpr>
    return (r for r in result or () if _filter(r))
  File "/mnt/c/Users/Sandi/Desktop/food2vec-master/food2vec-master/dat/RecipesScraper/RecipesScraper/spiders/allrecipes_spider.py", line 33, in parse_item
    if len(data['items']) == 0:
TypeError: list indices must be integers, not str
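
The `TypeError` above suggests that `data` is a list rather than a dict when `data['items']` is evaluated, so a string key cannot be used to index it. The spider's actual code is not shown here, so the following is only a minimal sketch of one way to guard against both shapes. The helper name `extract_items` is hypothetical and not part of the project or of Scrapy.

```python
def extract_items(data):
    """Return a list of items from `data`, tolerating several input shapes.

    Hypothetical helper (not from the original spider). It assumes the parsed
    page data is either a dict with an 'items' key or already a list of items.
    """
    if isinstance(data, dict):
        # Expected shape: {'items': [...]}
        items = data.get('items', [])
    elif isinstance(data, list):
        # Shape implied by the traceback: the data is itself a list.
        items = data
    else:
        # Anything else (None, str, ...) is treated as "no items".
        items = []
    # Guard against an 'items' value that is not a list.
    return items if isinstance(items, list) else []


if __name__ == '__main__':
    print(extract_items({'items': [{'name': 'Baked Pork Chops I'}]}))
    print(extract_items([{'name': 'Baked Pork Chops I'}]))
    print(extract_items(None))
```

With a helper like this, the check in `parse_item` could become `if len(extract_items(data)) == 0:`, which would no longer raise when `data` is a list. Whether a list-shaped `data` actually holds the recipe items is an assumption that would need to be verified against the real response.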
