首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >Scrapy Start_url错误

Scrapy Start_url错误
EN

Stack Overflow用户
提问于 2017-07-05 17:48:02
回答 1查看 65关注 0票数 0

我是scrapy的新手,我正在尝试从范围(1,70000000)的页面中抓取日期,我使用的代码是

代码语言:javascript
复制
import scrapy, json, re
from blackberry.items import BlackberryItem
class BlackSpider(scrapy.Spider):
    name = 'datas'
    start_urls = [
              'https://appworld.blackberry.com/cas/content/%s?countryid=100&lang=en&callback=_content_2360&_=1499177414482' %page for page in xrange(1, 10000000),
              'https://appworld.blackberry.com/cas/content/%s?countryid=100&lang=en&callback=_content_2360&_=1499177414482'%y for y in xrange(10000000, 20000000),
              'https://appworld.blackberry.com/cas/content/%s?countryid=100&lang=en&callback=_content_2360&_=1499177414482'%a for a in xrange(20000000, 30000000),
              'https://appworld.blackberry.com/cas/content/%s?countryid=100&lang=en&callback=_content_2360&_=1499177414482'%b for b in xrange(40000000, 50000000),
              'https://appworld.blackberry.com/cas/content/%s?countryid=100&lang=en&callback=_content_2360&_=1499177414482'%c for c in xrange(50000000, 60000000),
              'https://appworld.blackberry.com/cas/content/%s?countryid=100&lang=en&callback=_content_2360&_=1499177414482'%d for d in xrange(60000000, 70000000)
              ]

但是我得到了这个错误:

代码语言:javascript
复制
"y is not defined"
EN

回答 1

Stack Overflow用户

发布于 2017-07-05 18:09:38

可能的解决方案之一是如下所示。

代码语言:javascript
复制
import scrapy
import json
import re
from blackberry.items import BlackberryItem
class BlackSpider(scrapy.Spider):
    name = 'datas'
    start_urls = ['https://appworld.blackberry.com/cas/content/%s?countryid=100&lang=en&callback=_content_2360&_=1499177414482' % page for page in xrange(10000000, 20000000)]
    start_urls += ['https://appworld.blackberry.com/cas/content/%s?countryid=100&lang=en&callback=_content_2360&_=1499177414482' % page for page in xrange(20000000, 30000000)]
    start_urls += ['https://appworld.blackberry.com/cas/content/%s?countryid=100&lang=en&callback=_content_2360&_=1499177414482' % page for page in xrange(30000000, 40000000)]
    start_urls += ['https://appworld.blackberry.com/cas/content/%s?countryid=100&lang=en&callback=_content_2360&_=1499177414482' % page for page in xrange(40000000, 50000000)]
    start_urls += ['https://appworld.blackberry.com/cas/content/%s?countryid=100&lang=en&callback=_content_2360&_=1499177414482' % page for page in xrange(50000000, 60000000)]
票数 0
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/44922676

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档