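I'm trying to extract the names of the movies listed on this Fandango page.

names_tag = soup.findAll('a', {'class': 'dark showtimes-movie-title'})

This is the anchor class that holds the names. The problem is that when I run the code, the output is:

<a class="dark showtimes-movie-title" href="http://www.fandango.com/godzilla3d_170083/movieoverview">Godzilla 3D</a>

when all I want is Godzilla 3D. How can I parse this data successfully?
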
#anchor element containing the names of each movie
names_tag = soup.findAll('a', {'class': 'dark showtimes-movie-title'})
names_tag = str(names_tag)
movie_name = names_tag.split(',')
for each_line in movie_name:
    movie_names.append(each_line)
i = 0
while (i < len(movie_names)):
    print 'The length of %s is %s' %(movie_names[i], movie_times[i])
    i+=1

Posted on 2014-05-19 12:20:28
Use the text attribute:

names_tag = soup.findAll('a', {'class': 'dark showtimes-movie-title'})
names = [name_tag.text for name_tag in names_tag]

https://stackoverflow.com/questions/23729636
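
For context, calling str() on the result of findAll turns the matched tags back into their HTML markup, which is why the whole <a ...> element was printed. Below is a minimal, self-contained sketch of the text-attribute approach. It assumes the bs4 package (BeautifulSoup 4) is installed and uses a small hand-written HTML sample instead of the live Fandango page; the second movie entry is invented for illustration. find_all is the current spelling of findAll in bs4, and get_text() returns the same string as .text.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup modeled on the anchor shown in the question.
# The second entry ("Example Movie") is made up for illustration only.
html = """
<div>
  <a class="dark showtimes-movie-title" href="http://www.fandango.com/godzilla3d_170083/movieoverview">Godzilla 3D</a>
  <a class="dark showtimes-movie-title" href="#">Example Movie</a>
</div>
"""

soup = BeautifulSoup(html, "html.parser")

# Same lookup as in the question: anchors whose class attribute is
# exactly "dark showtimes-movie-title".
names_tag = soup.find_all("a", {"class": "dark showtimes-movie-title"})

# Take only the text inside each tag, not the tag markup itself.
movie_names = [tag.get_text().strip() for tag in names_tag]

for name in movie_names:
    print(name)
```

Because movie_names now holds plain strings, there is no need for the str() and split(',') steps from the question.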