文章/答案/技术大牛

发布

社区首页 >问答首页 >在python变量中查找regex模式文本

问在python变量中查找regex模式文本
EN

Stack Overflow用户

提问于 2017-02-15 16:50:44

回答 1查看 93关注 0票数 0

    # Ex1
    # Number of datasets currently listed on data.gov
    # http://catalog.data.gov/dataset


    import requests
    import re

    from bs4 import BeautifulSoup


    page = requests.get(
        "http://catalog.data.gov/dataset")

    soup = BeautifulSoup(page.content, 'html.parser')

    value = soup.find_all(class_='new-results')

    results = re.search([0-9][0-9][0-9],[0-9][0-9][0-9], value


    print(value)

代码在上面..我想以regex = 0-90-9,0-90-9的形式查找文本

变量'value‘中的文本

我该怎么做呢？

根据ShellayLee的建议，我将其更改为

import requests
import re

from bs4 import BeautifulSoup


page = requests.get(
    "http://catalog.data.gov/dataset")

soup = BeautifulSoup(page.content, 'html.parser')

value = soup.find_all(class_='new-results')

my_match = re.search(r'\d\d\d,\d\d\d', value)


print(my_match)

仍然收到错误

回溯(最近一次调用)：文件"ex1.py"，第19行，在my_match = re.search(r'\d\d\d，\d\d\d'，"/Library/Frameworks/Python.framework/Versions/3.6/lib/python3.6/re.py"，)文件返回行182，在搜索返回_compile(模式，标志).search(字符串) TypeError:期望的字符串或类似字节的对象

python-3.x

beautifulsoup

回答 1

Stack Overflow用户

发布于 2017-02-15 17:31:03

您需要一些Python中的regex的基础知识。Python中的正则表达式在中表示为字符串，re模块提供了match、search、findall等函数，这些函数可以将字符串作为参数并将其视为模式。

在您的示例中，模式[0-9][0-9][0-9],[0-9][0-9][0-9]可以表示为：

my_pattern = r'\d\d\d,\d\d\d'

然后像这样使用

my_match = re.search(my_pattern, value_text)

其中，\d表示数字符号(与[0-9]相同)。字符串前面的r表示字符串中的反斜杠不会被视为转义符。

搜索函数返回一个match object。

我建议您先浏览一些教程，以消除进一步的困惑。官方的方法已经写得很好了：

https://docs.python.org/3.6/howto/regex.html

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/42244575

复制

相似问题

问在python变量中查找regex模式文本
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问在python变量中查找regex模式文本EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问在python变量中查找regex模式文本
EN