我正在使用MARPA::R2实现一个解析器。
我有一个G1 rule,类似于:
PARAM ::= STRING | REGEX_STRINGL0 rule喜欢:
STRING ~ [^ \/\(\),&:\"~]+ -----> works fine
REGEX_STRING ~ [\"([^:]*?)\"] -----> doesn't work使用REGEX_STRING,我试图解析用双引号括起来的字符串,但是正则表达式有问题。另外,我希望删除双引号,并且只保留引号之间的内容。
因此,如果我使用以下代码提供输入:
my $recce = Marpa::R2::Scanless::R->new({grammar => $grammar});
my $input = "\"foo\""; --> here, it should parse "foo" and give me foo.
print "Trying to parse:\n$input\n\n";
$recce->read(\$input);
my $value_ref = ${$recce->value};
print "Output:\n".Dumper($value_ref);其他例子:"bar123“、"foo(123)”等。
发布于 2018-03-21 09:03:24
use 5.026;
use strictures;
use Data::Dumper qw(Dumper);
use Marpa::R2 qw();
my $grammar = Marpa::R2::Scanless::G->new({
bless_package => 'parsetree',
source => \<<'',
:default ::= action => [values] bless => ::lhs
lexeme default = action => [ start, length, value ] bless => ::name latm => 1
:start ::= expression
expression ::= funcname params
params ::= epsilon | lparen param rparen
epsilon ::=
funcname ~ [a-z0-9]+
lparen ~ '('
param ::= unquotedparam | quotedparam
unquotedparam ::= [a-z0-9]+
quotedparam ::= '"' stringliteral '"'
stringliteral ~ [^"]+
rparen ~ ')'
});
say $grammar->show_rules;
for my $input (qw[
func("foo")
bar123
foo(123)
]) {
my $r = Marpa::R2::Scanless::R->new({
grammar => $grammar,
trace_terminals => 1
});
$r->read(\$input);
say Dumper $r->value;
}https://stackoverflow.com/questions/49396033
复制相似问题