Parse::YYLex - Version of Parse::Lex to be used by a byacc parser.


ParseLex documentation Contained in the ParseLex distribution.

Index


Code Index:

NAME

Top

Parse::YYLex - Version of Parse::Lex to be used by a byacc parser.

SYNOPSIS

Top

Parse::Lex requires this perl version: require 5.004; use Parse::YYLex;

If using a procedural parser: Parse::YYLex->create ...; # exports &yylex and $yylval # see Parse::Lex for the token table args <...> Parse::YYLex::lex->from(\*FH); require 'MyParser.pl'; # generated by byacc yyparse();

If using an object-oriented parser: $lexer = new Parse::YYLex ...; # see Parse::Lex for the token table args <...> use MyParser; # generated by byacc5 $parser = new MyParser($lexer->getyylex, \&yyerror, $debug); # you must write &yyerror $lexer->from(\*STREAM); $parser->yyparse(*STREAM);

To get the token definitions from MyParser.ph instead of y.tab.ph or to change the skip regexp (default whitespace), do this before calling new or create: Parse::YYLex->ytab('MyParser.ph'); Parse::YYLex->skip('');

DESCRIPTION

Top

Often times you'd use a lexer in conjunction with a parser. And most of the time you'd want to generate that parser with a yacc parser generator. Parse::YYLex is a derivative of Parse::Lex compatible with yacc parsers, by adapting it to the byacc calling conventions:

Procedural Parsers

Yacc (and Bison) traditionally generate C or C++ parsers. Fortunately, Berkeley yacc has been modified to generate Perl, see ftp://ftp.sterling.com/local/perl-byacc.tar.Z

Byacc with the -P option generates procedural perl code that is compatible with both perl4 and perl5. (However you cannot use Parse::YYLex with perl4.) Use this variant for quick hacks, as it is more convenient than the one below. In this case Parse::YYLex-create> instantiates a lexer and exports a &yylex function (the lexer) and a $yylval variable (the token value) to its caller's namespace (which should be the namespace of the parser).

If you need to call any object methods of the created lexer (see Parse::Lex for documentation), use the $Parse::YYLex::lex variable.

Object-Oriented Parsers

Another byacc modification (I call it byacc5) generates object-oriented Perl5 code: CPAN/authors/id/JAKE/perl5-byacc-patches-0.5.tar.gz

Use this variant if you need more than one parser, you need flexibility, or you simply like OO. In this case you need to use new, and pass the return value of getyylex (a reference to the curried lexing function) to the parser. The lexing function returns a two-element array, the token type and value.

Numeric Token Table

Yacc parsers insist on using numeric token types, and define these in a file customarily named y.tab.ph. That is where Parse::YYLex will look by default, and the file has to be in the @INC path (which includes the current directory).

You can specify a different token table before calling new or create: Parse::YYLex->ytab('MyParser.ph');

LIMITATIONS

Top

Parse::YYLex is based on Parse::Lex which requires perl 5.004 and will not work with earlier versions. A slightly different version, Parse::CLex, works with earlier perl versions. It would be easy to allow a choice between Parse::Lex and Parse::CLex, but the latter has some limitations, and presently seems to have some bugs.

AUTHOR

Top

Vladimir Alexiev <vladimir@cs.ualberta.ca>

SEE ALSO

Top

byacc(1), Parse::Lex.


ParseLex documentation Contained in the ParseLex distribution.

package Parse::YYLex;

use strict;
use Parse::Lex;
use vars qw(@ISA $VERSION);

@ISA = qw(Parse::Lex);
$Parse::YYLex::VERSION = '2.20';

my ($TABLE,$yylval);
my $ytab = 'y.tab.ph';
use vars '$lex';                # global lexer

sub ytab {                      # class method
  my ($self,$tab) = @_;
  $ytab = $tab;
}

sub new {
  my $class = ref $_[0] || $_[0]; shift; 
  my ($path) = grep -f "$_/$ytab", @INC;
  $path or die "can't locate the file $ytab in \@INC\n"; 
  my %table;
  open(FILE,"$path/$ytab") or die "can't open $ytab: !$\n"; 
  for (<FILE>) {$table{$1}=$2 if /^\$(\w+)=(\d+);$/} 
  close(FILE);
  # Normally we would create the object, but Parse::Lex uses weird array-based 
  # attributes instead of the conventional hash-based attributes.
  my $self = bless new Parse::Lex(@_), $class; 
  # index of my slot in the Lex attr array
  if (!defined $TABLE) {$TABLE = scalar @$self;} 
  elsif ($TABLE != scalar @$self) 
    {die "Can't get a consistent slot for my TABLE attribute";}
  push @$self, \%table; 
  $self;
}

sub create {                    # class method
  $lex = shift->new(@_);        # get an instance 
  my $caller = caller;
  no strict 'refs';
  *{"${caller}::yylex"} = $lex->getyylex; 
  $yylval = \${"${caller}::yylval"};
}

sub yylex {                     # convert &next to yyparse calling conv
  my $self = shift;
  # my $stream = shift; $self->from($stream) if FIXME 
  # it's different from last time... 
  my $token = $self->next;
  my $text = $token->text;
  my $symbolic = $token->type;  # symbolic token type 
  my $numeric =                 # numeric token code
    $symbolic eq 'EOI' ? 0 :    # end of input
      $self->[$TABLE]{$symbolic} || # token defined in y.tab.ph
        (length $text==1 ? ord $text : # single-character token, eg '+'
         die "Please declare %token $symbolic (`$text') in the parser.\n");
  if ($yylval) {                # global lexer
    $$yylval = $text;
    return $numeric;
  } else {return ($numeric,$text);}
  # it doesn't hurt to return both in both cases, but let's play safe
}

sub getyylex {
  my $self = shift;
  sub{$self->yylex(@_)}
}

1;