SWISH::Prog::Native::Searcher - wrapper for SWISH::API::Object


SWISH-Prog documentation. Contained in the SWISH-Prog distribution.

NAME

SWISH::Prog::Native::Searcher - wrapper for SWISH::API::Object

SYNOPSIS

 # see SWISH::Prog::Searcher

DESCRIPTION

The Native Searcher is a thin wrapper around SWISH::API::Object.

METHODS

init

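Calls the parent class init(), instantiates the SWISH::API::Object instance and stores it in the swish() accessor. It also adds an accessor to the result class for each index property the class does not already support, and builds the Search::Query::Parser used by search().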

sao_opts( array_ref )

Array ref of additional options to pass to SWISH::API::Object->new().

result_class( class_name )

Name of the result class, passed to SWISH::API::Object->new() as the class option. Defaults to SWISH::Prog::Native::Result.

swish

The SWISH::API::Object instance.
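The accessors above are normally set through new(). The following is a minimal sketch, assuming that new() accepts an invindex path as described in SWISH::Prog::Searcher; the path path/to/index.swish is a placeholder, and the script skips the constructor call when the module or the index is not available.

```perl
use strict;
use warnings;

# Assumption: 'path/to/index.swish' is a placeholder for an existing index.
my %args = (
    invindex     => 'path/to/index.swish',
    result_class => 'SWISH::Prog::Native::Result',    # the default used by init()
    sao_opts     => [],    # extra key/value pairs appended to SWISH::API::Object->new()
);

if ( eval { require SWISH::Prog::Native::Searcher; 1 } && -e $args{invindex} ) {
    my $searcher = SWISH::Prog::Native::Searcher->new(%args);

    # swish() returns the wrapped SWISH::API::Object created in init().
    print ref( $searcher->swish ), "\n";
}
else {
    print "SWISH::Prog or the index is not available; skipping\n";
}
```

Leaving result_class and sao_opts out entirely should give the same object, since init() falls back to the same result class and an empty option list.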

search( query, opts )

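Parses query with the internal Search::Query::Parser, then executes it through a search object created from the internal SWISH::API::Object. Croaks if query is missing or cannot be parsed. Returns a SWISH::API::Object::Results object.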

opts is an optional hashref with the following supported key/values:

start

The starting position. Default is 0.

max

The ending position. Default is max_hits() as documented in SWISH::Prog::Searcher.

order

Takes a SQL-like sort string in the pattern "field direction", for example "swishrank desc". See the Swish-e docs for sort string details.

limit

Takes an arrayref of arrayrefs. Each child arrayref should have three values: a field (PropertyName) value, a lower limit and an upper limit.

rank_scheme

Takes an int, 0 or 1. Default is 1.

default_boolop

The default boolean connector for parsing query. Valid values are AND and OR. The default is AND.
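To illustrate the options above, the sketch below builds an opts hashref and fills in the documented defaults. normalize_opts() is a hypothetical helper written for this example and is not part of SWISH::Prog; the property names swishtitle and swishlastmodified, the epoch range, and the fallback of 1000 for max are assumptions as well.

```perl
use strict;
use warnings;
use Carp;

# Hypothetical helper (not part of SWISH::Prog): applies the defaults
# documented above and checks the shape of each limit.
sub normalize_opts {
    my ( $opts, $max_hits ) = @_;
    $opts ||= {};
    my %o = (
        start          => $opts->{start} || 0,
        max            => $opts->{max}   || $max_hits,
        order          => $opts->{order},
        limit          => $opts->{limit} || [],
        rank_scheme    => defined $opts->{rank_scheme} ? $opts->{rank_scheme} : 1,
        default_boolop => uc( $opts->{default_boolop} || 'AND' ),
    );
    croak "default_boolop must be AND or OR"
        unless $o{default_boolop} eq 'AND' or $o{default_boolop} eq 'OR';
    for my $limit ( @{ $o{limit} } ) {
        croak "each limit must be an arrayref of 3 values"
            unless ref($limit) eq 'ARRAY' and @$limit == 3;
    }
    return \%o;
}

my $opts = normalize_opts(
    {   order          => 'swishtitle asc',
        # one limit: property name, lower bound, upper bound (epoch seconds)
        limit          => [ [ 'swishlastmodified', 1262304000, 1293839999 ] ],
        default_boolop => 'or',
    },
    1000
);

# With a real searcher this hashref would be passed as the second argument:
#   my $results = $searcher->search( 'foo bar', $opts );
printf "start=%d max=%d boolop=%s\n",
    @{$opts}{qw( start max default_boolop )};
```

Only the keys that differ from the defaults need to be supplied; search() applies the remaining defaults itself.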

AUTHOR

Peter Karman, <perl@peknet.com>

BUGS

Please report any bugs or feature requests to bug-swish-prog at rt.cpan.org, or through the web interface at http://rt.cpan.org/NoAuth/ReportBug.html?Queue=SWISH-Prog. I will be notified, and then you'll automatically be notified of progress on your bug as I make changes.

SUPPORT

You can find documentation for this module with the perldoc command.

    perldoc SWISH::Prog

You can also look for information at:

* Mailing list

http://lists.swish-e.org/listinfo/users

* RT: CPAN's request tracker

http://rt.cpan.org/NoAuth/Bugs.html?Dist=SWISH-Prog

* AnnoCPAN: Annotated CPAN documentation

http://annocpan.org/dist/SWISH-Prog

* CPAN Ratings

http://cpanratings.perl.org/d/SWISH-Prog

* Search CPAN

http://search.cpan.org/dist/SWISH-Prog/

COPYRIGHT AND LICENSE

SEE ALSO

http://swish-e.org/


package SWISH::Prog::Native::Searcher;
use strict;
use warnings;
use Carp;
use base qw( SWISH::Prog::Searcher );
use SWISH::API::Object;
use SWISH::Prog::Native::InvIndex;
use SWISH::Prog::Native::Result;
use Search::Query;

__PACKAGE__->mk_accessors(qw( swish sao_opts result_class ));

our $VERSION = '0.51';

sub init {
    my $self = shift;
    $self->SUPER::init(@_);

    $self->{swish} = SWISH::API::Object->new(
        indexes => [ map { $_->file } @{ $self->{invindex} } ],
        class => $self->{result_class} || 'SWISH::Prog::Native::Result',
        @{ $self->{sao_opts} || [] }
    );

    # add accessor methods to the Result class
    # to mimic what SWISH::API::Object does.
    my $resclass = $self->{swish}->{class};
    if ( $resclass->can('mk_accessors') ) {
        my @propnames = $self->{swish}->props;
        for my $name (@propnames) {
            if ( !$resclass->can($name) ) {
                $resclass->mk_accessors($name);
            }
        }
    }

    # load meta from the first invindex
    my $invindex = $self->invindex->[0];
    my $config   = $invindex->meta;

    # have to wrap MetaNames check in eval because
    # there may not be any explicitly defined in config.
    my $metanames;
    eval { $metanames = $config->MetaNames; };
    if ( $@ and $@ =~ m/^no such Meta key: MetaNames/ ) {
        $metanames = { swishdefault => {} };
    }
    my $field_names = [ keys %$metanames ];
    my %fieldtypes;
    for my $name (@$field_names) {

        # TODO check PropertyNames for string|int|date
        $fieldtypes{$name} = {};

        if ( exists $metanames->{$name}->{alias_for} ) {
            $fieldtypes{$name}->{alias_for}
                = $metanames->{$name}->{alias_for};
        }
    }

    # TODO could expose 'qp' as param to new().
    $self->{qp} ||= Search::Query::Parser->new(
        dialect          => 'SWISH',
        fields           => \%fieldtypes,
        query_class_opts => {
            default_field => $field_names,
            debug         => $self->debug,
        }
    );

    return $self;
}

my %boolops = (
    'AND' => '+',
    'OR'  => '',
);

sub search {
    my $self        = shift;
    my $query       = shift or croak "query required";
    my $opts        = shift || {};
    my $start       = $opts->{start} || 0;
    my $max         = $opts->{max} || $self->max_hits;
    my $order       = $opts->{order};
    my $limits      = $opts->{limit} || [];
    my $rank_scheme = $opts->{rank_scheme};
    $rank_scheme = 1 unless defined $rank_scheme;
    # normalize case once so the validity check and the lookup agree
    my $boolop = uc( $opts->{default_boolop} || 'AND' );

    if ( !exists $boolops{$boolop} ) {
        croak "Unsupported default_boolop: $boolop (should be AND or OR)";
    }
    $self->{qp}->default_boolop( $boolops{$boolop} );
    my $parsed_query = $self->{qp}->parse($query)
        or croak "Query syntax error: " . $self->{qp}->error;

    my $swishdb = $self->{swish};

    # rank_scheme 1 (the default) selects IDF ranking
    $swishdb->rank_scheme($rank_scheme);
    $swishdb->die_on_error('critical_error');

    my $searcher = $swishdb->new_search_object;

    for my $limit (@$limits) {
        if ( !ref $limit or ref($limit) ne 'ARRAY' or @$limit != 3 ) {
            croak
                "poorly-formed limit ($limit). should be an array ref of 3 values.";
        }
        $searcher->set_search_limit(@$limit);
    }
    if ($order) {
        $searcher->set_sort($order);
        $swishdb->die_on_error;
    }

    my $results = $searcher->execute("$parsed_query");
    $results->{swish_query}
        = join( ' ', $results->parsed_words( $swishdb->indexes->[0] ) );
    $results->{query} = $parsed_query;
    $swishdb->die_on_error;
    $results->seek_result($start);
    return $results;
}

1;

__END__