Bio::DOOP::ClusterSubset - One subset of a cluster


Bio-DOOP-DOOP documentation  | view source Contained in the Bio-DOOP-DOOP distribution.

Index


NAME

Top

Bio::DOOP::ClusterSubset - One subset of a cluster

VERSION

Top

Version 0.13

SYNOPSIS

Top

  @cluster_subsets = @{$cluster->get_all_subsets};




DESCRIPTION

Top

This object represents one subset of a cluster. A subset is a set of homologous sequences, hopefully monophyletic, grouped by evolutionary distance from the reference species (Arabidopsis or human).

AUTHORS

Top

Tibor Nagy, Godollo, Hungary and Endre Sebestyen, Martonvasar, Hungary

METHODS

Top

new

Creates a new subset object from the subset primary id. You usually won't need this, as you will create the subsets from a Bio::DOOP::Cluster object, based on the subset type.

Return type: Bio::DOOP::ClusterSubset object

  $cluster_subset = Bio::DOOP::ClusterSubset->new($db,"123");

get_id

Prints out the subset primary id. This is the internal ID from the MySQL database.

Return type: string

  print $cluster_subset->get_id;

get_type

Prints out the subset type.

Return type: string

  print $cluster_subset->get_type;

get_seqno

Returns the number of sequences in the subset.

Return type: string

  for(i = 0; i < $cluster_subset->get_seqno; i++){
      print $seq[$i];
  }

get_featno

Returns the total number of features (motifs, TSSs and other) in the subset.

Return type: string

  if ($cluster_subset->get_featno > 4){
      print "We have lots of features!!!\n";
  }

get_motifno

Returns the number of motifs in the subset.

Return type: string

  $motifs = $cluster_subset->get_motifno;

get_orig

Returns 'y' if the subset is the same as the original cluster, 'n' if not.

Return type: string ('y' or 'n')

  if ($cluster_subset->get_orig eq "y") {
      print "This is the original cluster!\n";
  }
  elsif ($cluster_subset->get_orig eq "n"){
      print "This is some smaller subset!\n";
  }

get_cluster

Returns the ID of the cluster, from which the subset originates.

Return type: string

  $cluster_id = $cluster_subset->get_cluster;

get_dialign

Prints out the dialign format alignment of the subset.

Return type: string

  print $cluster_subset->get_dialign;

get_fasta_align

Prints out the fasta format alignment of the subset.

Return type: string

  print $cluster_subset->get_fasta_align;

get_all_motifs

Returns the arrayref of all motifs associated with the subset.

Return type: arrayref, the array containig Bio::DOOP::Motif objects

  @motifs = @{$cluster_subset->get_all_motifs};

get_all_seqs

Returns a sorted arrayref of all sequences associated with the subset.

Sorting the sequences by the following criteria: The first sequence is always the reference species (Arabidopsis/Human). All other sequences are sorted first by the taxon_class (B E M V in the plants and P R E H M N T F V C in the chordates ) and then by the alphabetical order.

Return type: arrayref, the array containig Bio::DOOP::Sequence objects

  @seq = @{$cluster_subset->get_all_seqs};


Bio-DOOP-DOOP documentation  | view source Contained in the Bio-DOOP-DOOP distribution.