Audio::Gramofile - Perl interface to libgramofile, a library derived from Gramofile


Audio-Gramofile documentation Contained in the Audio-Gramofile distribution.

Index


Code Index:

NAME

Top

Audio::Gramofile - Perl interface to libgramofile, a library derived from Gramofile

SYNOPSIS

Top

use Audio::Gramofile;

my $gramofile = Audio::Gramofile->new;

$gramofile->set_input_file($wav_file);

$gramofile->set_output_file($out_file);

# track splitting methods

$gramofile->init_tracksplit("make_use_rms" => 1);

$gramofile->split_to_tracks;

# signal processing methods

$gramofile->init_filter_tracks(@filter_list);

$gramofile->init_simple_median_filter("num_samples" => 7);

$gramofile->init_double_median_filter("first_num_samples" => 5);

$gramofile->init_simple_mean_filter("num_samples" => 9);

$gramofile->init_rms_filter("num_samples" => 3);

$gramofile->init_cmf_filter("rms_length" => 9);

$gramofile->init_cmf2_filter("rec_med_len" => 11);

$gramofile->init_cmf3_filter("fft_length" => 8);

$gramofile->init_simple_normalize_filter("normalize_factor" => 25);

$gramofile->use_begin_end_time($begin_time, $end_time);

$gramofile->process_whole_file;

$gramofile->adjust_frames($framesize);

$gramofile->filter_tracks;

ABSTRACT

Top

This module provides a Perl interface to Gramofile, a program for recording gramophone records. It is able to record hours of CD quality music, split long sound files in separate tracks, and remove ticks and pops from recordings.

Gramofile was written by Anne Bezemer and Ton Le.

Gramofile is available from http://www.opensourcepartners.nl/~costar/gramofile/

libgramofile - a library derived from Gramofile is available from http://sourceforge.net/projects/libgramofile

DESCRIPTION

Top

new

returns an object initialised with the parameters specified in the original C code.

e.g. my $gramofile = Audio::Gramofile->new;

set_input_file

sets the input .wav file for track splitting and signal processing methods.

e.g. $gramofile->set_input_file($wav_file);

track splitting methods

init_tracksplit

The following elements may be initialised by this method. All are used to modify the track splitting algorithm.

e.g. $gramofile->init_tracksplit("make_use_rms" => 1, "min_silence_blocks" => 10);

split_to_tracks

The input file is split into a number of tracks. A file with name new.wav will be split into tracks called new01.wav, new02.wav etc.

e.g. $gramofile->split_to_tracks;

signal processing methods

init_filter_tracks

Any, or all, of the following filters may be specified:

The filters are applied in the order given in the list.

e.g. $gramofile->init_filter_tracks("rms_filter", "simple_mean_filter");

by default the cond_median2_filter is used.

set_output_file

sets the output .wav file name for the signal processing method, filter_tracks.

e.g. $gramofile->set_output_file($out_file);

init_simple_median_filter

This method allows the parameters to be set for the simple median filter.

The following elements may be set for this filter:

e.g. $gramofile->init_simple_median_filter("num_samples" => 7);

init_double_median_filter

This method allows the parameters to be set for the simple median filter.

The following elements may be set for this filter:

e.g. $gramofile->init_double_median_filter("first_num_samples" => 5);

init_simple_mean_filter

This method allows the parameters to be set for the simple mean filter.

The following elements may be set for this filter:

e.g. $gramofile->init_simple_mean_filter("num_samples" => 9);

rms_filter

This method allows the parameters to be set for the rms filter.

The following elements may be set for this filter:

e.g. $gramofile->init_rms_filter("num_samples" => 3);

init_cmf_filter

This method allows the parameters to be set for the conditional mean filter.

The following elements may be set for this filter:

e.g. $gramofile->init_cmf_filter("rms_length" => 9, "tick_threshold" => 3000);

init_cmf2_filter

This method allows the parameters to be set for the second conditional mean filter.

The following elements may be set for this filter:

e.g. $gramofile->init_cmf2_filter("rec_med_len" => 11, "fine_threshold" => 2500);

init_cmf3_filter

This method allows the parameters to be set for the second conditional mean (frequency domain using fft) filter.

The following elements may be set for this filter:

e.g. $gramofile->init_cmf3_filter("fft_length" => 8, "fine_threshold" => 2500);

init_simple_normalize_filter

This method allows the parameters to be set for the simple normalize filter.

The following elements may be set for this filter:

e.g. $gramofile->init_simple_normalize_filter("normalize_factor" => 50);

use_begin_end_time

A begin time and end time can be specified. These times will be used instead of the track times derived from the split_to_tracks method.

e.g. $gramofile->use_begin_end_time($begin_time, $end_time);

process_whole_file

The whole file can be passed through the signal processing routines if this method is used.

e.g. $gramofile->process_whole_file;

adjust_frames

The default frame size is 588 (1/75 sec. @ 44.1 khz). This method allows this value to be user-defined.

e.g. $gramofile->adjust_frames($framesize);

filter_tracks

This method filters the tracks with the previously specified (or default) parameters.

e.g. $gramofile->filter_tracks;

EXPORT

Top

None by default.

SEE ALSO

Top

Gramofile : available from http://www.opensourcepartners.nl/~costar/gramofile/

libgramofile : A dynamically linked library derived from Gramofile, which this module needs, available from http://sourceforge.net/projects/libgramofile

fftw - the fastest Fourier Transform in the west : available from http://www.fftw.org

Signproc.txt, Tracksplit.txt, Tracksplit2.txt supplied with the Gramofile source code.

AUTHOR

Top

Bob Wilkinson, <bob@fourtheye.org>

COPYRIGHT AND LICENSE

Top


Audio-Gramofile documentation Contained in the Audio-Gramofile distribution.

package Audio::Gramofile;

use 5.006;
use strict;

use Carp;

use constant SECONDS_PER_HOUR => 3600;
use constant SECONDS_PER_MINUTE => 60;

require Exporter;
require DynaLoader;
use vars qw($VERSION @ISA);
@ISA = qw(Exporter
	DynaLoader);

$VERSION = '0.08';

bootstrap Audio::Gramofile $VERSION;

sub new {
  my $proto = shift;
  my $class = ref($proto) || $proto;
  my $self = {};
  bless $self, $class;

  $self->_init;
  $self;
}

sub set_input_file {
  my $self = shift;
  my $file = shift;

  croak "Need an input file, $!" unless (defined $file);
  $self->{input_file} = $file;
}

sub set_output_file {
  my $self = shift;
  my $file = shift;

  croak "Need an output file, $!" unless (defined $file);
  $self->{output_file} = $file;
}

sub _init {
  my $self = shift;

  $self->default_tracksplit;
  $self->default_filter_tracks;
  $self->default_simple_median_filter;
  $self->default_double_median_filter;
  $self->default_simple_mean_filter;
  $self->default_rms_filter;
  $self->default_cmf_filter;
  $self->default_cmf2_filter;
  $self->default_cmf3_filter;
  $self->default_simple_normalize_filter;
  $self->{usetracktimes} = 1;
}

sub init_tracksplit {
  my $self = shift;

  my %hash = (@_);
  my $make_use_rms = delete $hash{make_use_rms};
  my $make_graphs = delete $hash{make_graphs};
  my $blocklen = delete $hash{blocklen};
  my $global_silence_factor = delete $hash{global_silence_factor};
  my $local_silence_threshold = delete $hash{local_silence_threshold};
  my $min_silence_blocks = delete $hash{min_silence_blocks};
  my $min_track_blocks = delete $hash{min_track_blocks};
  my $extra_blocks_start = delete $hash{extra_blocks_start};
  my $extra_blocks_end = delete $hash{extra_blocks_end};
  croak "BAD ELEMENT in TRACKSPLIT HASH" if %hash;

  $self->{tracksplit}->{make_use_rms} = $make_use_rms if (defined $make_use_rms);
  $self->{tracksplit}->{make_graphs} = $make_graphs if (defined $make_graphs);
  $self->{tracksplit}->{blocklen} = $blocklen if (defined $blocklen);
  $self->{tracksplit}->{global_silence_factor} = $global_silence_factor if (defined $global_silence_factor);
  $self->{tracksplit}->{local_silence_threshold} = $local_silence_threshold if (defined $local_silence_threshold);
  $self->{tracksplit}->{min_silence_blocks} = $min_silence_blocks if (defined $min_silence_blocks);
  $self->{tracksplit}->{min_track_blocks} = $min_track_blocks if (defined $min_track_blocks);
  $self->{tracksplit}->{extra_blocks_start} = $extra_blocks_start if (defined $extra_blocks_start);
  $self->{tracksplit}->{extra_blocks_end} = $extra_blocks_end if (defined $extra_blocks_end);
}

sub default_tracksplit {
  shift->init_tracksplit(
    make_use_rms => 1,
    make_graphs => 0,
    blocklen => 4410,
    global_silence_factor => 150,
    local_silence_threshold => 5,
    min_silence_blocks => 20,
    min_track_blocks => 50,
    extra_blocks_start => 3,
    extra_blocks_end => 6,
  );
}

sub init_filter_tracks {
  my $self = shift;
  my $filter_ptr = @_ ? shift : undef;
  return unless (defined $filter_ptr);

  unless (ref $filter_ptr) {
    my %filters_id = (
      simple_median_filter    => 1,
      simple_mean_filter      => 2,
      cond_median_filter      => 3,
      double_median_filter    => 4,
      cond_median2_filter     => 5,
      rms_filter              => 6,
      copyonly_filter         => 7,
      monoize_filter          => 8,
      cond_median3_filter     => 9,
      simple_normalize_filter => 10,
      experiment_filter       => 11,
    );
    my @name_list = ($filter_ptr, @_); # unshift the element we shifted to test
    my @num_list;
    foreach my $filter (@name_list) {
      croak "Invalid filter name, $filter, $!" unless (defined $filters_id{$filter});
      push @num_list, $filters_id{$filter};
    }
    $filter_ptr = \@num_list;
  }

  $self->{filter_num} = @$filter_ptr;
  $self->{filter_ptr} = $filter_ptr;
}

sub default_filter_tracks {
  shift->init_filter_tracks([ 5 ]);
}

sub init_simple_median_filter {
  my $self = shift;

  my %hash = (@_);
  my $num_samples = delete $hash{num_samples};
  croak "BAD ELEMENT in INIT_SIMPLE_MEDIAN_FILTER HASH" if %hash;

  _odd_error_check("simple_median_num_samples", $num_samples);
  $self->{simple_median}->{num_samples} = $num_samples if (defined $num_samples);
}

sub default_simple_median_filter {
  shift->init_simple_median_filter(num_samples => 3);
}

sub init_double_median_filter {
  my $self = shift;
  my %hash = (@_);
  my $first_num_samples = delete $hash{first_num_samples};
  my $second_num_samples = delete $hash{second_num_samples};
  croak "BAD ELEMENT in INIT_DOUBLE_MEDIAN_FILTER HASH" if %hash;

  _odd_error_check("double_median_first_num_samples", $first_num_samples);
  _odd_error_check("double_median_second_num_samples", $second_num_samples);

  $self->{double_median}->{first_num_samples} = $first_num_samples if (defined $first_num_samples);
  $self->{double_median}->{second_num_samples} = $second_num_samples if (defined $second_num_samples);
}

sub default_double_median_filter {
  shift->init_double_median_filter(
    first_num_samples => 5,
    second_num_samples => 5,
  );
}

sub init_simple_mean_filter {
  my $self = shift;
  my %hash = (@_);
  my $num_samples = delete $hash{num_samples};
  croak "BAD ELEMENT in INIT_SIMPLE_MEAN_FILTER HASH" if %hash;

  _odd_error_check("simple_mean_num_samples", $num_samples);

  $self->{simple_mean}->{num_samples} = $num_samples if (defined $num_samples);
}

sub default_simple_mean_filter {
  shift->init_simple_mean_filter(num_samples => 3);
}

sub init_rms_filter {
  my $self = shift;
  my %hash = (@_);
  my $num_samples = delete $hash{num_samples};
  croak "BAD ELEMENT in INIT_RMS_FILTER HASH" if %hash;

  _odd_error_check("rms_filter_num_samples", $num_samples);

  $self->{rms}->{num_samples} = $num_samples if (defined $num_samples);
}

sub default_rms_filter {
  shift->init_rms_filter(num_samples => 3);
}

sub init_cmf_filter {
  my $self = shift;
  my %hash = (@_);
  my $num_samples = delete $hash{num_samples};
  my $rms_length = delete $hash{rms_length};
  my $rec_med_len = delete $hash{rec_med_len};
  my $rec_med_dec = delete $hash{rec_med_dec};
  my $tick_threshold = delete $hash{tick_threshold};
  croak "BAD ELEMENT in INIT_CMF_FILTER HASH" if %hash;

  _odd_error_check("cmf_median_tick_num_samples", $num_samples);
  _odd_error_check("cmf_rms_length", $rms_length);
  _odd_error_check("cmf_recursive_median_length", $rec_med_len);
  _error_check("cmf_decimation_factor", $rec_med_dec, 0);
  _error_check("cmf_tick_detection_threshold", $tick_threshold, 999);

  $self->{cmf}->{num_samples} = $num_samples if (defined $num_samples);
  $self->{cmf}->{rms_length} = $rms_length if (defined $rms_length);
  $self->{cmf}->{rec_med_len} = $rec_med_len if (defined $rec_med_len);
  $self->{cmf}->{rec_med_dec} = $rec_med_dec if (defined $rec_med_dec);
  $self->{cmf}->{tick_threshold} = $tick_threshold if (defined $tick_threshold);
}

sub default_cmf_filter {
  shift->init_cmf_filter(
    num_samples => 21,
    rms_length => 9,
    rec_med_len => 11,
    rec_med_dec => 5,
    tick_threshold => 2500,
  );
}

sub init_cmf2_filter {
  my $self = shift;
  my %hash = (@_);
  my $rms_length = delete $hash{rms_length};
  my $rec_med_len = delete $hash{rec_med_len};
  my $rec_med_dec = delete $hash{rec_med_dec};
  my $fine_threshold = delete $hash{fine_threshold};
  my $tick_threshold = delete $hash{tick_threshold};
  croak "BAD ELEMENT in INIT_CMF2_FILTER HASH" if %hash;

  _odd_error_check("cmf2_rms_length", $rms_length);
  _odd_error_check("cmf2_recursive_median_length", $rec_med_len);
  _error_check("cmf2_decimation_factor", $rec_med_dec, 0);
  _error_check("cmf2_tick_fine_threshold", $fine_threshold, 0);
  _error_check("cmf2_tick_detection_threshold", $tick_threshold, 999);

  $self->{cmf2}->{rms_length} = $rms_length if (defined $rms_length);
  $self->{cmf2}->{rec_med_len} = $rec_med_len if (defined $rec_med_len);
  $self->{cmf2}->{rec_med_dec} = $rec_med_dec if (defined $rec_med_dec);
  $self->{cmf2}->{fine_threshold} = $fine_threshold if (defined $fine_threshold);
  $self->{cmf2}->{tick_threshold} = $tick_threshold if (defined $tick_threshold);
}

sub default_cmf2_filter {
  shift->init_cmf2_filter(
    rms_length => 9,
    rec_med_len => 11,
    rec_med_dec => 12,
    fine_threshold => 2000,
    tick_threshold => 8500,
  );
}

sub init_cmf3_filter {
  my $self = shift;
  my %hash = (@_);
  my $rms_length = delete $hash{rms_length};
  my $rec_med_len = delete $hash{rec_med_len};
  my $rec_med_dec = delete $hash{rec_med_dec};
  my $fine_threshold = delete $hash{fine_threshold};
  my $tick_threshold = delete $hash{tick_threshold};
  my $fft_length = delete $hash{fft_length};
  croak "BAD ELEMENT in INIT_CMF3_FILTER HASH" if %hash;

  _odd_error_check("cmf3_rms_length", $rms_length);
  _odd_error_check("cmf3_recursive_median_length", $rec_med_len);
  _error_check("cmf3_decimation_factor", $rec_med_dec, 0);
  _error_check("cmf3_tick_fine_threshold", $fine_threshold, 0);
  _error_check("cmf3_tick_detection_threshold", $tick_threshold, 999);
  _error_check("cmf3_fft_length", $fft_length, 5, 13);

  $self->{cmf3}->{rms_length} = $rms_length if (defined $rms_length);
  $self->{cmf3}->{rec_med_len} = $rec_med_len if (defined $rec_med_len);
  $self->{cmf3}->{rec_med_dec} = $rec_med_dec if (defined $rec_med_dec);
  $self->{cmf3}->{fine_threshold} = $fine_threshold if (defined $fine_threshold);
  $self->{cmf3}->{tick_threshold} = $tick_threshold if (defined $tick_threshold);
  $self->{cmf3}->{fft_length} = $fft_length if (defined $fft_length);
}

sub default_cmf3_filter {
  shift->init_cmf3_filter(
    rms_length => 9,
    rec_med_len => 11,
    rec_med_dec => 12,
    fine_threshold => 2000,
    tick_threshold => 8500,
    fft_length => 9,
  );
}

sub init_simple_normalize_filter {
  my $self = shift;
  my %hash = (@_);
  my $normalize_factor = delete $hash{normalize_factor};
  croak "BAD ELEMENT in INIT_SIMPLE_NORMALIZE_FILTER HASH" if %hash;

  _error_check("simple_normalize_factor", $normalize_factor, -1, 101);
  $self->{simple_normalize}->{normalize_factor} = $normalize_factor if (defined $normalize_factor);
}

sub default_simple_normalize_filter {
  shift->init_simple_normalize_filter(normalize_factor => 0);
}

sub split_to_tracks {
  my $self = shift;

  croak "Input .wav file needs to be set, $!" unless (defined $self->{input_file});

  Audio::Gramofile::tracksplit_main($self->{input_file},
                                    $self->{tracksplit}->{make_use_rms},
                                    $self->{tracksplit}->{make_graphs},
                                    $self->{tracksplit}->{blocklen},
                                    $self->{tracksplit}->{global_silence_factor},
                                    $self->{tracksplit}->{local_silence_threshold},
                                    $self->{tracksplit}->{min_silence_blocks},
                                    $self->{tracksplit}->{min_track_blocks},
                                    $self->{tracksplit}->{extra_blocks_start},
                                    $self->{tracksplit}->{extra_blocks_end})
}

sub use_begin_end_time {
  my $self = shift;
  my $begin_time = shift;
  my $end_time = shift;

  $self->{begin_time} = _to_seconds($begin_time);
  $self->{end_time} = _to_seconds($end_time);
  $self->{usebeginendtime} = 1;
  $self->{usetracktimes} = 0;
}

sub process_whole_file {
  my $self = shift;

  $self->{usebeginendtime} = 0;
  $self->{usetracktimes} = 0;
}

sub adjust_frames {
  my $self = shift;
  my $framesize = shift;

  $self->{adjustframes} = 1;
  $self->{framesize} = $framesize;
}

sub filter_tracks {
  my $self = shift;

  croak "Input .wav file needs to be set, $!" unless (defined $self->{input_file});
  croak "Output .wav file needs to be set, $!" unless (defined $self->{output_file});
  croak "List of filters needs to be set, $!" unless (defined $self->{filter_ptr});

  my $simple_median_num_samples = $self->{simple_median}->{num_samples};
  my $double_median_init_params_ptr = [ $self->{double_median}->{first_num_samples},
                                        $self->{double_median}->{second_num_samples} ];
  my $simple_mean_num_samples = $self->{simple_mean}->{num_samples};
  my $rms_filter_num_samples = $self->{rms}->{num_samples};
  my $cmf_init_params_ptr = [ $self->{cmf}->{num_samples},
                              $self->{cmf}->{rms_length},
                              $self->{cmf}->{rec_med_len},
                              $self->{cmf}->{rec_med_dec},
                              $self->{cmf}->{tick_threshold} ];
  my $cmf2_init_params_ptr = [ $self->{cmf2}->{rms_length},
                               $self->{cmf2}->{rec_med_len},
                               $self->{cmf2}->{rec_med_dec},
                               $self->{cmf2}->{fine_threshold},
                               $self->{cmf2}->{tick_threshold} ];
  my $cmf3_init_params_ptr = [ $self->{cmf3}->{rms_length},
                               $self->{cmf3}->{rec_med_len},
                               $self->{cmf3}->{rec_med_dec},
                               $self->{cmf3}->{fine_threshold},
                               $self->{cmf3}->{tick_threshold},
			       $self->{cmf3}->{fft_length} ];
  my $simple_normalize_factor = $self->{simple_normalize}->{normalize_factor};

  my $usebeginendtime = (defined $self->{usebeginendtime}) ? $self->{usebeginendtime} : 0;
  my $usetracktimes = (defined $self->{usetracktimes}) ? $self->{usetracktimes} : 0;
  my $begintime = (defined $self->{begin_time}) ? $self->{begin_time} : 0.0;
  my $endtime = (defined $self->{end_time}) ? $self->{end_time} : 0.0;
  my $adjustframes = (defined $self->{adjustframes}) ? $self->{adjustframes} : 0;
  my $framesize = (defined $self->{framesize}) ? $self->{framesize} : 588;

  Audio::Gramofile::signproc_main($self->{input_file},
                                  $self->{output_file},
                                  $self->{filter_num},
                                  $self->{filter_ptr},
                                  $simple_median_num_samples,
                                  $double_median_init_params_ptr,
                                  $simple_mean_num_samples,
                                  $rms_filter_num_samples,
                                  $cmf_init_params_ptr,
                                  $cmf2_init_params_ptr,
                                  $cmf3_init_params_ptr,
                                  $simple_normalize_factor,
				  $usebeginendtime,
				  $usetracktimes,
				  $begintime,
				  $endtime,
				  $adjustframes,
				  $framesize);
}

sub _odd_error_check {
  my $name = shift;
  my $value = shift;

  return unless (defined $value);
  croak "Param $name (value is $value) needs to be odd and greater than 0"
    unless (($value > 0) and ($value % 2));
}

sub _error_check {
  my $name = shift;
  my $value = shift;
  my $min = shift;
  my $max = @_ ? shift : undef;

  return unless (defined $value);
  croak "Param $name (value is $value) needs to be greater than $min" unless ($value > $min);
  if (defined $max) {
    croak "Param $name (value is $value) needs to be less than $max" unless ($value < $max);
  }
}

sub _to_seconds {
  my $time_string = shift;

# converts hh:mm:ss.sss, or mm:ss.sss to seconds

  $time_string =~ s/^://;
  my $colons = $time_string =~ tr/:/:/;

  if ($colons == 2) {
    my ($hours, $minutes, $seconds) = split(/:/, $time_string);
    return SECONDS_PER_HOUR*$hours + SECONDS_PER_MINUTE*$minutes + $seconds;
  } elsif ($colons == 1) {
    my ($minutes, $seconds) = split(/:/, $time_string);
    return SECONDS_PER_MINUTE*$minutes + $seconds;
  }
  return $time_string;
}

1;