| Audio-Gramofile documentation | Contained in the Audio-Gramofile distribution. |
Audio::Gramofile - Perl interface to libgramofile, a library derived from Gramofile
use Audio::Gramofile;
my $gramofile = Audio::Gramofile->new;
$gramofile->set_input_file($wav_file);
$gramofile->set_output_file($out_file);
# track splitting methods
$gramofile->init_tracksplit("make_use_rms" => 1);
$gramofile->split_to_tracks;
# signal processing methods
$gramofile->init_filter_tracks(@filter_list);
$gramofile->init_simple_median_filter("num_samples" => 7);
$gramofile->init_double_median_filter("first_num_samples" => 5);
$gramofile->init_simple_mean_filter("num_samples" => 9);
$gramofile->init_rms_filter("num_samples" => 3);
$gramofile->init_cmf_filter("rms_length" => 9);
$gramofile->init_cmf2_filter("rec_med_len" => 11);
$gramofile->init_cmf3_filter("fft_length" => 8);
$gramofile->init_simple_normalize_filter("normalize_factor" => 25);
$gramofile->use_begin_end_time($begin_time, $end_time);
$gramofile->process_whole_file;
$gramofile->adjust_frames($framesize);
$gramofile->filter_tracks;
This module provides a Perl interface to Gramofile, a program for recording gramophone records. It is able to record hours of CD quality music, split long sound files in separate tracks, and remove ticks and pops from recordings.
Gramofile was written by Anne Bezemer and Ton Le.
Gramofile is available from http://www.opensourcepartners.nl/~costar/gramofile/
libgramofile - a library derived from Gramofile is available from http://sourceforge.net/projects/libgramofile
returns an object initialised with the parameters specified in the original C code.
e.g. my $gramofile = Audio::Gramofile->new;
sets the input .wav file for track splitting and signal processing methods.
e.g. $gramofile->set_input_file($wav_file);
The following elements may be initialised by this method. All are used to modify the track splitting algorithm.
make_use_rms # Save/load signal power (RMS) data to/from .rms file
make_graphs # Generate graph files
blocklen # Length of blocks of signal power data (samples)
global_silence_factor # Global silence factor (0.1 %)
local_silence_threshold # Local silence factor (%)
min_silence_blocks # Minimal length of inter-track silence (blocks)
min_track_blocks # Minimal length of tracks (blocks)
extra_blocks_start # Number of extra blocks at track start
extra_blocks_end # Number of extra blocks at track end
e.g. $gramofile->init_tracksplit("make_use_rms" => 1, "min_silence_blocks" => 10);
The input file is split into a number of tracks. A file with name new.wav will be split into tracks called new01.wav, new02.wav etc.
e.g. $gramofile->split_to_tracks;
Any, or all, of the following filters may be specified:
The filters are applied in the order given in the list.
e.g. $gramofile->init_filter_tracks("rms_filter", "simple_mean_filter");
by default the cond_median2_filter is used.
sets the output .wav file name for the signal processing method, filter_tracks.
e.g. $gramofile->set_output_file($out_file);
This method allows the parameters to be set for the simple median filter.
The following elements may be set for this filter:
e.g. $gramofile->init_simple_median_filter("num_samples" => 7);
This method allows the parameters to be set for the simple median filter.
The following elements may be set for this filter:
e.g. $gramofile->init_double_median_filter("first_num_samples" => 5);
This method allows the parameters to be set for the simple mean filter.
The following elements may be set for this filter:
e.g. $gramofile->init_simple_mean_filter("num_samples" => 9);
This method allows the parameters to be set for the rms filter.
The following elements may be set for this filter:
e.g. $gramofile->init_rms_filter("num_samples" => 3);
This method allows the parameters to be set for the conditional mean filter.
The following elements may be set for this filter:
e.g. $gramofile->init_cmf_filter("rms_length" => 9, "tick_threshold" => 3000);
This method allows the parameters to be set for the second conditional mean filter.
The following elements may be set for this filter:
e.g. $gramofile->init_cmf2_filter("rec_med_len" => 11, "fine_threshold" => 2500);
This method allows the parameters to be set for the second conditional mean (frequency domain using fft) filter.
The following elements may be set for this filter:
e.g. $gramofile->init_cmf3_filter("fft_length" => 8, "fine_threshold" => 2500);
This method allows the parameters to be set for the simple normalize filter.
The following elements may be set for this filter:
e.g. $gramofile->init_simple_normalize_filter("normalize_factor" => 50);
A begin time and end time can be specified. These times will be used instead of the track times derived from the split_to_tracks method.
e.g. $gramofile->use_begin_end_time($begin_time, $end_time);
The whole file can be passed through the signal processing routines if this method is used.
e.g. $gramofile->process_whole_file;
The default frame size is 588 (1/75 sec. @ 44.1 khz). This method allows this value to be user-defined.
e.g. $gramofile->adjust_frames($framesize);
This method filters the tracks with the previously specified (or default) parameters.
e.g. $gramofile->filter_tracks;
None by default.
Gramofile : available from http://www.opensourcepartners.nl/~costar/gramofile/
libgramofile : A dynamically linked library derived from Gramofile, which this module needs, available from http://sourceforge.net/projects/libgramofile
fftw - the fastest Fourier Transform in the west : available from http://www.fftw.org
Signproc.txt, Tracksplit.txt, Tracksplit2.txt supplied with the Gramofile source code.
Bob Wilkinson, <bob@fourtheye.org>
Copyright 2003-2005 by Bob Wilkinson
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
| Audio-Gramofile documentation | Contained in the Audio-Gramofile distribution. |
package Audio::Gramofile; use 5.006; use strict; use Carp; use constant SECONDS_PER_HOUR => 3600; use constant SECONDS_PER_MINUTE => 60; require Exporter; require DynaLoader; use vars qw($VERSION @ISA); @ISA = qw(Exporter DynaLoader); $VERSION = '0.08'; bootstrap Audio::Gramofile $VERSION;
sub new { my $proto = shift; my $class = ref($proto) || $proto; my $self = {}; bless $self, $class; $self->_init; $self; } sub set_input_file { my $self = shift; my $file = shift; croak "Need an input file, $!" unless (defined $file); $self->{input_file} = $file; } sub set_output_file { my $self = shift; my $file = shift; croak "Need an output file, $!" unless (defined $file); $self->{output_file} = $file; } sub _init { my $self = shift; $self->default_tracksplit; $self->default_filter_tracks; $self->default_simple_median_filter; $self->default_double_median_filter; $self->default_simple_mean_filter; $self->default_rms_filter; $self->default_cmf_filter; $self->default_cmf2_filter; $self->default_cmf3_filter; $self->default_simple_normalize_filter; $self->{usetracktimes} = 1; } sub init_tracksplit { my $self = shift; my %hash = (@_); my $make_use_rms = delete $hash{make_use_rms}; my $make_graphs = delete $hash{make_graphs}; my $blocklen = delete $hash{blocklen}; my $global_silence_factor = delete $hash{global_silence_factor}; my $local_silence_threshold = delete $hash{local_silence_threshold}; my $min_silence_blocks = delete $hash{min_silence_blocks}; my $min_track_blocks = delete $hash{min_track_blocks}; my $extra_blocks_start = delete $hash{extra_blocks_start}; my $extra_blocks_end = delete $hash{extra_blocks_end}; croak "BAD ELEMENT in TRACKSPLIT HASH" if %hash; $self->{tracksplit}->{make_use_rms} = $make_use_rms if (defined $make_use_rms); $self->{tracksplit}->{make_graphs} = $make_graphs if (defined $make_graphs); $self->{tracksplit}->{blocklen} = $blocklen if (defined $blocklen); $self->{tracksplit}->{global_silence_factor} = $global_silence_factor if (defined $global_silence_factor); $self->{tracksplit}->{local_silence_threshold} = $local_silence_threshold if (defined $local_silence_threshold); $self->{tracksplit}->{min_silence_blocks} = $min_silence_blocks if (defined $min_silence_blocks); $self->{tracksplit}->{min_track_blocks} = $min_track_blocks if (defined $min_track_blocks); $self->{tracksplit}->{extra_blocks_start} = $extra_blocks_start if (defined $extra_blocks_start); $self->{tracksplit}->{extra_blocks_end} = $extra_blocks_end if (defined $extra_blocks_end); } sub default_tracksplit { shift->init_tracksplit( make_use_rms => 1, make_graphs => 0, blocklen => 4410, global_silence_factor => 150, local_silence_threshold => 5, min_silence_blocks => 20, min_track_blocks => 50, extra_blocks_start => 3, extra_blocks_end => 6, ); } sub init_filter_tracks { my $self = shift; my $filter_ptr = @_ ? shift : undef; return unless (defined $filter_ptr); unless (ref $filter_ptr) { my %filters_id = ( simple_median_filter => 1, simple_mean_filter => 2, cond_median_filter => 3, double_median_filter => 4, cond_median2_filter => 5, rms_filter => 6, copyonly_filter => 7, monoize_filter => 8, cond_median3_filter => 9, simple_normalize_filter => 10, experiment_filter => 11, ); my @name_list = ($filter_ptr, @_); # unshift the element we shifted to test my @num_list; foreach my $filter (@name_list) { croak "Invalid filter name, $filter, $!" unless (defined $filters_id{$filter}); push @num_list, $filters_id{$filter}; } $filter_ptr = \@num_list; } $self->{filter_num} = @$filter_ptr; $self->{filter_ptr} = $filter_ptr; } sub default_filter_tracks { shift->init_filter_tracks([ 5 ]); } sub init_simple_median_filter { my $self = shift; my %hash = (@_); my $num_samples = delete $hash{num_samples}; croak "BAD ELEMENT in INIT_SIMPLE_MEDIAN_FILTER HASH" if %hash; _odd_error_check("simple_median_num_samples", $num_samples); $self->{simple_median}->{num_samples} = $num_samples if (defined $num_samples); } sub default_simple_median_filter { shift->init_simple_median_filter(num_samples => 3); } sub init_double_median_filter { my $self = shift; my %hash = (@_); my $first_num_samples = delete $hash{first_num_samples}; my $second_num_samples = delete $hash{second_num_samples}; croak "BAD ELEMENT in INIT_DOUBLE_MEDIAN_FILTER HASH" if %hash; _odd_error_check("double_median_first_num_samples", $first_num_samples); _odd_error_check("double_median_second_num_samples", $second_num_samples); $self->{double_median}->{first_num_samples} = $first_num_samples if (defined $first_num_samples); $self->{double_median}->{second_num_samples} = $second_num_samples if (defined $second_num_samples); } sub default_double_median_filter { shift->init_double_median_filter( first_num_samples => 5, second_num_samples => 5, ); } sub init_simple_mean_filter { my $self = shift; my %hash = (@_); my $num_samples = delete $hash{num_samples}; croak "BAD ELEMENT in INIT_SIMPLE_MEAN_FILTER HASH" if %hash; _odd_error_check("simple_mean_num_samples", $num_samples); $self->{simple_mean}->{num_samples} = $num_samples if (defined $num_samples); } sub default_simple_mean_filter { shift->init_simple_mean_filter(num_samples => 3); } sub init_rms_filter { my $self = shift; my %hash = (@_); my $num_samples = delete $hash{num_samples}; croak "BAD ELEMENT in INIT_RMS_FILTER HASH" if %hash; _odd_error_check("rms_filter_num_samples", $num_samples); $self->{rms}->{num_samples} = $num_samples if (defined $num_samples); } sub default_rms_filter { shift->init_rms_filter(num_samples => 3); } sub init_cmf_filter { my $self = shift; my %hash = (@_); my $num_samples = delete $hash{num_samples}; my $rms_length = delete $hash{rms_length}; my $rec_med_len = delete $hash{rec_med_len}; my $rec_med_dec = delete $hash{rec_med_dec}; my $tick_threshold = delete $hash{tick_threshold}; croak "BAD ELEMENT in INIT_CMF_FILTER HASH" if %hash; _odd_error_check("cmf_median_tick_num_samples", $num_samples); _odd_error_check("cmf_rms_length", $rms_length); _odd_error_check("cmf_recursive_median_length", $rec_med_len); _error_check("cmf_decimation_factor", $rec_med_dec, 0); _error_check("cmf_tick_detection_threshold", $tick_threshold, 999); $self->{cmf}->{num_samples} = $num_samples if (defined $num_samples); $self->{cmf}->{rms_length} = $rms_length if (defined $rms_length); $self->{cmf}->{rec_med_len} = $rec_med_len if (defined $rec_med_len); $self->{cmf}->{rec_med_dec} = $rec_med_dec if (defined $rec_med_dec); $self->{cmf}->{tick_threshold} = $tick_threshold if (defined $tick_threshold); } sub default_cmf_filter { shift->init_cmf_filter( num_samples => 21, rms_length => 9, rec_med_len => 11, rec_med_dec => 5, tick_threshold => 2500, ); } sub init_cmf2_filter { my $self = shift; my %hash = (@_); my $rms_length = delete $hash{rms_length}; my $rec_med_len = delete $hash{rec_med_len}; my $rec_med_dec = delete $hash{rec_med_dec}; my $fine_threshold = delete $hash{fine_threshold}; my $tick_threshold = delete $hash{tick_threshold}; croak "BAD ELEMENT in INIT_CMF2_FILTER HASH" if %hash; _odd_error_check("cmf2_rms_length", $rms_length); _odd_error_check("cmf2_recursive_median_length", $rec_med_len); _error_check("cmf2_decimation_factor", $rec_med_dec, 0); _error_check("cmf2_tick_fine_threshold", $fine_threshold, 0); _error_check("cmf2_tick_detection_threshold", $tick_threshold, 999); $self->{cmf2}->{rms_length} = $rms_length if (defined $rms_length); $self->{cmf2}->{rec_med_len} = $rec_med_len if (defined $rec_med_len); $self->{cmf2}->{rec_med_dec} = $rec_med_dec if (defined $rec_med_dec); $self->{cmf2}->{fine_threshold} = $fine_threshold if (defined $fine_threshold); $self->{cmf2}->{tick_threshold} = $tick_threshold if (defined $tick_threshold); } sub default_cmf2_filter { shift->init_cmf2_filter( rms_length => 9, rec_med_len => 11, rec_med_dec => 12, fine_threshold => 2000, tick_threshold => 8500, ); } sub init_cmf3_filter { my $self = shift; my %hash = (@_); my $rms_length = delete $hash{rms_length}; my $rec_med_len = delete $hash{rec_med_len}; my $rec_med_dec = delete $hash{rec_med_dec}; my $fine_threshold = delete $hash{fine_threshold}; my $tick_threshold = delete $hash{tick_threshold}; my $fft_length = delete $hash{fft_length}; croak "BAD ELEMENT in INIT_CMF3_FILTER HASH" if %hash; _odd_error_check("cmf3_rms_length", $rms_length); _odd_error_check("cmf3_recursive_median_length", $rec_med_len); _error_check("cmf3_decimation_factor", $rec_med_dec, 0); _error_check("cmf3_tick_fine_threshold", $fine_threshold, 0); _error_check("cmf3_tick_detection_threshold", $tick_threshold, 999); _error_check("cmf3_fft_length", $fft_length, 5, 13); $self->{cmf3}->{rms_length} = $rms_length if (defined $rms_length); $self->{cmf3}->{rec_med_len} = $rec_med_len if (defined $rec_med_len); $self->{cmf3}->{rec_med_dec} = $rec_med_dec if (defined $rec_med_dec); $self->{cmf3}->{fine_threshold} = $fine_threshold if (defined $fine_threshold); $self->{cmf3}->{tick_threshold} = $tick_threshold if (defined $tick_threshold); $self->{cmf3}->{fft_length} = $fft_length if (defined $fft_length); } sub default_cmf3_filter { shift->init_cmf3_filter( rms_length => 9, rec_med_len => 11, rec_med_dec => 12, fine_threshold => 2000, tick_threshold => 8500, fft_length => 9, ); } sub init_simple_normalize_filter { my $self = shift; my %hash = (@_); my $normalize_factor = delete $hash{normalize_factor}; croak "BAD ELEMENT in INIT_SIMPLE_NORMALIZE_FILTER HASH" if %hash; _error_check("simple_normalize_factor", $normalize_factor, -1, 101); $self->{simple_normalize}->{normalize_factor} = $normalize_factor if (defined $normalize_factor); } sub default_simple_normalize_filter { shift->init_simple_normalize_filter(normalize_factor => 0); } sub split_to_tracks { my $self = shift; croak "Input .wav file needs to be set, $!" unless (defined $self->{input_file}); Audio::Gramofile::tracksplit_main($self->{input_file}, $self->{tracksplit}->{make_use_rms}, $self->{tracksplit}->{make_graphs}, $self->{tracksplit}->{blocklen}, $self->{tracksplit}->{global_silence_factor}, $self->{tracksplit}->{local_silence_threshold}, $self->{tracksplit}->{min_silence_blocks}, $self->{tracksplit}->{min_track_blocks}, $self->{tracksplit}->{extra_blocks_start}, $self->{tracksplit}->{extra_blocks_end}) } sub use_begin_end_time { my $self = shift; my $begin_time = shift; my $end_time = shift; $self->{begin_time} = _to_seconds($begin_time); $self->{end_time} = _to_seconds($end_time); $self->{usebeginendtime} = 1; $self->{usetracktimes} = 0; } sub process_whole_file { my $self = shift; $self->{usebeginendtime} = 0; $self->{usetracktimes} = 0; } sub adjust_frames { my $self = shift; my $framesize = shift; $self->{adjustframes} = 1; $self->{framesize} = $framesize; } sub filter_tracks { my $self = shift; croak "Input .wav file needs to be set, $!" unless (defined $self->{input_file}); croak "Output .wav file needs to be set, $!" unless (defined $self->{output_file}); croak "List of filters needs to be set, $!" unless (defined $self->{filter_ptr}); my $simple_median_num_samples = $self->{simple_median}->{num_samples}; my $double_median_init_params_ptr = [ $self->{double_median}->{first_num_samples}, $self->{double_median}->{second_num_samples} ]; my $simple_mean_num_samples = $self->{simple_mean}->{num_samples}; my $rms_filter_num_samples = $self->{rms}->{num_samples}; my $cmf_init_params_ptr = [ $self->{cmf}->{num_samples}, $self->{cmf}->{rms_length}, $self->{cmf}->{rec_med_len}, $self->{cmf}->{rec_med_dec}, $self->{cmf}->{tick_threshold} ]; my $cmf2_init_params_ptr = [ $self->{cmf2}->{rms_length}, $self->{cmf2}->{rec_med_len}, $self->{cmf2}->{rec_med_dec}, $self->{cmf2}->{fine_threshold}, $self->{cmf2}->{tick_threshold} ]; my $cmf3_init_params_ptr = [ $self->{cmf3}->{rms_length}, $self->{cmf3}->{rec_med_len}, $self->{cmf3}->{rec_med_dec}, $self->{cmf3}->{fine_threshold}, $self->{cmf3}->{tick_threshold}, $self->{cmf3}->{fft_length} ]; my $simple_normalize_factor = $self->{simple_normalize}->{normalize_factor}; my $usebeginendtime = (defined $self->{usebeginendtime}) ? $self->{usebeginendtime} : 0; my $usetracktimes = (defined $self->{usetracktimes}) ? $self->{usetracktimes} : 0; my $begintime = (defined $self->{begin_time}) ? $self->{begin_time} : 0.0; my $endtime = (defined $self->{end_time}) ? $self->{end_time} : 0.0; my $adjustframes = (defined $self->{adjustframes}) ? $self->{adjustframes} : 0; my $framesize = (defined $self->{framesize}) ? $self->{framesize} : 588; Audio::Gramofile::signproc_main($self->{input_file}, $self->{output_file}, $self->{filter_num}, $self->{filter_ptr}, $simple_median_num_samples, $double_median_init_params_ptr, $simple_mean_num_samples, $rms_filter_num_samples, $cmf_init_params_ptr, $cmf2_init_params_ptr, $cmf3_init_params_ptr, $simple_normalize_factor, $usebeginendtime, $usetracktimes, $begintime, $endtime, $adjustframes, $framesize); } sub _odd_error_check { my $name = shift; my $value = shift; return unless (defined $value); croak "Param $name (value is $value) needs to be odd and greater than 0" unless (($value > 0) and ($value % 2)); } sub _error_check { my $name = shift; my $value = shift; my $min = shift; my $max = @_ ? shift : undef; return unless (defined $value); croak "Param $name (value is $value) needs to be greater than $min" unless ($value > $min); if (defined $max) { croak "Param $name (value is $value) needs to be less than $max" unless ($value < $max); } } sub _to_seconds { my $time_string = shift; # converts hh:mm:ss.sss, or mm:ss.sss to seconds $time_string =~ s/^://; my $colons = $time_string =~ tr/:/:/; if ($colons == 2) { my ($hours, $minutes, $seconds) = split(/:/, $time_string); return SECONDS_PER_HOUR*$hours + SECONDS_PER_MINUTE*$minutes + $seconds; } elsif ($colons == 1) { my ($minutes, $seconds) = split(/:/, $time_string); return SECONDS_PER_MINUTE*$minutes + $seconds; } return $time_string; } 1;