| UUID-Tiny documentation | Contained in the UUID-Tiny distribution. |
UUID::Tiny - Pure Perl UUID Support With Functional Interface
Version 1.03
Create version 1, 3, 4 and 5 UUIDs:
use UUID::Tiny;
my $v1_mc_UUID = create_UUID();
my $v3_md5_UUID = create_UUID(UUID_V3, $str);
my $v3_md5_UUID = create_UUID(UUID_V3, UUID_NS_DNS, 'caugustin.de');
my $v4_rand_UUID = create_UUID(UUID_V4);
my $v5_sha1_UUID = create_UUID(UUID_V5, $str);
my $v5_with_NS_UUID = create_UUID(UUID_V5, UUID_NS_DNS, 'caugustin.de');
my $v1_mc_UUID_string = create_UUID_as_string(UUID_V1);
my $v3_md5_UUID_string = UUID_to_string($v3_md5_UUID);
if ( version_of_UUID($v1_mc_UUID) == 1 ) { ... };
if ( version_of_UUID($v5_sha1_UUID) == 5 ) { ... };
if ( is_UUID_string($v1_mc_UUID_string) ) { ... };
if ( equal_UUIDs($uuid1, $uuid2) ) { ... };
my $uuid_time = time_of_UUID($v1_mc_UUID);
my $uuid_clk_seq = clk_seq_of_UUID($v1_mc_UUID);
UUID::Tiny is a lightweight, low dependency Pure Perl module for UUID creation and testing. This module provides the creation of version 1 time based UUIDs (using random multicast MAC addresses), version 3 MD5 based UUIDs, version 4 random UUIDs, and version 5 SHA-1 based UUIDs.
ATTENTION! UUID::Tiny uses Perl's rand() to create the basic random
numbers, so the created v4 UUIDs are not cryptographically strong!
No fancy OO interface, no plethora of different UUID representation formats and transformations - just string and binary. Conversion, test and time functions equally accept UUIDs and UUID strings, so don't bother to convert UUIDs for them!
All constants and public functions are exported by default, because if you didn't need/want them, you wouldn't use this module ...
UUID::Tiny deliberately uses a minimal functional interface for UUID creation (and conversion/testing), because in this case OO looks like overkill to me and makes the creation and use of UUIDs unnecessarily complicated.
If you need raw performance for UUID creation, or the real MAC address in version 1 UUIDs, or an OO interface, and if you can afford module compilation and installation on the target system, then better look at other CPAN UUID modules like Data::UUID.
This module is "fork safe", especially for random UUIDs (it works around Perl's rand() problem when forking processes).
This module should be "thread safe," because its global variables are locked in the functions that access them. (Not tested - if you can provide some tests, please tell me!)
This module should run from Perl 5.8 up and uses mostly standard (5.8 core) modules for its job. No compilation or installation required. These are the modules UUID::Tiny depends on:
Carp
Digest::MD5 Perl 5.8 core
Digest::SHA Perl 5.10 core (or Digest::SHA1, or Digest::SHA::PurePerl)
MIME::Base64 Perl 5.8 core
Time::HiRes Perl 5.8 core
POSIX Perl 5.8 core
If you are using this module on a Perl prior to 5.10 and you don't have Digest::SHA1 installed, you can use Digest::SHA::PurePerl instead.
After some debate I'm convinced that it is more Perlish (and far easier to write) to use all-lowercase function names - without exceptions. And that it is more polite to export symbols only on demand.
While the 1.0x versions will continue to export the old, "legacy" interface on
default, the future standard interface is available using the :std tag on
import from version 1.02 on:
use UUID::Tiny ':std';
my $md5_uuid = create_uuid(UUID_MD5, $str);
In preparation for the upcoming version 2.00 of UUID::Tiny you should use the
:legacy tag if you want to stay with the version 1.0x interface:
use UUID::Tiny ':legacy';
my $md5_uuid = create_UUID(UUID_V3, $str);
This module provides the NIL UUID (shown with its string representation):
UUID_NIL: '00000000-0000-0000-0000-000000000000'
This module provides the common pre-defined namespace UUIDs (shown with their string representation):
UUID_NS_DNS: '6ba7b810-9dad-11d1-80b4-00c04fd430c8'
UUID_NS_URL: '6ba7b811-9dad-11d1-80b4-00c04fd430c8'
UUID_NS_OID: '6ba7b812-9dad-11d1-80b4-00c04fd430c8'
UUID_NS_X500: '6ba7b814-9dad-11d1-80b4-00c04fd430c8'
This module provides the UUID version numbers as constants:
UUID_V1
UUID_V3
UUID_V4
UUID_V5
With use UUID::Tiny ':std'; you get additional, "speaking" constants:
UUID_TIME
UUID_MD5
UUID_RANDOM
UUID_SHA1
my $uuid = create_UUID( UUID_SHA1_AVAIL? UUID_V5 : UUID_V3, $str );
This function returns 1 if a module to create SHA-1 digests could be loaded, 0 otherwise.
UUID::Tiny (since version 1.02) tries to load Digest::SHA, Digest::SHA1 or
Digest::SHA::PurePerl, but does not die if none of them is found. Instead
create_UUID() and create_UUID_as_string() die when trying to create an
SHA-1 based UUID without an appropriate module available.
All public functions are exported by default (they should not collide with other functions).
create_UUID() creates standard binary UUIDs in network byte order
(MSB first), create_UUID_as_string() creates the standard string
represantion of UUIDs.
All query and test functions (except is_UUID_string) accept both
representations.
my $v1_mc_UUID = create_UUID();
my $v1_mc_UUID = create_UUID(UUID_V1);
my $v3_md5_UUID = create_UUID(UUID_V3, $ns_uuid, $name_or_filehandle);
my $v3_md5_UUID = create_UUID(UUID_V3, $name_or_filehandle);
my $v4_rand_UUID = create_UUID(UUID_V4);
my $v5_sha1_UUID = create_UUID(UUID_V5, $ns_uuid, $name_or_filehandle);
my $v5_sha1_UUID = create_UUID(UUID_V5, $name_or_filehandle);
Creates a binary UUID in network byte order (MSB first). For v3 and v5 UUIDs a
SCALAR (normally a string), GLOB ("classic" file handle) or IO object
(i.e. IO::File) can be used; files have to be opened for reading.
I found no hint if and how UUIDs should be created from file content. It seems to be undefined, but it is useful - so I would suggest to use UUID_NIL as the namespace UUID, because no "real name" is used; UUID_NIL is used by default if a namespace UUID is missing (only 2 arguments are used).
Similar to create_UUID, but creates a UUID string.
my $bool = is_UUID_string($str);
my $uuid_str = UUID_to_string($uuid);
This function returns $uuid unchanged if it is a UUID string already.
my $uuid = string_to_UUID($uuid_str);
This function returns $uuid_str unchanged if it is a UUID already.
In addition to the standard UUID string representation and its URN forms
(starting with urn:uuid: or uuid:), this function accepts 32 digit hex
strings, variants with different positions of - and Base64 encoded UUIDs.
Throws an exception if string can't be interpreted as a UUID.
If you want to make shure to have a "pure" standard UUID representation, check
with is_UUID_string!
my $version = version_of_UUID($uuid);
This function accepts binary and string UUIDs.
my $uuid_time = time_of_UUID($uuid);
This function accepts UUIDs and UUID strings. Returns the time as a floating
point value, so use int() to get a time() compatible value.
Returns undef if the UUID is not version 1.
my $uuid_clk_seq = clk_seq_of_UUID($uuid);
This function accepts UUIDs and UUID strings. Returns the clock sequence for a
version 1 UUID. Returns undef if UUID is not version 1.
my $bool = equal_UUIDs($uuid1, $uuid2);
Returns true if the provided UUIDs are equal. Accepts UUIDs and UUID strings (can be mixed).
The random multi-cast MAC address gives privacy, and getting the real MAC address with Perl is really dirty (and slow);
Using SHA-1 reduces the probabillity of collisions and provides a better "randomness" of the resulting UUID compared to MD5. Version 5 is recommended in RFC 4122 if backward compatibility is not an issue.
Using MD5 (version 3) has a better performance. This could be important with creating UUIDs from file content rather than names.
See RFC 4122 (http://www.ietf.org/rfc/rfc4122.txt) for technical details on UUIDs.
Christian Augustin, <mail at caugustin.de>
Some of this code is based on UUID::Generator by ITO Nobuaki <banb@cpan.org>. But that module is announced to be marked as "deprecated" in the future and it is much too complicated for my liking.
So I decided to reduce it to the necessary parts and to re-implement those parts with a functional interface ...
Jesse Vincent, <jesse at bestpractical.com>, improved version 1.02 with
his tips and a heavy refactoring.
Please report any bugs or feature requests to bug-uuid-tiny at rt.cpan.org,
or through the web interface at
http://rt.cpan.org/NoAuth/ReportBug.html?Queue=UUID-Tiny.
I will be notified, and then you'll automatically be notified of progress on
your bug as I make changes.
You can find documentation for this module with the perldoc command.
perldoc UUID::Tiny
You can also look for information at:
Kudos to ITO Nobuaki <banb@cpan.org> for his UUID::Generator::PurePerl module! My work is based on his code, and without it I would've been lost with all those incomprehensible RFC texts and C codes ...
Thanks to Jesse Vincent (<jesse at bestpractical.com>) for his feedback, tips and refactoring!
Copyright 2009, 2010 Christian Augustin, all rights reserved.
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
ITO Nobuaki has very graciously given me permission to take over copyright for the portions of code that are copied from or resemble his work (see rt.cpan.org #53642 https://rt.cpan.org/Public/Bug/Display.html?id=53642).
| UUID-Tiny documentation | Contained in the UUID-Tiny distribution. |
package UUID::Tiny; use 5.008; use warnings; use strict; use Carp; use Digest::MD5; use MIME::Base64; use Time::HiRes; use POSIX; our $SHA1_CALCULATOR = undef; { # Check for availability of SHA-1 ... local $@; # don't leak an error condition eval { require Digest::SHA; $SHA1_CALCULATOR = Digest::SHA->new(1) } || eval { require Digest::SHA1; $SHA1_CALCULATOR = Digest::SHA1->new() } || eval { require Digest::SHA::PurePerl; $SHA1_CALCULATOR = Digest::SHA::PurePerl->new(1) }; }; our $MD5_CALCULATOR = Digest::MD5->new(); # ToDo: # - Check and report for undefined UUIDs with all UUID manipulating functions! # - Better error propagation for better debugging.
our $VERSION = '1.03';
use Exporter; our @ISA = qw(Exporter); our @EXPORT; our @EXPORT_OK; our %EXPORT_TAGS = ( std => [qw( UUID_NIL UUID_NS_DNS UUID_NS_URL UUID_NS_OID UUID_NS_X500 UUID_V1 UUID_TIME UUID_V3 UUID_MD5 UUID_V4 UUID_RANDOM UUID_V5 UUID_SHA1 UUID_SHA1_AVAIL create_uuid create_uuid_as_string is_uuid_string uuid_to_string string_to_uuid version_of_uuid time_of_uuid clk_seq_of_uuid equal_uuids )], legacy => [qw( UUID_NIL UUID_NS_DNS UUID_NS_URL UUID_NS_OID UUID_NS_X500 UUID_V1 UUID_V3 UUID_V4 UUID_V5 UUID_SHA1_AVAIL create_UUID create_UUID_as_string is_UUID_string UUID_to_string string_to_UUID version_of_UUID time_of_UUID clk_seq_of_UUID equal_UUIDs )], ); Exporter::export_tags('legacy'); Exporter::export_ok_tags('std');
use constant UUID_NIL => "\x00" x 16;
use constant UUID_NS_DNS => "\x6b\xa7\xb8\x10\x9d\xad\x11\xd1\x80\xb4\x00\xc0\x4f\xd4\x30\xc8"; use constant UUID_NS_URL => "\x6b\xa7\xb8\x11\x9d\xad\x11\xd1\x80\xb4\x00\xc0\x4f\xd4\x30\xc8"; use constant UUID_NS_OID => "\x6b\xa7\xb8\x12\x9d\xad\x11\xd1\x80\xb4\x00\xc0\x4f\xd4\x30\xc8"; use constant UUID_NS_X500 => "\x6b\xa7\xb8\x14\x9d\xad\x11\xd1\x80\xb4\x00\xc0\x4f\xd4\x30\xc8";
use constant UUID_V1 => 1; use constant UUID_TIME => 1; use constant UUID_V3 => 3; use constant UUID_MD5 => 3; use constant UUID_V4 => 4; use constant UUID_RANDOM => 4; use constant UUID_V5 => 5; use constant UUID_SHA1 => 5;
sub UUID_SHA1_AVAIL { return defined $SHA1_CALCULATOR ? 1 : 0; }
sub create_uuid { use bytes; my ($v, $arg2, $arg3) = (shift || UUID_V1, shift, shift); my $uuid = UUID_NIL; my $ns_uuid = string_to_uuid(defined $arg3 ? $arg2 : UUID_NIL); my $name = defined $arg3 ? $arg3 : $arg2; if ($v == UUID_V1) { $uuid = _create_v1_uuid(); } elsif ($v == UUID_V3 ) { $uuid = _create_v3_uuid($ns_uuid, $name); } elsif ($v == UUID_V4) { $uuid = _create_v4_uuid(); } elsif ($v == UUID_V5) { $uuid = _create_v5_uuid($ns_uuid, $name); } else { croak __PACKAGE__ . "::create_uuid(): Invalid UUID version '$v'!"; } # Set variant 2 in UUID ... substr $uuid, 8, 1, chr(ord(substr $uuid, 8, 1) & 0x3f | 0x80); return $uuid; } *create_UUID = \&create_uuid; sub _create_v1_uuid { my $uuid = ''; # Create time and clock sequence ... my $timestamp = Time::HiRes::time(); my $clk_seq = _get_clk_seq($timestamp); # hi = time mod (1000000 / 0x100000000) my $hi = floor( $timestamp / 65536.0 / 512 * 78125 ); $timestamp -= $hi * 512.0 * 65536 / 78125; my $low = floor( $timestamp * 10000000.0 + 0.5 ); # MAGIC offset: 01B2-1DD2-13814000 if ( $low < 0xec7ec000 ) { $low += 0x13814000; } else { $low -= 0xec7ec000; $hi++; } if ( $hi < 0x0e4de22e ) { $hi += 0x01b21dd2; } else { $hi -= 0x0e4de22e; # wrap around } # Set time in UUID ... substr $uuid, 0, 4, pack( 'N', $low ); # set time low substr $uuid, 4, 2, pack( 'n', $hi & 0xffff ); # set time mid substr $uuid, 6, 2, pack( 'n', ( $hi >> 16 ) & 0x0fff ); # set time high # Set clock sequence in UUID ... substr $uuid, 8, 2, pack( 'n', $clk_seq ); # Set random node in UUID ... substr $uuid, 10, 6, _random_node_id(); return _set_uuid_version($uuid, 0x10); } sub _create_v3_uuid { my $ns_uuid = shift; my $name = shift; my $uuid = ''; lock $MD5_CALCULATOR; # Create digest in UUID ... $MD5_CALCULATOR->reset(); $MD5_CALCULATOR->add($ns_uuid); if ( ref($name) =~ m/^(?:GLOB|IO::)/ ) { $MD5_CALCULATOR->addfile($name); } elsif ( ref $name ) { croak __PACKAGE__ . '::create_uuid(): Name for v3 UUID' . ' has to be SCALAR, GLOB or IO object, not ' . ref($name) .'!' ; } elsif ( defined $name ) { $MD5_CALCULATOR->add($name); } else { croak __PACKAGE__ . '::create_uuid(): Name for v3 UUID is not defined!'; } # Use only first 16 Bytes ... $uuid = substr( $MD5_CALCULATOR->digest(), 0, 16 ); return _set_uuid_version( $uuid, 0x30 ); } sub _create_v4_uuid { # Create random value in UUID ... my $uuid = ''; for ( 1 .. 4 ) { $uuid .= pack 'I', _rand_32bit(); } return _set_uuid_version($uuid, 0x40); } sub _create_v5_uuid { my $ns_uuid = shift; my $name = shift; my $uuid = ''; if (!$SHA1_CALCULATOR) { croak __PACKAGE__ . '::create_uuid(): No SHA-1 implementation available! ' . 'Please install Digest::SHA1, Digest::SHA or ' . 'Digest::SHA::PurePerl to use SHA-1 based UUIDs.' ; } lock $SHA1_CALCULATOR; $SHA1_CALCULATOR->reset(); $SHA1_CALCULATOR->add($ns_uuid); if ( ref($name) =~ m/^(?:GLOB|IO::)/ ) { $SHA1_CALCULATOR->addfile($name); } elsif ( ref $name ) { croak __PACKAGE__ . '::create_uuid(): Name for v5 UUID' . ' has to be SCALAR, GLOB or IO object, not ' . ref($name) .'!' ; } elsif ( defined $name ) { $SHA1_CALCULATOR->add($name); } else { croak __PACKAGE__ . '::create_uuid(): Name for v5 UUID is not defined!'; } # Use only first 16 Bytes ... $uuid = substr( $SHA1_CALCULATOR->digest(), 0, 16 ); return _set_uuid_version($uuid, 0x50); } sub _set_uuid_version { my $uuid = shift; my $version = shift; substr $uuid, 6, 1, chr( ord( substr( $uuid, 6, 1 ) ) & 0x0f | $version ); return $uuid; }
sub create_uuid_as_string { return uuid_to_string(create_uuid(@_)); } *create_UUID_as_string = \&create_uuid_as_string;
our $IS_UUID_STRING = qr/^[0-9a-f]{8}(?:-[0-9a-f]{4}){3}-[0-9a-f]{12}$/is; our $IS_UUID_HEX = qr/^[0-9a-f]{32}$/is; our $IS_UUID_Base64 = qr/^[+\/0-9A-Za-z]{22}(?:==)?$/s; sub is_uuid_string { my $uuid = shift; return $uuid =~ m/$IS_UUID_STRING/; } *is_UUID_string = \&is_uuid_string;
sub uuid_to_string { my $uuid = shift; use bytes; return $uuid if $uuid =~ m/$IS_UUID_STRING/; croak __PACKAGE__ . "::uuid_to_string(): Invalid UUID!" unless length $uuid == 16; return join '-', map { unpack 'H*', $_ } map { substr $uuid, 0, $_, '' } ( 4, 2, 2, 2, 6 ); } *UUID_to_string = \&uuid_to_string;
sub string_to_uuid { my $uuid = shift; use bytes; return $uuid if length $uuid == 16; return decode_base64($uuid) if ($uuid =~ m/$IS_UUID_Base64/); my $str = $uuid; $uuid =~ s/^(?:urn:)?(?:uuid:)?//io; $uuid =~ tr/-//d; return pack 'H*', $uuid if $uuid =~ m/$IS_UUID_HEX/; croak __PACKAGE__ . "::string_to_uuid(): '$str' is no UUID string!"; } *string_to_UUID = \&string_to_uuid;
sub version_of_uuid { my $uuid = shift; use bytes; $uuid = string_to_uuid($uuid); return (ord(substr($uuid, 6, 1)) & 0xf0) >> 4; } *version_of_UUID = \&version_of_uuid;
sub time_of_uuid { my $uuid = shift; use bytes; $uuid = string_to_uuid($uuid); return unless version_of_uuid($uuid) == 1; my $low = unpack 'N', substr($uuid, 0, 4); my $mid = unpack 'n', substr($uuid, 4, 2); my $high = unpack('n', substr($uuid, 6, 2)) & 0x0fff; my $hi = $mid | $high << 16; # MAGIC offset: 01B2-1DD2-13814000 if ($low >= 0x13814000) { $low -= 0x13814000; } else { $low += 0xec7ec000; $hi --; } if ($hi >= 0x01b21dd2) { $hi -= 0x01b21dd2; } else { $hi += 0x0e4de22e; # wrap around } $low /= 10000000.0; $hi /= 78125.0 / 512 / 65536; # / 1000000 * 0x10000000 return $hi + $low; } *time_of_UUID = \&time_of_uuid;
sub clk_seq_of_uuid { use bytes; my $uuid = shift; $uuid = string_to_uuid($uuid); return unless version_of_uuid($uuid) == 1; my $r = unpack 'n', substr($uuid, 8, 2); my $v = $r >> 13; my $w = ($v >= 6) ? 3 # 11x : ($v >= 4) ? 2 # 10- : 1 # 0-- ; $w = 16 - $w; return $r & ((1 << $w) - 1); } *clk_seq_of_UUID = \&clk_seq_of_uuid;
sub equal_uuids { my ($u1, $u2) = @_; return unless defined $u1 && defined $u2; return string_to_uuid($u1) eq string_to_uuid($u2); } *equal_UUIDs = \&equal_uuids; # # Private functions ... # my $Last_Pid; my $Clk_Seq; # There is a problem with $Clk_Seq and rand() on forking a process using # UUID::Tiny, because the forked process would use the same basic $Clk_Seq and # the same seed (!) for rand(). $Clk_Seq is UUID::Tiny's problem, but with # rand() it is Perl's bad behavior. So _init_globals() has to be called every # time before using $Clk_Seq or rand() ... sub _init_globals { lock $Last_Pid; lock $Clk_Seq; if (!defined $Last_Pid || $Last_Pid != $$) { $Last_Pid = $$; # $Clk_Seq = _generate_clk_seq(); # There's a slight chance to get the same value as $Clk_Seq ... for (my $i = 0; $i <= 5; $i++) { my $new_clk_seq = _generate_clk_seq(); if (!defined($Clk_Seq) || $new_clk_seq != $Clk_Seq) { $Clk_Seq = $new_clk_seq; last; } if ($i == 5) { croak __PACKAGE__ . "::_init_globals(): Can't get unique clk_seq!"; } } srand(); } return; } my $Last_Timestamp; sub _get_clk_seq { my $ts = shift; _init_globals(); lock $Last_Timestamp; lock $Clk_Seq; #if (!defined $Last_Timestamp || $ts <= $Last_Timestamp) { if (defined $Last_Timestamp && $ts <= $Last_Timestamp) { #$Clk_Seq = ($Clk_Seq + 1) % 65536; # The old variant used modulo, but this looks unnecessary, # because we should only use the signigicant part of the # number, and that also lets the counter circle around: $Clk_Seq = ($Clk_Seq + 1) & 0x3fff; } $Last_Timestamp = $ts; #return $Clk_Seq & 0x03ff; # no longer needed - and it was wrong too! return $Clk_Seq; } sub _generate_clk_seq { my $self = shift; # _init_globals(); my @data; push @data, '' . $$; push @data, ':' . Time::HiRes::time(); # 16 bit digest # We should return only the significant part of the number! return (unpack 'n', _digest_as_octets(2, @data)) & 0x3fff; } sub _random_node_id { my $self = shift; my $r1 = _rand_32bit(); my $r2 = _rand_32bit(); my $hi = ($r1 >> 8) ^ ($r2 & 0xff); my $lo = ($r2 >> 8) ^ ($r1 & 0xff); $hi |= 0x80; my $id = substr pack('V', $hi), 0, 3; $id .= substr pack('V', $lo), 0, 3; return $id; } sub _rand_32bit { _init_globals(); my $v1 = int(rand(65536)) % 65536; my $v2 = int(rand(65536)) % 65536; return ($v1 << 16) | $v2; } sub _fold_into_octets { use bytes; my ($num_octets, $s) = @_; my $x = "\x0" x $num_octets; while (length $s > 0) { my $n = ''; while (length $x > 0) { my $c = ord(substr $x, -1, 1, '') ^ ord(substr $s, -1, 1, ''); $n = chr($c) . $n; last if length $s <= 0; } $n = $x . $n; $x = $n; } return $x; } sub _digest_as_octets { my $num_octets = shift; $MD5_CALCULATOR->reset(); $MD5_CALCULATOR->add($_) for @_; return _fold_into_octets($num_octets, $MD5_CALCULATOR->digest); }
1; # End of UUID::Tiny