Discover Life in America

Don Windsor - 23 July, 1999

Unique identifiers -- institution codes -Reply

Date: Fri, 23 Jul 1999 19:53:38 -0400
From: "Donald Windsor " <WINDSORD@tivoli.si.edu>
To: pick@pick.uga.edu
Subject: Unique identifiers -- institution codes -Reply

John,  In the best of world's I would have  the unique
identifier in the barcode and in the text on the label.
But constraints rule the world.  For us, going with code 128
means we can piggy back onto the ICBG... and our costs
plummet from 5-8K or more to  the costs of the label stock.
Our local rep. is working on smallest possible lable sizes
for 4 alpha and 7 numerics using code 128 (and on fewer
alpha chars).  If the product is too large then I would
prefer the KU compromise--include the full unique identifier
in text and a reduced identifer in barcode.    But,  you are
right,  this is not a global-digital solution.  But, the
less than global solution is not too bad.  It will be
trivial to expand the digital records in our data bases  so
they are uniquely coded...and specimens will be globally
unique to anyone who can read the label.
  Gotta hit the road to Chiriqui!  See you Cyberville on
Monday.  Don

>>> John Pickering <pick@pick.uga.edu> 07/23/99 01:35pm >>>
Don (and others),

Why do you think that we only need 7 or 9 bar-coded digits?
We currently
are using 4 letters and 6 digits, something that code 128
cannot do on a
small insect label.  Until we decide what to put on our
labels, lets not
limit our options. It would be a big mistake for us (or STRI
or anyone
else) not to use a unique institution code on museum labels.

Steve Ashe and KU have adopted an efficient LOCAL solution,
using "KU" (or
"SM" ?) and 7 digits.  However, rather than following in
KU's code 128
footsteps, we should all work toward developing an efficient
GLOBAL
solution -- one that best allow's us all to share speciemens
and data with
other institutions using unique identifiers.

For entomology collections, I recommend that we generally
adopt the unique
institution identifiers in R. H. Arnett & G. A. Samuelson's
publication
(1996.  Insect & Spider Collections of the World, E. J.
Brill: Gainesville,
FL, pp. 220).  However, at this early stage in adopting
unique identifiers
among collections, we should allow some exceptions to the
Arnett &
Samuelson identifiers so as to protect the investment of
anyone who has
already barcoded specimens.  Existing labels should take
precedence over
Arnett & Samuelson's institution identifiers.  Thus, nobody
else should use
INBIOCRI, KU (SM?), etc., as these identifiers are already
used by
INBio/ALAS and KU.  I recommend that we ask the Association
of Systematic
Collections (ASC) to maintain such a list and manage it in
conjunction with
similar lists for herbaria, vertebate, and other
collections.

Code 128 is an efficient local solution, but because it only
allows 2
letters for an institution identifier, it is not a good
global solution.
Code 49 labels could serve as a global solution.  Several
issues to
consider:

1) Our first priority is to assure that each specimen
receives a unique
identifier.  We must collectively come to a consensus on how
to assign
these so that they do not conflict among institutions and
projects.  If
code 128 works for a short acronym, such as KU, then that
institution
should use code 128.  If an institution has a longer
acronym, then they
should use code 49.  Let's standardize the unique
identifiers, and let
institutions change their technology for making and reading
them as they
see fit.

2) As Jack so aptly noted, there is a great danger in coding
only part of a
unique identifier in the barcode symbology to save space.
What got left
out?  When we go to the Web to seach for data on specimens,
it becomes
difficult to share info if we are all using abbreviations
for our unique
identifiers, either on our barcodes or in our databases.  As
a society,
we're  spending billions to solve the Y2K problem of
"saving" the first 2
digits at the beginning of the year.  Let's not repeat the
folly of saving
characters -- disk space is cheap and code 49 can handle the
larger
institutional identifiers that we need.  Yes, I too was
guilty of cutting
my institutional "UGCA" out in some of my programs and data
records.  It
saved disk space and was easier to work with just digits.
However, I now
have my own Y2K false economy, because I'm now mixing
"INBio" specimens
with my "UGCA" records and specimens.  Like Jack, I'm now
converting my
database to one that uses complete unique identifiers.  I
now have no
abbreviations, no saving disk space, but complete
compatibility with all
other collections that use unique identifiers and that don't
economize on
digits!

3) Regarding reading code 49, in the hands of a trained
operator it can be
exceedingly fast, on the order of a few seconds per specimen
(if labels are
face up), a little longer if specimens need to be handled.
Error checking
is included in the symbology, so errors should occur very
rarely.  Reading
with the Imager 1470, rather than with the old scanner
should be even
easier, as it works like a digital camera.  The better
scanners and imagers
can be programmed to accept code 39, 49, 128, etc. so that
we can read unit
trays of mixed codes.  In short, we should all have to
accept the same
technological solution, only a solution that keeps our
identifiers unique.

4) The folks at Intermec are helping to develop and test the
technology
that we need.  In a week or so, I expect to test their
Imager 1470 in my
lab.  By next week Christy Deal hopes to have a set of proof
labels printed
on their 3240 printer that I'll send to each of you who
wishes to see them.
I recommend waiting on making any purchase decision until we
are sure that
this new technology works.  It won't be long.

5) Because of error checking, we are likely to continue to
use barcodes and
not eventually switch to reading alphanumerics on the
labels.  Barcodes
include error checks to avoid recognizing an "F" plus a
"leg" and an "E."
Hence, let's think long-term now and not assume that
technology will bail
us out and let us read the rest of the label that is not in
the barcode
symbol.

6) Finally, regarding putting barcodes face up versus face
down, my vision
is that we will eventually have the hardware and software to
scan a unit
tray of insects instantaneously if the labels are face up
and the bugs are
not too large to obscure completely the symbols.  Computer
manufacturers
are already reading multiple barcodes on boards moving past
sensors on
conveyor belts.  When Sprague was here we made a jpg image
with my digital
camera of appoximately 50 specimens in a unit tray.  He will
test how well
his existing software can decode the barcodes in this image.
 Although the
image's resolution appeared fine, we would need to develop
some "leg
removing" algorithms before this becomes a practical
solution to our needs.
Anyway, reading multiple barcodes simultaneously is closer
to reality than
you may suspect.  After all, Ian Gauld's Daisy project is
using artificial
intelligence to identifying species from images of wing
veination.

Must go to empty some traps in the Smokies.  More next week.

Cheers,
Pick



Discover Life in America | Science | Unique Identifiers & Barcodes | Correspondence | Don Windsor - 23 July, 1999