Linda's Readme file for ...

Literature on the Polychaeta (Annelida). 2nd Edition.

By Linda A. Ward & Kristian Fauchald

1997

Released by the Department of Invertebrate Zoology
Smithsonian Institution, Washington, D.C., USA.
April 1997.

Copyright: Smithsonian Institution, 1990, 1997; all rights reserved. This bibliography is presented for scholarly use only; copying or redistributing the data in any way for personal or corporate gain are not permitted. This bibliography represents a "work in progress" and much of the data within has not been verified. Any errors found within are the responsibility of the authors (Ward & Fauchald) and not the Smithsonian Institution.

Availability: This release of the "Polychaete Bibliography" will only be available as files that can be downloaded from the World Wide Web. There are three reasons for doing it this way: 1) The cost of producing and mailing a CD-ROM or paper version, 2) The time involved in mailing out a hard copy and finally, 3) The ease of updating files. The original CD-Rom version came out in 1990 and it has taken 7 years to create a new version. We expect to be able to do yearly updates to the WEB version.

State of completion: The bibliography is by no means complete or in perfect condition. We felt it was time to get it out to the Polychaete Community as is, rather than waiting for us to finish editing it. It has just under 17500 records and includes papers that deal with general ecology, methodology, general science, phylogenetics as well as more specific papers on polychaetes. Any additions or correction will be gladly accepted. If you are sending corrections to records in the bibliography please include the reference number so that we fix the correct one. Send additions or corrections to Linda at [email protected].

Keywords were added to many, but not all, of the records in the Papyrus version. For those that have keywords it was not done in a consistent fashion. We tried to indicate which records had polychaete information "POLYCHAETA"; family names were also added (in many cases you will see old names such as AMMOCHARIDAE or CHLORAEMIDAE as well as the currently accepted names like Oweniidae or Flabelligeridae).

In the COMMENTS field for many records, in the Papyrus version, you will find long lists of species separated by semicolons. You can search for all records with a given species such as Term= "*Capitella capitata*" but the search is a little slow. These lists are by no means complete, in some cases only the new taxa were listed but it is useful as a starting point for information on a particular taxon. The 3 letter family code (all capitalized abbreviations, such as SPI, CAP, OPH) that occur in some records are being phased out and replaced with the family name KEYWORD. Other abbreviations are n.gen. or n.g. for new genus; n.sp. = new species; n.spp.= new species (2 or more); n.fam. = new family; n.comb. = new combination; n.subgen. = new subgenus, etc.

The data files are of two types - program data files and text files:

1) The files as ready for use in bibliographic programs

Papyrus files: (*.bb) are the back-up files created by Papyrus (a bibliographic program created by Research Software Design), and can be retrieved by either the PC or MAC version of that program. This is the most complete version of the bibliography. To learn more about Papyrus or to get the Alpha version for MAC's (or the free viewer) see Research Software Design's WEB site at http://www.teleport.com/~rsd/. If you already have a Papyrus database of your own we would recommend putting this one in a separate subdirectory (or folder for you MAC types). To import the data for the MAC version you go to the import option "Papyrus-Papyrus transfer" under "File" and this would be a Version 7 to Version 8 conversion. For the PC version go to Utilities and run Restore. (A word of warning: it only took about 20 minutes to set up the new version of the PC database but more than 24 hours to load the records into the MAC version.)

The files are as follows:

Backup.bb

Key1.bb

Jrn1.bb

Fmt1.bb

Ref1.bb through Ref16.bb

(When these files were sent as BinHex from a PC Eudora mail account to a MAC Eudora account the number of files multiplied but everything seemed to work just fine.)

(Downloading information added by GBR)

**The zipped Papyrus *.bb files**

Step 1. Download and save to disk the ref*.bb files contained in allrefbb.zip (1.5 Megabytes) - Your browser should warn you that this is a binary file.
Step 2. Download and save to disk the rest of the *.bb files contained (together with a copy of the file you are reading now) in otherbb.zip (70 Kb) - Again your browser should warn you that this is a binary file.

The installed bibliography occupies about 15 Megabytes of disk space.

Macintosh Papyrus users

Macintosh users can either unzip the files above (ZipIt works well) or download this version below of the same files for Papyrus packaged into a self-extracting Mac archive by Thomas van Wissen ([email protected]). Your browser will handle this properly if it is already configured to deal with Macbinary files

Download and save to disk the Mac SEA file contained in allrefbb.sea.bin (1.6 Megabytes)

EndNote users

This version has been exported from Papyrus and converted into an EndNote v2 library file (by GBR). Such files can be read by any EndNote program on a Macintosh or PC. Additional information on the EndNote bibliography is included in the zip file (together with a copy of the file you are reading now).

Download and save to disk the EndNote library contained in e-n-poly.zip (2.6 Megabytes).

There were six citations that had Papyrus 'Notecards' (a peculiar method in Papyrus for dealing with data that does not fit the 8000 character Papyrus field limit). These were papers with long species lists (Campoy 1979, Day & Hutchings 1979, Fauchald 1977, Fauvel 1932, Hobson & Banse 1981, Straughan & Klink 1980). 'Notecard' information was overlooked during the original export. If this information is of interest here are the six species lists in a separate EndNote file.

Download and save to disk the EndNote library contained in W-F-Spp-Lists.zip (33 Kilobytes).

2.) Text Files.

ASCII files: (*.txt) are ascii comma delimited files which use quotes around text fields, commas between fields and a hard return as the record delimiter. We have successfully brought these files into Access (Microsoft Corporation's Windows database program), just remember to tell the program that they are ASCII rather than ANSI files or else the "European characters" come out looking a little strange. The ASCII version of the bibliography lacks the keywords and species lists and comments that appear in the Papyrus version. Also a number of the titles were truncated because of the 255 field length limitation of most database programs.

There are 28 fields in these files. If a record did not have data for a particular field a field delimiter was still inserted so that all the data could be imported into a single table. The alternative would have been to create separate export files for the Articles, Books, Chapters, Dissertations/Theses, Other formats. This would have been easy enough for all except the "articles" which would have needed to be subdivided in some fashion as it would have been huge. It seemed easiest to just export the data by reference number 0-2000 (2000.txt), 2001-4000 (4000.txt) 4001-6000 (6000.txt) etc. The files are 2000.txt, 4000.txt, 6000.txt, 8000.txt, 10000.txt, 12000.txt, 14000.txt, 16000.txt, 18000.txt, 20000.txt and 21000.txt.

The fields and their order are as follows:

1) Reference Number

2) Author or Editor of Book

3) "Editor"

4) Year

5) Title of Article/Chapter/Other

6) Title of Book or Thesis

7) Journal in Full

8) Journal Abbreviated in World List format

9) Journal Series

10) Journal Volume

11) Issue

12) Supplement

13) Issue or Series Title

14) Place in Series

15) Editor of Issue/Series

16) "Editor"

17) Editors of Book (for chapters)

18) Pagination, Chapters/Articles

19) Pagination Book/Thesis

20) Edition of Book

21) Volume of Book

22) Publisher

23) City of Publication

24) Degree

25) Document Type

26) Institution/City

27) "also print"

28) Other information

Additional information about the text version

This information was provided later by Linda to members of ANNELIDA list.

1. Opening the text file and getting diacritics, etc. changed to nonsense characters or entirely lost.

This comes from opening the files in a word processor or database program without the software converting it from ASCII to ANSI. Dr. Akesson's name, for example, comes out with a big box instead of the correct accent at the beginning if opened without undergoing the correct conversion in MS-Word. Below I describe a couple of ways you might be able to solve this problem. If your software can't do the correct conversion then I am sorry but I can't help you any further. The text files can be successfully opened in MS Word, but if you open the file "as is" it does look like garbage. I experimented with that program (Word) and found that you need to tell it that the file it is opening is not a text file but rather it is an "MS-DOS text" file. To do this you need to go to FILE then OPEN and then click on the "CONFIRM CONVERSIONS" box that appears in the lower right hand side of the screen. This then brings up a list of choices for file types. Select 'MS-DOS TEXT'.

If you are importing the data into a database you need to tell it that it is an ASCII delimited text file. In MS-Access the sequence is to go to "Get External Data" - "Import"; tell the program that the file type is "text"; click on the button marked "Advanced" and tell the program that the file is "DOS OS/2" rather than ANSI. Other programs should have similar steps. I know with Paradox that there was a way to change the preferences for the import process and you may need to re-set those to get the data in correctly. This may also be true for some of the other database programs.

2. Some of you have successfully brought the text files up in MS-Access but wondered about the error messages and possible lost records.

The first field is a reference number but; it is not a complete number series, ie. there are huge gaps in the numbering sequence. These are somewhat intentional and do not represent missing records. If you were to create a new record in the Papyrus version it should be assigned a number 20969 but there are not 20968 records in the file. The actual number of records is 17487 in the Papyrus version but may be fewer in the text version due to duplicate records or records that are eliminated for other reasons.

Error messages about lost data on import. When the data is imported in most of the database programs you will lose some data and possibly even some records because the citations have fields that exceed the standard 256 field length. A number of the citations in the bibliography have really huge titles (Lamarck's books come to mind) and I know in MS-Access the program accepted the records but truncated the title field. I tried changing that field to a memo field to see if that would solve the problem. There were also apparently a number of either blank or duplicate records in the text files that are kicked out on import. I never could figure out where they are in the text files but they may have come from a number of duplicate citations that occur in the bibliography.

The zipped ascii comma-delimited files

Download and save to disk the *.txt files contained in bibtxt.zip (1.3 Megabytes) - Again your browser should warn you that this is a binary file. NOTE: You do not need these files unless you DON'T have either the Papyrus bibliographic program or EndNote, OR you want a plain text copy for some other purpose. Remember there are free viewers available for EndNote and Papyrus (see the introductory page).

Return to top

22 April 1997, links updated 24 April 2007 by GBR