|
|
1.1 root 1: GBREL.TXT Genetic Sequence Data Bank
2: 15 December 1990
3:
4: GenBank(R) Release 66.0
5:
6: Distribution Tape Release Notes
7:
8: 41057 loci, 51306092 bases, from 50908 reported sequences
9:
10: This document describes the data written on GenBank distribution
11: tapes. The examples used are from the current release. If you have any
12: questions or comments about the data bank, the distribution tape, or
13: this document, please call (415)962-7364 or write to:
14:
15: GenBank
16: c/o IntelliGenetics Inc.
17: 700 East El Camino Real
18: Mountain View, California 94040
19: USA
20:
21: The electronic mail address is: [email protected]
22:
23: 1. INTRODUCTION
24:
25: 1.1 Release 66.0
26:
27: Release 66.0 has 41,057 loci representing 51,306,092 bases. Release
28: 65.0 had 39,553 loci with 49,179,285 bases. Release 66.0 thus is 4.3%
29: larger in bases than Release 65.0. A statistical summary of Release
30: 66.0 is presented in Appendix A.
31:
32: 1.2 Organization of This Document
33:
34: This introduction notes the changes to GenBank since the last release.
35: The next section describes the contents of the tape files. The third
36: section illustrates the formats of the tape files. The fourth section
37: describes the proposed changes planned for future releases. The fifth
38: section describes known problems in this release. The last section
39: contains notes about the administration of GenBank.
40:
41: 1.3 Recent Changes in the Data Bank
42:
43: 1.3.1 Changes in This Release
44:
45: 1.3.1.1 Data from EMBL and the DNA Data Bank of Japan
46:
47: New sequence data from EMBL Release 23 have been incorporated into
48: this release of GenBank. Sequence data from Release 7.0 of the DNA
49: Data Bank of Japan (DDBJ) have also been incorporated into this
50: release. Entries with accession numbers beginning with the letter `D'
51: have been created and annotated by DDBJ. Release 66 contains
52: approximately 467 entries from DDBJ.
53:
54: 1.3.1.2 Changes in the Source and Definition Lines
55:
56: The Source and Definition lines for new entries are now generated by
57: the GenBank database software from information in the GenBank
58: relational database, rather than being entered by an annotator.
59:
60: These lines include information such as the organism name, the name
61: and type of the molecule, and the gene product. An example is:
62:
63: DEFINITION A.auricula-judae (mushroom) 5S ribosomal RNA.
64: SOURCE A.auricula-judae (mushroom) ribosomal RNA.
65:
66: In Release 66.0, these lines contain the same information, but have a
67: format that is less English-like. This change only applies to new
68: entries.
69:
70: 1.3.1.3 Changes in Locus Names
71:
72: Several locus names have been changed to make the application of the
73: organism codes consistent throughout the data bank. These changes are
74: listed in Appendix B.
75:
76: 1.3.1.4 Date on LOCUS Line
77:
78: The date on the LOCUS line now indicates the actual date on which the
79: data first appeared in the GenBank relational database or the date of
80: the last revision. Previously, this was the date of the release in
81: which the data first appeared or was revised.
82:
83: 1.3.1.5 Release 66 "Close-of-Data"
84:
85: The freeze date for data to appear in Release 66 was November 22,
86: 1990. This is the date on which flatfile generation began. The process
87: of converting the data from the relational database into the flatfile
88: ASCII format takes several days to complete. New data continue to be
89: added during this time; if data are added before their division has
90: been processed, they may appear in the release (even though dated
91: after the freeze date).
92:
93: 1.3.2 Changes in Earlier Releases
94:
95: The following changes in GenBank format were implemented in previous
96: releases and described in their Release Notes. These changes are
97: described here again for those users who may not have received those
98: releases.
99:
100: 1.3.2.1 International Feature Table Format (Release 64)
101:
102: The EMBL Data Library and GenBank, with the assistance of the DNA Data
103: Bank of Japan, have developed a standard feature table to be
104: implemented by all three data banks. The new feature table is designed
105: to be more understandable and useful. The common feature table will
106: also make software development easier and allow simpler data
107: conversion between data banks.
108:
109: The new feature table was implemented in Release 64.0 (June 1990).
110: Details of the new format are described in a document entitled: `The
111: DDBJ/EMBL/GenBank feature table: Definition, Version 1.01, September
112: 10, 1988.' Copies of this document are available on request at the
113: address given on the first page of these release notes.
114:
115: Section 3.5.11 provides further information on the new feature table
116: format.
117:
118: 1.3.2.2 Minor Differences in GenBank Format (Release 64)
119:
120: Beginning in the second quarter of 1990, the GenBank data bank has
121: been maintained in relational format. The data bank will continue to
122: be distributed in the standard flatfile format described in this
123: document. Starting with Release 64.0, the standard flatfile GenBank
124: data bank (generated from the relational data base tables) contains a
125: few minor format differences from previous releases. These differences
126: are described in the remainder of this section.
127:
128: Information about the relational database format can be requested by
129: calling (505) 665-2177 or by writing to:
130:
131: GenBank
132: Group T-10, Mail Stop K710
133: Los Alamos National Laboratories
134: Los Alamos, NM 87545
135:
136: 1.3.2.2.1 Keyword Order
137:
138: The order of keyword phrases on the KEYWORDS line is alphabetical.
139:
140: 1.3.2.2.2 Accession Number Order
141:
142: The order of non-primary accession numbers on the ACCESSION line is
143: not necessarily preserved from one release to the next.
144:
145: 1.3.2.2.3 Reference Order
146:
147: The order of the references in an entry is not necessarily preserved
148: from one release to the next.
149:
150: 1.3.2.2.4 Changes in Taxonomy
151:
152: The taxonomic classification of many organisms was changed to ensure
153: uniformity throughout the data bank. A few organisms are listed as
154: unclassified as a result of inconsistencies in the data. The
155: annotation staff is addressing these inconsistencies.
156:
157: 1.3.2.2.5 Inconsistencies in Reference Information
158:
159: Inconsistencies in reference information have not been preserved. All
160: occurrences of a single reference are identical, usually matching the
161: first occurrence in the data bank.
162:
163: 1.3.2.2.6 Formatting Changes
164:
165: Certain text fields (for example, definition, comment, title, etc.)
166: have been reformatted slightly, in some entries. In most cases, the
167: actual text remains unchanged.
168:
169: 1.3.2.2.7 Comment Field Formatting
170:
171: Any special formatting in the comment field may not be preserved. This
172: may be corrected in future releases.
173:
174: 1.3.2.2.8 Reference Line Changes
175:
176: The reference lines for sites and review papers have been slightly
177: modified. All of these papers are classified as `sites', with the
178: original text describing the citation appearing in the comment field.
179: Also, when a reference is cited elsewhere in an entry, in most cases
180: the entire citation appears in square brackets. Previously, only the
181: number of the reference appeared in square brackets.
182:
183: 2. ORGANIZATION OF TAPE FILES
184:
185: 2.1 Tape Formats
186:
187: The GenBank data bank is available in three formats on three different
188: physical media (see Section 5.4 for further details on which formats
189: are available on each medium), and on CD ROM.
190:
191: GenBank is available on 9-track, unlabelled, industry-standard, ASCII
192: magnetic tapes. These tapes have been written in fixed-length records
193: of 80 characters, each with no carriage-return or line-feed
194: characters. Each record corresponds to one line in the data bank;
195: trailing blanks have been added to the lines to make them all exactly
196: 80 characters long. (A completely blank line is therefore represented
197: by 80 blanks.)
198:
199: The label affixed to the tape reel indicates its block size and
200: density. If no specifications are received from you, the tape is
201: written with a fixed block size of 160 records (12,800 characters) and
202: a density of 6250 bpi (bits per inch). We also offer tapes written at
203: a density of 1600 bpi and a block size of 40 records (3200
204: characters).
205:
206: GenBank is also available as a VAX/VMS Backup saveset (on 9-track
207: tapes or TK-50 cartridges) or as compressed Unix tar archives (on 9
208: track tapes and Sun 1/4" QIC 24 format tape cartridges).
209:
210: The GenBank tape distribution files are also available on ISO-9660
211: compatible CD ROM. The data are written as ASCII files with variable
212: length records. Each record corresponds to one line in the data bank
213: and ends with a carriage return and a line-feed character.
214:
215: The data on the tapes have both uppercase and lowercase characters.
216: Upon special request, the unlabelled, 9 track tapes can be written
217: using uppercase characters only (Section 6.4 specifies which formats
218: are available in uppercase only).
219:
220: 2.2 Files
221:
222: GenBank consists of twenty-two files in all magnetic tape
223: distributions. The list which follows describes each of the files
224: included in the distribution. In the following sections there are
225: additional lists indicating the breakdown of files on the various
226: media and formats.
227:
228: 2.2.1 File Descriptions
229:
230: 1. GBREL.TXT - Release notes (this document).
231: 2. GBSDR.TXT - Short directory of the data bank.
232: 3. GBNEW.TXT - List of new or substantially revised entries.
233: 4. GBACC.IDX - Index of the entries according to accession number.
234: 5. GBKEY.IDX - Index of the entries according to keyword phrase.
235: 6. GBAUT.IDX - Index of the entries according to author.
236: 7. GBJOU.IDX - Index of the entries according to journal citation.
237: 8. GBHGM.IDX - Index of the entries according to gene symbol.
238: 9. GBDAT.FRM - Forms for submitting sequences or corrections to GenBank.
239: 10. GBPRI.SEQ - Primate sequence entries.
240: 11. GBROD.SEQ - Rodent sequence entries.
241: 12. GBMAM.SEQ - Other mammalian sequence entries.
242: 13. GBVRT.SEQ - Other vertebrate sequence entries.
243: 14. GBINV.SEQ - Invertebrate sequence entries.
244: 15. GBPLN.SEQ - Plant sequence entries (including fungi and algae).
245: 16. GBORG.SEQ - Eukaryotic organelle sequence entries.
246: 17. GBBCT.SEQ - Bacterial sequence entries.
247: 18. GBRNA.SEQ - Structural RNA sequence entries.
248: 19. GBVRL.SEQ - Viral sequence entries.
249: 20. GBPHG.SEQ - Phage sequence entries.
250: 21. GBSYN.SEQ - Synthetic and chimeric sequence entries.
251: 22. GBUNA.SEQ - Unannotated sequence entries.
252:
253: 2.2.2 Fixed Length Records
254:
255: Approximately 197 MB of disk space is required for the Release 66.0
256: files in fixed-length record format. All the files fit on two 6250 bpi
257: tapes and are divided between the tapes as follows.
258:
259: Tape 1
260:
261: GBREL.TXT
262: GBSDR.TXT
263: GBNEW.TXT
264: GBACC.IDX
265: GBKEY.IDX
266: GBAUT.IDX
267: GBJOU.IDX
268: GBHGM.IDX
269: GBDAT.FRM
270: GBPRI.SEQ
271: GBROD.SEQ
272: GBMAM.SEQ
273: GBVRT.SEQ
274: GBINV.SEQ
275: GBPLN.SEQ
276: GBORG.SEQ
277:
278:
279: Tape 2
280:
281: GBBCT.SEQ
282: GBRNA.SEQ
283: GBVRL.SEQ
284: GBPHG.SEQ
285: GBSYN.SEQ
286: GBUNA.SEQ
287:
288: At 1600 bpi, seven tapes are required and the files are divided among
289: the tapes as follows:
290:
291: Tape 1 Tape 4
292:
293: GBREL.TXT GBVRT.SEQ
294: GBSDR.TXT GBBCT.SEQ
295: GBNEW.TXT
296: GBACC.IDX
297: GBKEY.IDX Tape 5
298: GBAUT.IDX
299: GBJOU.IDX GBINV.SEQ
300: GBHGM.IDX GBPLN.SEQ
301: GBDAT.FRM GBPHG.SEQ
302: GBMAM.SEQ
303:
304: Tape 6
305: Tape 2
306: GBVRL.SEQ
307: GBPRI.SEQ GBSYN.SEQ
308:
309: Tape 3 Tape 7
310:
311: GBROD.SEQ GBUNA.SEQ
312: GBORG.SEQ
313: GBRNA.SEQ
314:
315:
316: 2.2.3 VAX/VMS Backup Saveset
317:
318: Saveset files are in directory order rather than in the order shown
319: for the formats above. The files are in compressed format (See Section
320: 1.3.1.2 for details). Approximately 139 MB of disk space is required
321: for Release 66.0 files in VAX/VMS Backup Saveset format. The files
322: archived in the Backup Saveset use variable-length records, not the
323: 80-character fixed-length records described above. All files fit on
324: one 6250 bpi tape. At 1600 bpi, two tapes are required. The division
325: of the files between the two tapes was not available at the time these
326: Release Notes were prepared. The files will appear in the following
327: order:
328:
329: AAAREADME.TXT
330: DCOMPRESS.CLD
331: DCOMPRESS.EXE
332: DECMPRESS.COM
333: GBACC_IDX.Z
334: GBAUT_IDX.Z
335: GBBCT_SEQ.Z
336: GBDAT_FRM.Z
337: GBHGM_IDX.Z
338: GBINV_SEQ.Z
339: GBJOU_IDX.Z
340: GBKEY_IDX.Z
341: GBMAM_SEQ.Z
342: GBNEW_TXT.Z
343: GBORG_SEQ.Z
344: GBPHG_SEQ.Z
345: GBPLN_SEQ.Z
346: GBPRI_SEQ.Z
347: GBREL_TXT.Z
348: GBRNA_SEQ.Z
349: GBROD_SEQ.Z
350: GBSDR_TXT.Z
351: GBSYN_SEQ.Z
352: GBUNA_SEQ.Z
353: GBVRL_SEQ.Z
354: GBVRT_SEQ.Z
355:
356: NOTE: When the files are uncompressed (as instructed in Section 2.3)
357: the `.Z' will be removed from the end of the file name and the
358: characters after the underscore will become the file extension. For
359: example, `GBACC_IDX.Z' will be named `GBACC.IDX'.
360:
361: One TK-50 cartridge is required; the files are in directory order and
362: are compressed as described above.
363:
364: 2.2.4 Unix tar Format
365:
366: The files are compressed with the Unix compress utility before the tar
367: command is executed; they must therefore be uncompressed before use
368: (see Section 2.4 below for details). Approximately 45 MB of disk space
369: is required for the Release 66.0 files when in the compressed format;
370: the uncompressed files require approximately 139 MB.
371:
372: The tar file uses variable length records; the records are not padded
373: to 80 characters with space characters. To get fixed-length,
374: 80-character records, first uncompress the.Z files. Then use dd with
375: the conv=block and cbs=80 options set to filter the file. If you pad
376: the records, it adds approximately 58 MB of disk space.
377:
378: In the Unix tar file, the files are in directory order rather than in
379: the order shown for the fixed-length record formats. In addition, the
380: file names are in lowercase letters. All files fit on one 6250 bpi
381: tape or Sun cartridge. At 1600 bpi, two tapes are required, and the
382: files are divided between the tapes as follows:
383:
384: Unix Tar File Order:
385:
386: Tape 1
387:
388: gbacc.idx.Z
389: gbaut.idx.Z
390: gbbct.seq.Z
391: gbdat.frm.Z
392: gbhgm.idx.Z
393: gbinv.seq.Z
394: gbjou.idx.Z
395: gbkey.idx.Z
396: gbmam.seq.Z
397: gbnew.txt.Z
398: gborg.seq.Z
399: gbphg.seq.Z
400:
401:
402: Tape 2
403:
404: gbpln.seq.Z
405: gbpri.seq.Z
406: gbrel.txt.Z
407: gbrna.seq.Z
408: gbrod.seq.Z
409: gbsdr.txt.Z
410: gbsyn.seq.Z
411: gbuna.seq.Z
412: gbvrl.seq.Z
413: gbvrt.seq.Z
414:
415: NOTE: When the files are uncompressed the `.Z' extension is removed
416: from the file names.
417:
418: 2.2.5 File Sizes
419:
420: The following table indicates the approximate sizes of the individual
421: files in this release. Since minor changes to some of the files may
422: occur after the release notes are printed, these sizes should not be
423: used to determine
424:
425: file integrity. They are provided as an aid to planning only. The
426: columns in the table have the following meanings:
427:
428: (1) - Sizes (in bytes) of the fixed-length record files (described in
429: Section 2.2.2)
430:
431: (2) - Sizes (in bytes) of the compressed files included in the Unix
432: tarfile (Section 2.2.4)
433:
434: (3) - Sizes (in bytes) of the files in the Unix tarfile after
435: uncompression (Section 2.2.4)
436:
437: (4) - Sizes (in blocks) of the compressed files included in the VMS
438: Backup saveset (Section 2.2.3 and 2.4)
439:
440: (5) - Sizes (in blocks) of the files in the VMS Backup saveset after
441: decompression (Sections 2.2.3 and 2.3)
442:
443:
444: File (1) (2) (3) (4) (5)__
445:
446: GBACC.IDX 3616640 558701 1641276 1101 3294
447: GBAUT.IDX 9839040 1978063 5541281 4040 11106
448: GBBCT.SEQ 20356960 4995239 15062023 10510 30263
449: GBDAT.FRM 39600 8450 21155 19 43
450: GBHGM.IDX 168160 42557 127617 86 254
451: GBINV.SEQ 13488080 3145845 9751044 6643 19599
452: GBJOU.IDX 4474400 750259 2454640 1530 4928
453: GBKEY.IDX 3676480 866411 2497752 1754 4981
454: GBMAM.SEQ 6563120 1524109 4708672 3223 9463
455: GBNEW.TXT 96800 16960 55668 37 113
456: GBORG.SEQ 5763760 1375541 4241530 2893 8524
457: GBPHG.SEQ 2378720 560271 1671525 1179 3363
458: GBPLN.SEQ 14084400 3392955 10325013 7124 20746
459: GBPRI.SEQ 32726160 7473028 23428438 15823 47079
460: GBREL.TXT 367680 96837 298595 201 600
461: GBRNA.SEQ 4694720 821816 2794838 1798 5636
462: GBROD.SEQ 30187600 6727065 21356518 14252 42920
463: GBSDR.TXT 3289440 1074335 3285363 2181 6578
464: GBSYN.SEQ 2838560 611437 1841890 1308 3706
465: GBUNA.SEQ 11974000 2752439 8627808 5717 17370
466: GBVRL.SEQ 18141760 4437537 13647843 9285 27420
467: GBVRT.SEQ 7638320 1747319 5408466 3705 10877
468: AAAREADME.TXT 2 2
469: DCOMPRESS.CLD 4 4
470: DCOMPRESS.EXE 150 150
471: DECMPRESS.COM 2 2
472:
473:
474: Totals 196404400 44957174 138788955 94567 279021
475:
476: NOTE: The sizes of the CD ROM files are approximately the same as
477: those of the uncompressed Unix tar files (Column 3). The addition of
478: carriage-return/line-feed characters at the end of each line in the CD
479: ROM files increases the total size of the distribution by
480: approximately 3 Mb.
481:
482: 2.3 Loading Data Bank Files in VAX/VMS Backup Format
483:
484: In order to use the VAX/VMS Backup Saveset format, you must be running
485: release 5.0 or greater of the VMS operating system. If you are not
486: running release 5.0 or greater, you should order the unlabelled ASCII
487: format instead of VAX/VMS Backup.
488:
489: The following command should be used to load the saveset into the
490: current directory on your disk:
491:
492: BACKUP/LOG MSA0:GENBANK []
493:
494: (NOTE: Replace `MSA0' with the identifier for your disk.)
495:
496: The following command should be used to uncompress the files. NOTE: If
497: you do not want to keep all of the files, delete those you do not want
498: before you run the uncompress procedure. The uncompress routine works
499: on all the files in the directory that have a `.Z' extension.
500:
501: @DECMPRESS
502:
503: The following commands were used to create the VAX/VMS Backup Saveset.
504: NOTE: The `...' indicates that the following line is a continuation
505: and should be typed without a break.
506:
507: For 6250 bpi tape: BACKUP/DENSITY=6250/BUFFER=5/VERIFY/INTERCHANGE/...
508:
509: LIST=GB.LST GB1:[GENBANK.PROD]GB*.* TAPE:GENBANK
510:
511: For 1600 bpi tape: BACKUP/DENSITY=1600/BUFFER=5/VERIFY/INTERCHANGE/...
512:
513: LIST=GB.LST GB1:[GENBANK.PROD]GB*.* TAPE:GENBANK
514:
515: For TK-50 cartridge: BACKUP/BUFFER=5/VERIFY/INTERCHANGE/...
516:
517: LIST=GB.LST GB1:[GENBANK.PROD]GB*.* TAPE:GENBANK
518:
519: 2.4 Loading Data Bank Files in Unix tar Format
520:
521: The following commands should be used to load the Unix tar files into
522: the current directory on your disk:
523:
524: tar xvfb /dev/rmt8 126 gb*.Z
525:
526: uncompress gb*.Z
527:
528: (NOTE: Replace `rmt8' with the identifier for your device.)
529:
530: The following command was used to write the tarfile on the
531: distribution tape:
532:
533: For 6250 and 1600 bpi tapes (execute the command twice, once for each
534: tape, for 1600 bpi):
535:
536: tar cvfb /dev/rmt8 20 gb*.Z
537:
538: For Sun cartridge:
539:
540: tar cvfb /dev/rst8 126 gb*.Z
541:
542: 3. FILE FORMATS
543:
544: 3.1 File Header Information
545:
546: Each of the twenty-two files on the distribution tape begins with the
547: same header, except for the first line, which contains the file name,
548: and the sixth line, which contains the title of the file. The first
549: line of the file contains the file name in character positions 1 to 9
550: and the full data bank name (Genetic Sequence Data Bank) starting in
551: column 20. The brief names of the files in this release are listed in
552: section 2.2.
553:
554: The second line contains the date of the current release in the form
555: `day month year', beginning in position 26. The fourth line contains
556: the current GenBank release number. The release number appears in
557: positions 41 to 45 and consists of two numbers separated by a decimal
558: point. The number to the left of the decimal is the major release
559: number. The digit to the right of the decimal indicates the version of
560: the major release; it is zero for the first version. The sixth line
561: contains a title for the file. The eighth line lists the number of
562: entries (loci), number of bases (or base pairs), and number of reports
563: of sequences in this release of GenBank. These numbers are
564: right-justified at fixed positions. The number of entries appears in
565: positions 1 to 7, the number of bases in positions 15 to 22, and the
566: number of reports in positions 36 to 40. (There are more reports of
567: sequences than entries since reported sequences that overlap or
568: duplicate each other are combined into single entries.) The third,
569: fifth, seventh, and ninth lines are blank.
570:
571: 1 10 20 30 40 50 60 70 79
572:
573: ---------+---------+---------+---------+---------+---------+---------+---------
574:
575: GBACC.IDX Genetic Sequence Data Bank
576: 15 December 1990
577:
578: GenBank(R) Release 66.0
579:
580: Accession Number Index
581:
582: 41057 loci, 51306092 bases, from 50908 reported sequences
583:
584: ---------+---------+---------+---------+---------+---------+---------+---------
585:
586: 1 10 20 30 40 50 60 70 79
587:
588: Example 1. Sample File Header
589:
590: 3.2 Directory Files
591:
592: 3.2.1 Short Directory File
593:
594: The short directory file contains brief descriptions of all of the
595: sequence entries contained in this release. These descriptions are in
596: thirteen groups, one group for each of the thirteen sequence entry
597: data files. The first record at the beginning of a group of entries
598: contains the name of the group in uppercase characters, beginning in
599: position 21. The organism groups are PRIMATE, RODENT, OTHER MAMMAL,
600: OTHER VERTEBRATE, INVERTEBRATE, PLANT, ORGANELLE, BACTERIAL,
601: STRUCTURAL RNA, VIRAL, PHAGE, SYNTHETIC, or UNANNOTATED. The second
602: record is blank.
603:
604: Each record in the short directory contains the sequence entry name
605: (LOCUS) in the first 12 positions, followed by a brief definition of
606: the sequence beginning in column 13. The definition is truncated (at
607: the end of a word) to leave room at the right margin for at least one
608: space, the sequence length, and the letters `bp'. The length of the
609: sequence is printed right-justified to column 77, followed by the
610: letters `bp' in columns 78 and 79. The next-to-last record for a group
611: has `ZZZZZZZZZZ' in its first ten positions (where the entry name
612: would normally appear). The last record is a blank line. An example of
613: the short directory file format, showing the descriptions of the last
614: entries in the Other Vertebrate sequence data file and the first
615: entries of the Invertebrate sequence data file, is reproduced below:
616:
617: 1 10 20 30 40 50 60 70 79
618:
619: ---------+---------+---------+---------+---------+---------+---------+---------
620:
621: ZEFHOX21 Zebrafish Hox-2.1 gene homologue (ZF-21). 291bp
622: ZEFRZF21 Zebrafish mRNA for homeotic protein ZF-21. 2073bp
623: ZEFZF54 Zebrafish homeotic gene ZF-54. 246bp
624: ZEFZFEN Zebrafish engrailed-like homeobox sequence. 327bp
625: ZZZZZZZZZZ
626:
627: INVERTEBRATE
628:
629: ACAACTI Amoeba (A. castellanii) actin gene-i. 1571bp
630: ACAJJE A.castellanii 18S ribosomal RNA. 241bp
631: ACAJJEA A.castellanii 18S ribosomal RNA. 258bp
632: ACAJJEB A.castellanii 18S ribosomal RNA. 257bp
633:
634: ---------+---------+---------+---------+---------+---------+---------+---------
635:
636: 1 10 20 30 40 50 60 70 79
637:
638: Example 2. Short Directory File
639:
640: 3.2.2 New and Updated Entry File
641:
642: The directory of new and updated entries is a list of those entries
643: that have been newly added or that have undergone substantive revision
644: in this release. These entries are listed in the same order in which
645: they appear in the actual data files; they are divided into thirteen
646: groups, one group for each of the thirteen sequence entry data files.
647: The first record at the beginning of a group of entries designates
648: that group, beginning in position 21. The second record is blank and
649: the third record has asterisks in its first ten positions. Within each
650: group, the entries are listed alphabetically. For each entry, the new
651: and updated entry file gives the information included under the LOCUS
652: and DEFINITION keywords in the same format in which they appear in the
653: actual sequence entry; these categories are described in section
654: 3.5.2. After the last record of an entry comes a record containing
655: asterisks in its first ten positions. At the end of each group, a
656: dummy entry contains only a LOCUS line with the entry name
657: `ZZZZZZZZZZ'. Therefore, the next-to-last record has ten asterisks in
658: its first ten positions; the last record of the group is blank.
659:
660: The following excerpt from the current release shows the last new or
661: revised entry from the Other Vertebrate sequence data file, followed
662: by the first new or revised entry from the Invertebrate sequence data
663: file:
664:
665: 1 10 20 30 40 50 60 70 79
666:
667: ---------+---------+---------+---------+---------+---------+---------+---------
668:
669: **********
670: LOCUS RANCRYR23 266 bp ds-DNA VRT 20-SEP-1990
671: DEFINITION R.temporaria rho-crystallin gene, exon X.
672: **********
673: LOCUS ZZZZZZZZZZ
674: **********
675:
676: INVERTEBRATE
677:
678: **********
679: LOCUS ACAJJE 241 bp ss-rRNA INV 05-NOV-1990
680: DEFINITION A.castellanii 18S ribosomal RNA.
681: **********
682:
683: ---------+---------+---------+---------+---------+---------+---------+---------
684:
685: 1 10 20 30 40 50 60 70 79
686:
687: Example 3. New and Updated Entry File
688:
689: 3.3 Index Files
690:
691: There are five files containing indices to the entries in this
692: release:
693:
694: Accession number index file
695: Keyword phrase index file
696: Author name index file
697: Journal citation index file
698: Gene symbol index file
699:
700: The index keys (accession numbers, keywords, authors, journals, and
701: gene symbols.) of an index are sorted alphabetically. (The index keys
702: for the keyword phrases and author names appear in uppercase
703: characters even though they appear in mixed case in the sequence
704: entries.) Under each index key, the names of the sequence entries
705: containing that index key are listed alphabetically. Each sequence
706: name is also followed by its data file division and primary accession
707: number. The following codes are used to designate the data file
708: divisions:
709:
710: 1. PRI - primates
711: 2. ROD - rodents
712: 3. MAM - other mammals
713: 4. VRT - other vertebrates
714: 5. INV - invertebrates
715: 6. PLN - plants, fungi, and algae
716: 7. ORG - organelles
717: 8. BCT - bacteria
718: 9. RNA - structural RNAs
719: 10. VRL - viruses
720: 11. PHG - bacteriophage
721: 12. SYN - synthetic sequences
722: 13. UNA - unannotated sequences
723:
724: The index key begins in column 1 of a record. An 11-character field
725: for the sequence entry name starts in position 14 of a record,
726: followed by a 3-character field for the data file division, starting
727: at position 25 and ending at position 27, and a 6-character field for
728: the primary accession number, starting at position 29 and ending at
729: position 34. All entries in the fields are left-justified.
730:
731: Beginning at positions 36 and 58, the three fields repeat, so three
732: sets of sequence information can appear in one record. If there are
733: more than three entry names, the next records are used; the index key
734: is not repeated. For the accession number and human gene symbol index
735: files, the entry names begin in the same record as the index key,
736: since the key is always less than 12 characters. In the other index
737: files, the entry names begin on the record following the index key
738: record.
739:
740: 3.3.1 Accession Number Index File
741:
742: Accession numbers consist of a single letter followed by five digits.
743: They provide an unchanging designation for the data with which they
744: are associated, and we encourage you to cite accession numbers
745: whenever you refer to data from the data bank. The primary accession
746: number is the first accession number of an entry. It is unique to that
747: entry. Citation of that number will enable other investigators to
748: locate the data no matter what entry name changes or other data bank
749: reorganizations may occur. The accession numbers, however, carry no
750: intrinsic information about the data.
751:
752: In addition to the primary accession number, some entries have
753: secondary accession numbers. Secondary accession numbers arise for a
754: number of reasons. For example, a single accession number may
755: initially be assigned to the sequence in an article. If it is later
756: discovered that the sequence must be entered into the data bank as
757: multiple entries, each entry would receive a new primary accession
758: number; the previous accession number would appear as the secondary
759: accession number in each entry.
760:
761: The following excerpt from the accession number index file illustrates
762: the format of the index:
763:
764: 1 10 20 30 40 50 60 70 79
765:
766: ---------+---------+---------+---------+---------+---------+---------+---------
767:
768: J00316 HUMTBB11P PRI J00316
769: J00317 HUMTBB46P PRI J00317
770: J00318 HUMUG1 PRI J00318
771: J00319 HUMUG1PA PRI J00319
772: J00320 HUMVIPMR1 PRI L00154 HUMVIPMR2 PRI L00155 HUMVIPMR3 PRI L00156
773: HUMVIPMR4 PRI L00157 HUMVIPMR5 PRI L00158
774: J00321 BABA1AT PRI J00321
775: J00322 CHPRSA PRI J00322
776: J00323 AGMRSASPC PRI J00323
777: J00324 BABATIII PRI J00324
778:
779: ---------+---------+---------+---------+---------+---------+---------+---------
780:
781: 1 10 20 30 40 50 60 70 79
782:
783: Example 4. Accession Number Index File
784:
785: If the same accession number is found in more than one entry (a result
786: of the infrequent occasions when a single entry is split into two or
787: more separate entries), then the additional entries and groups in
788: which the number appears are also given.
789:
790: 3.3.2 Keyword Phrase Index File
791:
792: Keyword phrases consist of names for gene products and other
793: characteristics of sequence entries. There are approximately 12,000
794: keyword phrases. An excerpt from the keyword phrase index file is
795: shown below:
796:
797: 1 10 20 30 40 50 60 70 79
798:
799: ---------+---------+---------+---------+---------+---------+---------+---------
800:
801: DNA GYRASE
802: ECOGYRA BCT X06744 ECORECF BCT K02179 ECORECFA BCT X04341
803: DNA HELICASE
804: ECOHELIV BCT J04726 ECOUVRD BCT X00738
805: DNA INVERTASE
806: ECOPIN BCT K00676 ECOPINP BCT K03521 PMUGINMOM PHG V01463
807: STAINVSA BCT M36694
808: DNA LIGASE
809: ECOLIG BCT M24278 ECOLIGA BCT M30255 PT4G30 PHG X00039
810: PT6LIG55 PHG M38465 PT7CG PHG J02518 YSCCDC9 PLN X03246
811: YSPCDC17 PLN X05107
812: DNA LIGASE I
813: HUMLIGAA PRI M36067
814: DNA MATURATION
815: HS1CAS VRL M22962
816: DNA METHYLASE
817: HEHMTS BCT J02677
818: DNA METHYLATION
819: HEHMTS BCT J02677 HUMSPM1 PRI X06585 HUMSPM2 PRI X06586
820: HUMSPM3 PRI X06587 HUMSPM4 PRI X06588 HUMSPM5 PRI X07490
821: HUMSPM6 PRI X07491 HUMSPM7 PRI X07492 HUMSPM8 PRI X07493
822: HUMSPM9 PRI X07494
823:
824: ---------+---------+---------+---------+---------+---------+---------+---------
825:
826: 1 10 20 30 40 50 60 70 79
827:
828: Example 5. Keyword Phrase Index File
829:
830:
831:
832: 3.3.3 Author Name Index File
833:
834: The author name index file lists all of the author names that appear
835: in the citations. An excerpt from the author name index file is shown
836: below:
837:
838:
839:
840: 1 10 20 30 40 50 60 70 79
841:
842: ---------+---------+---------+---------+---------+---------+---------+---------
843:
844: JACKOWSKI,S.
845: ECOPANF BCT M30953
846: JACKS,C.M.
847: MUSRP32A ROD M35397 MUSRPL32A ROD M23453
848: JACKS,T.
849: MMTGXPPR VRL M16766
850: JACKSON,A.
851: BOVMHBOLA MAM M21044 BOVMHBOLB MAM M21043
852: JACKSON,A.O.
853: BSMRVPS SYN M28702 M23023 UNA M23023 MBSRNAG VRL M11511
854: MBSRNAGND VRL M16577 MBSRNAGSA VRL M11509 MBSRNAGSB VRL M11510
855: MBSRNAGT VRL M16576 SAPCAP VRL M17182 SYENCP VRL M17210
856: SYERNA VRL M13950 SYESC6 VRL M35689
857:
858: ---------+---------+---------+---------+---------+---------+---------+---------
859:
860: 1 10 20 30 40 50 60 70 79
861:
862: Example 6. Author Name Index File
863:
864:
865:
866: 3.3.4 Journal Citation Index File
867:
868: The journal citation index file lists all of the citations that appear
869: in the references. All citations are truncated to 80 characters. An
870: excerpt from the citation index file is shown below:
871:
872:
873:
874: 1 10 20 30 40 50 60 70 79
875:
876: ---------+---------+---------+---------+---------+---------+---------+---------
877:
878: (IN) THE CELL NUCLEUS, VOLUME VIII: 261-305; ACADEMIC PRESS, NEW YORK (1981)
879: RATUR5A RNA K00783
880: (IN) THE IMMUNE SYSTEM: 132-138; S. KARGER, NEW YORK (1981).
881: HUMIGHVX PRI M35415
882: (IN) THE LENS: TRANSPARANCY AND CATARACT: 171-179; EURAGE, RIJSWIJK (1986)
883: RANCRYG2A VRT K02264 RANCRYG4A VRT K02266 RANCRYG5A VRT M22529
884: RANCRYG6A VRT M22530 RANCRYR VRT X00659
885: (IN) UCLA SYMP. MOL. CELL. BIOL. NEW SER., VOL. 77: 339-352; ALAN R. LISS, INC.
886: BOVTRNB2A MAM M36431 HUMTRNB PRI M36429 HUMTRNB1 PRI M36430
887: (IN) UCLA SYMPOSIA: 575-584; ALAN R. LISS, INC., NEW YORK (1987)
888: PFAHGPRT INV M54896
889: (IN) VIRUS RESEARCH. PROCEEDINGS OF 1973 ICN-UCLA SYMPOSIUM: 533-544; ACADEMIC
890: LAMCG PHG J02459
891: ACTA BIOCHIM. POL. 24, 301-318 (1977)
892: LUPTRFJ RNA K00345 LUPTRFN RNA K00346
893:
894: ---------+---------+---------+---------+---------+---------+---------+---------
895:
896: 1 10 20 30 40 50 60 70 79
897:
898: Example 7. Journal Citation Index File
899:
900: 3.3.5 Cross-Reference To Gene Symbol Libraries
901:
902: The gene symbol file contains the gene symbols used in the Genome Data
903: Base and other gene symbols, such as those for the E. coli genes. The
904: gene symbols are found in the feature table and have the form:
905: /gene="gene symbol"; an example is found in section 3.5.11.5. An
906: example of the format of the gene symbol index file follows:
907:
908: 1 10 20 30 40 50 60 70 79
909:
910: ---------+---------+---------+---------+---------+---------+---------+---------
911:
912: INFC ECOHIMA BCT K02844 ECOTHRINF BCT V00291
913: INHA HUMINHA PRI M13981 HUMINHAA PRI M13144 HUMINHAG1 PRI X04445
914: HUMINHAG2 PRI X04446 HUMINHAG2 PRI X04446
915: INHBA HUMINHBA PRI M13436
916: INHBB HUMINHBB PRI M13437 HUMINHBB1 PRI M31668 HUMINHBB2 PRI M31669
917: HUMINHBB2 PRI M31669 HUMINHIB PRI M31682
918: INS HUMINS01 PRI J00265 HUMINS01 PRI J00265 HUMINSPR PRI M10039
919: HUMINV2 PRI M13903
920: INSR HUMINSR PRI M10051 HUMINSR01 PRI M23100 HUMINSR02 PRI M32823
921: HUMINSR03 PRI M32824 HUMINSR04 PRI M32825 HUMINSR05 PRI M32826
922: HUMINSR06 PRI M32827 HUMINSR07 PRI M32828 HUMINSR08 PRI M32829
923: HUMINSR09 PRI M32830 HUMINSR10 PRI M32831 HUMINSR11 PRI M32832
924: HUMINSR12 PRI M32833 HUMINSR13 PRI M32834 HUMINSR14 PRI M32835
925: HUMINSR15 PRI M32836 HUMINSR16 PRI M32837 HUMINSR17 PRI M32838
926: HUMINSR18 PRI M32839 HUMINSR19 PRI M32840 HUMINSR20 PRI M32841
927: HUMINSR21 PRI M32842 HUMINSR22 PRI M32972 HUMINSRA PRI X02160
928: HUMINSRA01 PRI M27195 HUMINSRA02 PRI M27197 HUMINSRB PRI J03466
929: HUMINSRC PRI M29929 HUMINSRD PRI M29930 HUMINSRMUT PRI M27196
930: HUMIRSRE PRI J05043
931: INT1 HUMINT1G PRI X03072
932: INT1L1 HUMIRP PRI X07876
933: IRGA VCHIRGA BCT M37773 VCHIRGA BCT M37773 VCHIRGA BCT M37773
934: VCHIRGA BCT M37773 VCHIRGA BCT M37773 VCHIRGB BCT M55988
935: VCHIRGB BCT M55988
936:
937: ---------+---------+---------+---------+---------+---------+---------+---------
938:
939: 1 10 20 30 40 50 60 70 79
940:
941: Example 8. Gene Symbol Index File
942:
943:
944: 3.4 GenBank Data Submission Form and Error/Suggestion Report Form
945:
946: The distribution tape includes a data submission form in the file
947: GBDAT.FRM. Due to the large volume of new sequence data, we encourage
948: authors to complete this form and return it to the address listed on
949: the form. This will enable data to be entered more quickly into the
950: data bank.
951:
952: You can complete the form with any text editor. You can send the
953: completed form to GenBank on tape or floppy diskette, or
954: electronically via INTERNET or BITNET (the electronic mail address is:
955: gb-sub%[email protected]). We can use information saved on any computer
956: medium from any computer system. You can also print the form, fill it
957: in by hand, and send it to the mailing address given at the beginning
958: of the form.
959:
960: The second form in this file is the GenBank Error/Suggestion Report
961: Form. It is separated from the Data Submission Form by a form-feed
962: character (<CTRL>L, ASCII octal value 014, ASCII decimal value 12). We
963: encourage all GenBank users to report any errors to the data bank
964: staff using this form. Like the GenBank Data Submission Form, it may
965: be printed and filled in by hand and sent by mail to the address given
966: at the beginning of the form. It may also be filled out using a text
967: editor and sent to GenBank by electronic mail at the address given at
968: the top of the form.
969:
970: If you have an IBM PC or compatible computer, or a Macintosh personal
971: computer, we request that you use the Authorin program for submitting
972: sequences to the data bank. See section 5.5 for information about
973: obtaining the Authorin program at no charge.
974:
975: 3.5 Sequence Entry Files
976:
977: The distribution tape contains thirteen sequence entry data files, one
978: for each division of GenBank. Each file contains the entries for one
979: group of organisms.
980:
981: 3.5.1 File Organization
982:
983: Each of these files has the same format and consists of two parts:
984: header information (described in section 3.1) and sequence entries for
985: that division (described in the following sections).
986:
987: 3.5.2 Entry Organization
988:
989: In the second portion of a sequence entry file (containing the
990: sequence entries for that division), each record (line) consists of
991: two parts. The first part is found in positions 1 to 10 and may
992: contain:
993:
994: 1. A keyword, beginning in column 1 of the record (e.g., REFERENCE is
995: a keyword).
996:
997: 2. A subkeyword beginning in column 3, with columns 1 and 2 blank
998: (e.g., AUTHORS is a subkeyword of REFERENCE).
999:
1000: 3. Blank characters, indicating that this record is a continuation of
1001: the information under the keyword or subkeyword above it.
1002:
1003: 4. A code, beginning in column 5, indicating the nature of an entry
1004: (feature key) in the FEATURES table; these codes are described in
1005: Section 3.5.11.1 below.
1006:
1007: 5. A number, ending in column 9 of the record. This number occurs in
1008: the portion of the entry describing the actual nucleotide sequence and
1009: designates the numbering of sequence positions.
1010:
1011: 6. Two slashes (//) in positions 1 and 2, marking the end of an entry.
1012:
1013: The second part of each sequence entry record contains the information
1014: appropriate to its keyword, in positions 13 to 80 for keywords and
1015: positions 11 to 80 for the sequence.
1016:
1017: The following is a brief description of each entry field. Detailed
1018: information about each field may be found in Sections 3.5.4 to 3.5.13.
1019:
1020: LOCUS - A short unique name for the entry, chosen to suggest the
1021: sequence's definition. Mandatory keyword/exactly one record.
1022:
1023: DEFINITION - A concise description of the sequence. Mandatory
1024: keyword/one or more records.
1025:
1026: ACCESSION - The primary accession number is a unique, unchanging
1027: code assigned to each entry. (Please use this code when citing
1028: information from GenBank.) Mandatory keyword/one or more records.
1029:
1030: KEYWORDS - Short phrases describing gene products and other
1031: information about an entry. Mandatory keyword in all annotated
1032: entries/one or more records.
1033:
1034: SEGMENT - Information on the order in which this entry appears in a
1035: series of discontinuous sequences from the same molecule. Optional
1036: keyword (only in segmented entries)/exactly one record.
1037:
1038: SOURCE - Common name of the organism or the name most frequently used
1039: in the literature. Mandatory keyword in all annotated entries/one or
1040: more records/includes one subkeyword.
1041:
1042: ORGANISM - Formal scientific name of the organism (first line)
1043: and taxonomic classification levels (second and subsequent lines).
1044: Mandatory subkeyword in all annotated entries/two or more records.
1045:
1046: REFERENCE - Citations for all articles containing data reported
1047: in this entry. Includes four subkeywords and may repeat. Mandatory
1048: keyword/one or more records.
1049:
1050: AUTHORS - Lists the authors of the citation. Mandatory
1051: subkeyword/one or more records.
1052:
1053: TITLE - Full title of citation. Optional subkeyword (present
1054: in all but unpublished citations)/one or more records.
1055:
1056: JOURNAL - Lists the journal name, volume, year, and page
1057: numbers of the citation. Mandatory subkeyword/one or more records.
1058:
1059: STANDARD - Lists information about the degree to which the
1060: entry has been annotated and the level of review to which it has been
1061: subjected. Mandatory subkeyword/exactly one record.
1062:
1063: COMMENT - Cross-references to other sequence entries, comparisons to
1064: other collections, notes of changes in LOCUS names, and other remarks.
1065: Optional keyword/one or more records/may include blank records.
1066:
1067: FEATURES - Table containing information on portions of the
1068: sequence that code for proteins and RNA molecules and information on
1069: experimentally determined sites of biological significance. Optional
1070: keyword/one or more records.
1071:
1072: BASE COUNT - Summary of the number of occurrences of each base
1073: code in the sequence. Mandatory keyword/exactly one record.
1074:
1075: ORIGIN - Specification of how the first base of the reported sequence
1076: is operationally located within the genome. Where possible, this
1077: includes its location within a larger genetic map. Mandatory
1078: keyword/exactly one record.
1079:
1080: - The ORIGIN line is followed by sequence data (multiple
1081: records).
1082:
1083: // - Entry termination symbol. Mandatory at the end of an
1084: entry/exactly one record.
1085:
1086:
1087: 3.5.3 Sample Sequence Data File
1088:
1089: An example of a complete sequence entry file follows. (This example
1090: has only two entries.) Note that in this example, as throughout the
1091: data bank, numbers in square brackets indicate items in the REFERENCE
1092: list. For example, in ACARR58S, [1] refers to the paper by Mackay, et
1093: al.
1094:
1095: 1 10 20 30 40 50 60 70 79
1096:
1097: ---------+---------+---------+---------+---------+---------+---------+---------
1098:
1099: GBSMP.SEQ Genetic Sequence Data Bank
1100: 15 December 1990
1101:
1102: GenBank(R) Release 66.0
1103:
1104: Structural Rna Sequences
1105:
1106: 2 loci, 280 bases, from 2 reported sequences
1107:
1108: LOCUS AAURRA 118 bp ss-rRNA RNA 16-JUN-1986
1109: DEFINITION A.auricula-judae (mushroom) 5S ribosomal RNA.
1110: ACCESSION K03160
1111: KEYWORDS 5S ribosomal RNA; ribosomal RNA.
1112: SOURCE A.auricula-judae (mushroom) ribosomal RNA.
1113: ORGANISM Auricularia auricula-judae
1114: Eukaryota; Plantae; Thallobionta; Basidiomycotina;
1115: Phragmobasidiomycetes; Heterobasidiomycetidae; Eutremellales;
1116: Auriculariaceae; Auricularia; auricula-judae.
1117: REFERENCE 1 (bases 1 to 118)
1118: AUTHORS Huysmans,E., Dams,E., Vandenberghe,A. and De Wachter,R.
1119: TITLE The nucleotide sequences of the 5S rRNAs of four mushrooms and
1120: their use in studying the phylogenetic position of basidiomycetes
1121: among the eukaryotes
1122: JOURNAL Nucleic Acids Res. 11, 2871-2880 (1983)
1123: STANDARD full staff_review
1124: FEATURES Location/Qualifiers
1125: rRNA 1..118
1126: /note="5S ribosomal RNA"
1127: BASE COUNT 27 a 34 c 34 g 23 t
1128: ORIGIN 5' end of mature rRNA.
1129: 1 atccacggcc ataggactct gaaagcactg catcccgtcc gatctgcaaa gttaaccaga
1130: 61 gtaccgccca gttagtacca cggtggggga ccacgcggga atcctgggtg ctgtggtt
1131: //
1132: LOCUS ACARR58S 162 bp ss-rRNA RNA 15-MAR-1989
1133: DEFINITION A.castellanii (amoeba) 5.8S ribosomal RNA.
1134: ACCESSION K00471
1135: KEYWORDS 5.8S ribosomal RNA; ribosomal RNA.
1136: SOURCE A.castellani (amoeba; strain ATCC 30010) rRNA.
1137: ORGANISM Acanthamoeba castellanii
1138: Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
1139: Rhizopoda; Lobosa; Gymnamoeba; Amoebida; Acanthopodina;
1140: Acanthamoebidae; Acanthamoeba; castellanii.
1141: REFERENCE 1 (bases 1 to 162)
1142: AUTHORS Mackay,R.M. and Doolittle,W.F.
1143: TITLE Nucleotide sequences of AcanthamoebA.castellanii 5S and 5.8S
1144: ribosomal ribonucleic acids: Phylogenetic and comparative
1145: structural analyses
1146: JOURNAL Nucleic Acids Res. 9, 3321-3334 (1981)
1147: STANDARD simple staff_review
1148: COMMENT [1] also sequenced A.castellanii 5S rRNA <K03160>.
1149: FEATURES Location/Qualifiers
1150: rRNA 1..162
1151: /note="5.8S rRNA"
1152: BASE COUNT 40 a 39 c 44 g 39 t
1153: ORIGIN 5' end of mature rRNA.
1154: 1 aactcctaac aacggatatc ttggttctcg cgaggatgaa gaacgcagcg aaatgcgata
1155: 61 cgtagtgtga atcgcaggga tcagtgaatc atcgaatctt tgaacgcaag ttgcgctctc
1156: 121 gtggtttaac cccccgggag cacgttcgct tgagtgccgc tt
1157: //
1158:
1159: ---------+---------+---------+---------+---------+---------+---------+---------
1160:
1161: 1 10 20 30 40 50 60 70 79
1162:
1163: Example 9. Sample Sequence Data File
1164:
1165:
1166: 3.5.4 LOCUS Format
1167:
1168: The pieces of information contained in the LOCUS record are always
1169: found in fixed positions. The locus name (or entry name), which is
1170: always ten characters or less, begins in position 13. The locus name
1171: is designed to help group entries with similar sequences: the first
1172: three characters usually designate the organism; the fourth and fifth
1173: characters can be used to show other group designations, such as gene
1174: product; for segmented entries the last character is one of a series
1175: of sequential integers.
1176:
1177: The number of bases or base pairs in the sequence ends in position 29.
1178: The letters `bp' are in positions 31 to 32. Positions 34 to 36 give
1179: the number of strands of the sequence. Positions 37 to 40 give the
1180: topology of molecule sequenced. If the sequence is of a special type,
1181: a notation (such as `circular') is included in positions 43 to 52.
1182:
1183: GenBank sequence entries are divided among thirteen taxonomic
1184: divisions. Each entry's division is identified by a three-letter code
1185: in positions 53 to 55. See Section 3.3 for the division codes.
1186:
1187: Positions 63 to 73 of the record contain the date the entry was
1188: entered or underwent any substantial revisions, such as the addition
1189: of newly published data, in the form dd-MMM-yyyy.
1190:
1191: The detailed format for the LOCUS record is as follows:
1192:
1193: Positions Contents
1194:
1195: 1-12 LOCUS
1196: 13-22 Locus name
1197: 23-29 Length of sequence, right-justified
1198: 31-32 bp
1199: 34-36 Blank, ss- (single-stranded), ds- (double-stranded), or
1200: ms- (mixed-stranded)
1201: 37-40 Blank, DNA, RNA, tRNA (transfer RNA), rRNA (ribosomal RNA),
1202: mRNA (messenger RNA), or uRNA (small nuclear RNA)
1203: 43-52 Blank (implies linear), circular, or tandem
1204: 53-55 The division code (see Section 3.3)
1205: 63-73 Date, in the form dd-MMM-yyyy (e.g., 15-DEC-1990)
1206:
1207: 3.5.5 DEFINITION Format
1208:
1209: The DEFINITION record gives a brief description of the sequence,
1210: proceeding from general to specific. It starts with the common name of
1211: the source organism, then gives the criteria by which this sequence is
1212: distinguished from the remainder of the source genome, such as the
1213: gene name and what it codes for, or the protein name and mRNA, or some
1214: description of the sequence's function (if the sequence is
1215: non-coding). If the sequence has a coding region, the description may
1216: be followed by a completeness qualifier, such as cds (complete coding
1217: sequence). The length is limited to three lines and the last line must
1218: end with a period.
1219:
1220: 3.5.6 ACCESSION Format
1221:
1222: This field contains a series of six-character identifiers (accession
1223: numbers: first character a letter, the remainder digits). The primary
1224: (first) accession number occupies positions 13 to 18; subsequent
1225: accession numbers occupy positions 20 to 25, 27 to 32, 34 to 39, 41 to
1226: 46, 48 to 53, 55 to 60, 62 to 67, and 69 to 74. No punctuation occurs
1227: between accession numbers or after the final accession number;
1228: accession numbers are separated only by one space.
1229:
1230: 3.5.7 KEYWORDS Format
1231:
1232: The KEYWORDS field does not appear in unannotated entries, but is
1233: required in all annotated entries. Keywords are separated by
1234: semicolons; a keyword may be a single word or a phrase consisting of
1235: several words. Each line in the keywords field ends in a semicolon;
1236: the last line ends with a period. If no keywords are included in the
1237: entry, the KEYWORDS record contains only a period.
1238:
1239: 3.5.8 SEGMENT Format
1240:
1241: The SEGMENT keyword is used when two (or more) entries of known
1242: relative orientation are separated by a short (<10 kb) stretch of DNA.
1243: It is limited to one line of the form `n of m', where `n' is the
1244: segment number of the current entry and `m' is the total number of
1245: segments.
1246:
1247: 3.5.9 SOURCE Format
1248:
1249: The SOURCE field consists of two parts. The first part is found after
1250: the SOURCE keyword and contains free-format information including an
1251: abbreviated form of the organism name followed by a molecule type;
1252: multiple lines are allowed, but the last line must end with a period.
1253: The second part consists of information found after the ORGANISM
1254: subkeyword. The formal scientific name for the source organism (genus
1255: and species, where appropriate) is found on the same line as ORGANISM.
1256: The records following the ORGANISM line list the taxonomic
1257: classification levels, separated by semicolons and ending with a
1258: period.
1259:
1260: 3.5.10 REFERENCE Format
1261:
1262: The REFERENCE field consists of five parts: the keyword REFERENCE, and
1263: the subkeywords AUTHORS, TITLE (optional), JOURNAL and STANDARD.
1264:
1265: The REFERENCE line contains the number of the particular reference and
1266: (in parentheses) the range of bases in the sequence entry reported in
1267: this citation. Additional prose notes may also be found within the
1268: parentheses. The numbering of the references does not reflect
1269: publication dates or priorities.
1270:
1271: The AUTHORS line lists the authors in the order in which they appear
1272: in the cited article. Last names are separated from initials by a
1273: comma (no space); there is no comma before the final `and'. The list
1274: of authors ends with a period.
1275:
1276: The TITLE line is an optional field, although it appears in the
1277: majority of entries. It does not appear in unpublished sequence data
1278: entries that have been deposited directly into the GenBank data bank,
1279: the EMBL Nucleotide Sequence Data Library, or the DNA Data Bank of
1280: Japan. The TITLE field does not end with a period.
1281:
1282: The JOURNAL line gives the appropriate literature citation for the
1283: sequence in the entry. The word `Unpublished' will appear after the
1284: JOURNAL subkeyword if the data did not appear in the scientific
1285: literature, but was directly deposited into the data bank. For
1286: published sequences the JOURNAL line gives the Thesis, Journal, or
1287: Book citation, including the year of publication, the specific
1288: citation, or In press.
1289:
1290: The STANDARD line contains information about:
1291:
1292: The degree to which the entry has been annotated:
1293:
1294: `unannotated' for unannotated entries which include citation and
1295: sequence only.
1296:
1297: `simple' for unannotated entries which include the organism name and
1298: protein coding regions as well as the citation and sequence.
1299:
1300: `full' for fully annotated entries which include all the data items
1301: that were described by the author.
1302:
1303: The level of modification and review:
1304:
1305: `automatic' for data subjected only to automated (i.e., software)
1306: checks.
1307:
1308: `staff_entry' for data that passed both automated and annotator
1309: checks.
1310:
1311: `staff_review' for data that passed previous review levels as well as
1312: a review by senior annotators and/or outside experts.
1313:
1314: The format for the STANDARD line is: annotation degree <SPACE> review level
1315:
1316:
1317: 3.5.11 FEATURES Format
1318:
1319: This release uses the new feature table format. This format has been
1320: designed jointly by GenBank, the EMBL Nucleotide Sequence Data
1321: Library, and the DNA Data Bank of Japan, and will be common to all
1322: three data banks.
1323:
1324: The feature table contains information about genes and gene products,
1325: as well as regions of biological significance reported in the
1326: sequence. The feature table contains information on regions of the
1327: sequence that code for proteins and RNA molecules. It also enumerates
1328: differences between different reports of the same sequence, and
1329: provides cross-references to other data collections, as described in
1330: more detail below.
1331:
1332: The first line of the feature table is a header that includes the
1333: keyword `FEATURES' and the column header `Location/Qualifier.' Each
1334: feature consists of a descriptor line containing a feature key and a
1335: location (see sections below for details). If the location does not
1336: fit on this line, a continuation line may follow. If further
1337: information about the feature is required, one or more lines
1338: containing feature qualifiers may follow the descriptor line.
1339:
1340: The feature key begins in column 6 and may be no more than 15
1341: characters in length. The location begins in column 22. Feature
1342: qualifiers begin on subsequent lines at column 22. Location,
1343: qualifier, and continuation lines may extend from column 22 to 80.
1344:
1345: Feature tables are optional. However, a feature table must include one
1346: header line and at least one feature descriptor line.
1347:
1348: The sections below provide a brief introduction to the new feature
1349: table format. For a thorough description of the new feature table
1350: format, see the document `The DDBJ/EMBL/GenBank Feature Table:
1351: Definition.' If you would like a copy of this publication, contact
1352: GenBank at the address shown on the front page of these Release Notes.
1353:
1354: 3.5.11.1 Feature Key Names
1355:
1356: The first column of the feature descriptor line contains the feature
1357: key. It starts at column 6 and can continue to column 20. The list of
1358: valid feature keys is shown below.
1359:
1360: allele Related strain contains alternative gene form
1361: attenuator Sequence related to transcription termination
1362: CAAT_signal `CAAT box' in eukaryotic promoters
1363: CDS Sequence coding for amino acids in protein (includes stop
1364: codon)
1365: cellular Region of cellular DNA
1366: conflict Independent determinations differ
1367: D-loop Displacement loop
1368: enhancer Cis-acting enhancer of promoter function
1369: exon Region that codes for part of spliced mRNA
1370: GC_signal `GC box' in eukaryotic promoters
1371: iDNA Intervening DNA eliminated by recombination
1372: insertion_seq Insertion sequence (IS), a small transposon
1373: intron Transcribed region excised by mRNA splicing
1374: LTR Long terminal repeat
1375: mat_peptide Mature peptide coding region (does not include stop codon)
1376: misc_binding Miscellaneous binding site
1377: misc_difference Miscellaneous difference feature
1378: misc_feature Region of biological significance that cannot be described by
1379: any other feature
1380: misc_recomb Miscellaneous recombination feature
1381: misc_RNA Miscellaneous transcript feature not defined by other RNA keys
1382: misc_signal Miscellaneous signal
1383: misc_structure Miscellaneous DNA or RNA structure
1384: modified_base The indicated base is a modified nucleotide
1385: mRNA Messenger RNA
1386: mutation A mutation alters the sequence here
1387: old_sequence Presented sequence revises a previous version
1388: polyA_signal Signal for cleavage & polyadenylation
1389: polyA_site Site at which polyadenine is added to mRNA
1390: precursor_RNA Any RNA species that is not yet the mature RNA product
1391: prim_transcript Primary (unprocessed) transcript
1392: primer_bind Non-covalent primer binding site
1393: promoter A region involved in transcription initiation
1394: protein_bind Non-covalent protein binding site on DNA or RNA
1395: provirus Proviral sequence
1396: RBS Ribosome binding site
1397: rep_origin Replication origin for duplex DNA
1398: repeat_region Sequence containing repeated subsequences
1399: repeat_unit One repeated unit of a repeat_region
1400: rRNA Ribosomal RNA
1401: satellite Satellite repeated sequence
1402: scRNA Small cytoplasmic RNA
1403: sig_peptide Signal peptide coding region
1404: snRNA Small nuclear RNA
1405: stem_loop Hair-pin loop structure in DNA or RNA
1406: TATA_signal `TATA box' in eukaryotic promoters
1407: terminator Sequence causing transcription termination
1408: transit_peptide Transit peptide coding region
1409: transposon Transposable element (TN)
1410: tRNA Transfer RNA
1411: unsure Authors are unsure about the sequence in this region
1412: variation A related population contains stable mutation
1413: virion Virion (encapsidated) viral sequence
1414: - (hyphen) Placeholder
1415: -10_signal `Pribnow box' in prokaryotic promoters
1416: -35_signal `-35 box' in prokaryotic promoters
1417: 3'clip 3'-most region of a precursor transcript removed in processing
1418: 3'UTR 3' untranslated region (trailer)
1419: 5'clip 5'-most region of a precursor transcript removed in processing
1420: 5'UTR 5' untranslated region (leader)
1421:
1422: 3.5.11.2 Feature Location
1423:
1424: The second column of the feature descriptor line designates the
1425: location of the feature in the sequence. The location descriptor
1426: begins at position 22. Several conventions are used to indicate
1427: sequence location.
1428:
1429: Base numbers in location descriptors refer to numbering in the entry,
1430: which is not necessarily the same as the numbering scheme used in the
1431: published report. The first base in the presented sequence is numbered
1432: base 1. Sequences are presented in the 5' to 3' direction.
1433:
1434: Location descriptors can be one of the following:
1435:
1436: 1. A single base;
1437: 2. A contiguous span of bases;
1438: 3. A site between two bases;
1439: 4. A single base chosen from a range of bases;
1440: 5. A single base chosen from among two or more specified bases;
1441: 6. A joining of sequence spans;
1442: 7. A reference to an entry other than the one to which the feature
1443: belongs (i.e., a remote entry), followed by a location descriptor
1444: referring to the remote sequence;
1445: 8. A literal sequence (a string of bases enclosed in quotation marks).
1446:
1447: A site between two residues, such as an endonuclease cleavage site, is
1448: indicated by listing the two bases separated by a carat (e.g., 23^24).
1449:
1450: A single residue chosen from a range of residues is indicated by the
1451: number of the first and last bases in the range separated by a single
1452: period (e.g., 23.79). The symbols < and > indicate that the end point
1453: of the range is beyond the specified base number.
1454:
1455: A contiguous span of bases is indicated by the number of the first and
1456: last bases in the range separated by two periods (e.g., 23..79). The
1457: symbols < and > indicate that the end point of the range is beyond the
1458: specified base number. Starting and ending positions can be indicated
1459: by base number or by one of the operators described below.
1460:
1461: Operators are prefixes that specify what must be done to the indicated
1462: sequence to locate the feature. The following are the operators
1463: available, along with their most common format and a description.
1464:
1465: complement (location): The feature is complementary to the location
1466: indicated. Complementary strands are read 5' to 3'.
1467:
1468: join (location, location, .. location): The indicated elements should
1469: be placed end to end to form one contiguous sequence.
1470:
1471: order (location, location, .. location): The elements are found in the
1472: specified order in the 5' to 3' direction, but nothing is implied
1473: about the rationality of joining them.
1474:
1475: group (location, location, .. location): The elements are related and
1476: should be grouped together, but no order is implied.
1477:
1478: one-of (location, location, .. location): The element can be any one,
1479: but only one, of the items listed.
1480:
1481: replace (location, location): The first location indicated should be
1482: replaced by the sequence from the second location; used for
1483: insertions, deletions, and variants.
1484:
1485: 3.5.11.3 Feature Qualifiers
1486:
1487: Qualifiers provide additional information about features. They take
1488: the form of a slash (/) followed by a qualifier name and, if
1489: applicable, an equal sign (=) and a qualifier value. Feature
1490: qualifiers begin at column 22.
1491:
1492: Qualifiers convey many types of information. Their values can,
1493: therefore, take several forms:
1494:
1495: 1. Free text;
1496: 2. Controlled vocabulary or enumerated values;
1497: 3. Citations or reference numbers;
1498: 4. Sequences;
1499: 5. Feature labels.
1500:
1501: Text qualifier values must be enclosed in double quotation marks. The
1502: text can consist of any printable characters (ASCII values 32-126
1503: decimal). If the text string includes double quotation marks, each set
1504: must be `escaped' by placing a double quotation mark in front of it
1505: (e.g., /note="This is an example of ""escaped"" quotation marks").
1506:
1507: Some qualifiers require values selected from a limited set of choices.
1508: For example, the `/direction' qualifier has only three values `left,'
1509: `right,' or `both.' These are called controlled vocabulary qualifier
1510: values. Controlled qualifier values are not case sensitive; they can
1511: be entered in any combination of upper- and lowercase without changing
1512: their meaning.
1513:
1514: Citation or published reference numbers for the entry should be
1515: enclosed in square brackets ([]) to distinguish them from other
1516: numbers. Multiple citations are separated by commas (e.g.,
1517: [1],[2],[3]).
1518:
1519: A literal sequence of bases (e.g., "atgcatt") should be enclosed in
1520: quotation marks. Literal sequences are distinguished from free text by
1521: context. Qualifiers that take free text as their values do not take
1522: literal sequences, and vice versa.
1523:
1524: The `/label=' qualifier takes a feature label as its qualifier.
1525: Although feature labels are optional, they allow unambiguous
1526: references to the feature. The feature label identifies a feature
1527: within an entry; when combined with the accession number and the name
1528: of the data bank from which it came, it is a unique tag for that
1529: feature. Feature labels must be unique within an entry, but can be the
1530: same as a feature label in another entry. Feature labels are not case
1531: sensitive; they can be entered in any combination of upper-and
1532: lowercase without changing their meaning.
1533:
1534: The following is a list of valid feature qualifiers.
1535:
1536: /anticodon Location of the anticodon of tRNA and the amino acid
1537: for which it codes
1538: /bound_moiety Moiety bound
1539: /citation Reference to a citation providing the claim of or
1540: evidence for a feature
1541: /codon Specifies a codon that is different from any found in the
1542: reference genetic code
1543: /codon_start Indicates the reading frame of a protein coding region
1544: /cons_splice Identifies intron splice sites that do not conform to
1545: the 5'-GT... AG-3' splice site consensus
1546: /direction Direction of DNA replication
1547: /EC_number Enzyme Commission number for the enzyme product of the
1548: sequence
1549: /evidence Value indicating the nature of supporting evidence
1550: /frequency Frequency of the occurrence of a feature
1551: /function Function attributed to a sequence
1552: /gene Symbol of the gene corresponding to a sequence region
1553: /label A label used to permanently identify a feature
1554: /mod_base Abbreviation for a modified nucleotide base
1555: /note Any comment or additional information
1556: /number A number indicating the order of genetic elements (e.g., exons
1557: or introns) in the 5' to 3' direction
1558: /organism Name of organism if different from that contained in
1559: the entry's ORGANISM field
1560: /partial Differentiates between complete regions and partial ones
1561: /phenotype Phenotype conferred by the feature
1562: /product Name of a product encoded by the sequence
1563: /pseudo Indicates that this feature is a non-functional version of the
1564: element named by the feature key
1565: /rpt_family Type of repeated sequence; `Alu' or `Kpn,' for example
1566: /rpt_type Organization of repeated sequence
1567: /rpt_unit Identity of repeat unit that constitutes a
1568: repeat_region
1569: /standard_name Accepted standard name for this feature
1570: /transl_except Translational exception: single codon, the translation
1571: of which does not conform to the reference genetic code
1572: /type Name of a strain if different from that in the SOURCE field
1573: /usedin Indicates that feature is used in a compound feature in
1574: another entry
1575:
1576: 3.5.11.4 Cross-Reference Information
1577:
1578: One type of information in the feature table lists cross-references to
1579: the annual compilation of transfer RNA sequences in Nucleic Acids
1580: Research, which has kindly been sent to us on tape by Dr. Sprinzl.
1581: Each tRNA entry of the feature table contains a /note= qualifier that
1582: includes a reference such as `(NAR: 1234)' to identify code 1234 in
1583: the NAR compilation. When such a cross-reference appears in an entry
1584: that contains a gene coding for a transfer RNA molecule, it refers to
1585: the code in the tRNA gene compilation. Similar cross-references in
1586: entries containing mature transfer RNA sequences refer to the
1587: companion compilation of tRNA sequences published by D.H. Gauss and M.
1588: Sprinzl in Nucleic Acids Research. See section 3.5.11.6 for an
1589: example.
1590:
1591: The feature tables of human entries contain cross-references to the
1592: Genome Data Base (GDB) in Baltimore, MD. GDB includes information on
1593: mapped genes, probes, and restriction fragment length polymorphisms.
1594: Each entry in that data bank contains the official symbol for the gene
1595: or locus. GDB assigns each gene a unique identifier that remains
1596: associated with that gene, regardless of changes in gene names. In
1597: entries that contain sequences for mapped genes a /note= qualifier
1598: includes this identifier placed within single quotes following the
1599: term `/hgml_locus_uid='. The /note qualifier also includes the map
1600: location in single quotes following the term `/map'. The gene symbol
1601: formerly designated `/nomgen=' is contained in the /gene qualifier.
1602: See section 3.5.11.6 for an example.
1603:
1604: For more information about the Genome Data Base, contact:
1605:
1606: Genome Data Base
1607: 1830 East Monument Street
1608: Baltimore, MD 21205
1609: Telephone: (203) 786-5515
1610:
1611: 3.5.11.5 Feature Table Examples
1612:
1613: In the first example a number of key names, feature locations, and
1614: qualifiers are illustrated, taken from different sequences. The first
1615: table entry is a coding region consisting of a simple span of bases
1616: and including a /gene qualifier. In the second table entry, an NAR
1617: cross-reference is given (see the previous section for a discussion of
1618: these cross-references). The third and fourth table entries use the
1619: symbols `<`and `>' to indicate that the beginning or end of the
1620: feature is beyond the range of the presented sequence. In the fifth
1621: table entry, the symbol `^' indicates that the feature is between
1622: bases. In the sixth table entry, the replace operator is shown.
1623:
1624:
1625: 1 10 20 30 40 50 60 70 79
1626:
1627: ---------+---------+---------+---------+---------+---------+---------+---------
1628:
1629: CDS 5..1261
1630: /note="alpha-1-antitrypsin precursor /map=`14q32.1'
1631: /hgml_locus_uid=`LX0081X'"
1632: /gene="PI"
1633: tRNA 1..87
1634: /note="Leu-tRNA-CAA (NAR: 1057)"
1635: /anticodon=(pos:35..37,aa:Leu)
1636: mRNA 1..>66
1637: /note="alpha-1-acid glycoprotein mRNA"
1638: transposon <1..267
1639: /note="insertion element IS5"
1640: misc_recomb 105^106
1641: /note="B.subtilis DNA end/IS5 DNA start"
1642: conflict replace(258..258,"t")
1643: /citation=[2]
1644:
1645: ---------+---------+---------+---------+---------+---------+---------+---------
1646:
1647: 1 10 20 30 40 50 60 70 79
1648:
1649: Example 10. Feature Table Entries
1650:
1651: The next example shows the representation for a CDS that spans more
1652: than one entry.
1653:
1654: 1 10 20 30 40 50 60 70 79
1655:
1656: ---------+---------+---------+---------+---------+---------+---------+---------
1657:
1658: LOCUS HUMAPOB1 840 bp ds-DNA PRI 15-JUN-1989
1659: DEFINITION Human apolipoprotein B-100 gene, exons 1 and 2.
1660: ACCESSION M15053
1661: KEYWORDS apolipoprotein B-100.
1662: SEGMENT 1 of 2
1663: .
1664: .
1665: .
1666: FEATURES Location/Qualifiers
1667: sig_peptide 283..354
1668: /note="apolipoprotein B-100 signal peptide"
1669: precursor_RNA 155..>840
1670: /note="apoB100 mRNA"
1671: intron 356..669
1672: /note="apoB100 intron A"
1673: intron 709..>840
1674: /note="apoB100 intron B"
1675: .
1676: .
1677: .
1678: //
1679: LOCUS HUMAPOB2 13872 bp ss-mRNA PRI 15-JUN-1989
1680: DEFINITION Human apolipoprotein B-100 mRNA, starting at exon 3.
1681: ACCESSION M15051 M15054
1682: KEYWORDS apolipoprotein B-100.
1683: SEGMENT 2 of 2
1684: .
1685: .
1686: .
1687: FEATURES Location/Qualifiers
1688: precursor_RNA <1..13872
1689: /note="apoB100 mRNA"
1690: variation 3204
1691: /note="g in lambda-B25; c in lambda B1"
1692: CDS join(M15053:283..355,M15053:670..708,
1693: 1..13571)
1694: /note="apolipoprotein B-100 precursor"
1695: mat_peptide join(M15053:355..355,M15053:670..708,
1696: 1..13568)
1697: /note="apolipoprotein B-100"
1698: .
1699: .
1700: .
1701: //
1702:
1703: ---------+---------+---------+---------+---------+---------+---------+---------
1704:
1705: 1 10 20 30 40 50 60 70 79
1706:
1707: Example 11. Joining Sequences
1708:
1709:
1710: 3.5.12 ORIGIN Format
1711:
1712: The ORIGIN record may be left blank, may appear as `Unreported.' or
1713: may give a local pointer to the sequence start, usually involving an
1714: experimentally determined restriction cleavage site or the genetic
1715: locus (if available). The ORIGIN record ends in a period if it
1716: contains data, but does not include the period if the record is left
1717: empty (in contrast to the KEYWORDS field which contains a period
1718: rather than being left blank).
1719:
1720: 3.5.13 SEQUENCE Format
1721:
1722: The nucleotide sequence for an entry is found in the records following
1723: the ORIGIN record. The sequence is reported in the 5'to 3' direction.
1724: There are sixty bases per record, listed in groups of ten bases
1725: followed by a blank, starting at position 11 of each record. The
1726: number of the first nucleotide in the record is given in columns 4 to
1727: 9 (right justified) of the record.
1728:
1729: 4. FUTURE RELEASES
1730:
1731: 4.1 Changes Planned for Release 67.0
1732:
1733: No changes are planned for Release 67.0.
1734:
1735: 5. KNOWN PROBLEMS WITH THE GENBANK DATABASE
1736:
1737: 5.1 Incorrect Gene Symbols in Entries and Index
1738:
1739: The /gene qualifier should contain gene symbols. In this release,
1740: however, the /gene qualifier for many entries incorrectly contains
1741: values other than the gene symbol, such as the product or standard
1742: name of the gene. The gene symbol index (GBHGM.IDX) is created from
1743: the data in the /gene qualifier and therefore contains data other than
1744: gene symbols. These errors will be corrected as soon as possible.
1745:
1746: 6. GENBANK ADMINISTRATION
1747:
1748: IntelliGenetics Inc., a developer and distributor of molecular biology
1749: computer programs and instrumentation, is the primary contractor for
1750: the GenBank data bank. IntelliGenetics maintains the computerized data
1751: center and oversees data distribution on all media. Under an
1752: arrangement with IntelliGenetics, Los Alamos National Laboratory
1753: (LANL) gathers, annotates, and organizes sequence data and transmits
1754: it to IntelliGenetics. LANL is operated by the University of
1755: California for the Department of Energy.
1756:
1757: The electronic mail address of LANL is [email protected]; their
1758: telephone number is (505) 665-2177. The IntelliGenetics address is on
1759: the front page of these release notes.
1760:
1761: 6.1 Registered Trademark Notice
1762:
1763: GenBank (R) is a registered trademark of the U.S. Department of Health
1764: and Human Services for the Genetic Sequence Data Bank operated by
1765: IntelliGenetics and Los Alamos National Laboratory under contract with
1766: the National Institutes of Health.
1767:
1768: 6.2 GenBank Sponsorship
1769:
1770: GenBank is sponsored by the National Institute of General Medical
1771: Sciences, NIH; The National Library of Medicine, NIH; and the U.S.
1772: Department of Energy.
1773:
1774: 6.3 Citing GenBank
1775:
1776: If you have used GenBank in your research, we would appreciate it if
1777: you would include a reference to GenBank in all publications related
1778: to that research. You may also wish to note that the GenBank data bank
1779: is publicly available from IntelliGenetics.
1780:
1781: When citing data in GenBank, it is appropriate to give the sequence
1782: name, primary accession number, and the publication in which the
1783: sequence first appeared. If the data are unpublished, we urge you to
1784: contact the group which submitted the data to GenBank to see if there
1785: is a recent publication or if they have determined any revisions or
1786: extensions of the data.
1787:
1788: It is also appropriate to list a reference for GenBank itself. The
1789: following publication, which describes the GenBank data bank, should
1790: be cited:
1791:
1792: Bilofsky, H.S. and Burks, C. The GenBank (R) Genetic Sequence Data Bank.
1793: Nucl. Acids Res. 16: 1861-1864 (1988)
1794:
1795: The following statement is an example of how you may cite GenBank
1796: data. It cites the sequence, its primary accession number, the group
1797: who determined the sequence, and GenBank. The numbers in brackets
1798: refer to one of the GenBank citations above and the REFERENCE in the
1799: GenBank sequence entry.
1800:
1801: `We scanned the GenBank (1) data bank for sequence similarities and
1802: found one sequence (2),
1803:
1804: GenBank accession number J01016, which showed significant
1805: similarity...'
1806:
1807: (1) Bilofsky, H.S. and Burks, C. Nucl. Acids Res. 16: 1861-1864 (1988)
1808: (2) Nellen, W. and Gallwitz, D. J. Mol. Biol. 159, 1-18 (1982)
1809:
1810: 6.4 GenBank Distribution Formats and Media
1811:
1812: The GenBank data bank is available in three formats on three physical
1813: media. The three formats are fixed-length 80-character records,
1814: VAX/VMS Backup saveset, and compressed Unix tar archive format. The
1815: three media are industry-standard 9-track magnetic tapes, Sun 1/4" QIC
1816: 24 format cartridges, and TK-50 cartridges. The following chart
1817: specifies which formats are available in each medium.
1818:
1819: To request a change in the format, media, or density of the tapes you
1820: receive, write to the address (or call the telephone number) on the
1821: first page of these release notes.
1822:
1823: FILE FORMATS TAPE MEDIA
1824:
1825: Unlabelled ASCII VAX/VMS Unix
1826: (fixed-length records) Backup Saveset tar tarfile
1827:
1828: 9-track, 2400' reel
1829:
1830: 1600 bpi MU M M
1831:
1832: 6250 bpi MU M M
1833:
1834: TK-50 cartridge (DEC) NA M NA
1835:
1836: 1/4" QIC 24 cartridge (Sun) NA NA M
1837:
1838: MU tapes are available in both mixed-case and uppercase-only formats
1839: M tapes are available only in mixed-case characters
1840: NA not available
1841:
1842: Table 1. Tape Media and Formats
1843:
1844:
1845: 6.5 Request for Direct Submission of Sequence Data
1846:
1847: The growth of nucleotide sequence data is close to exponential. Both
1848: the proposed Human Genome sequencing project and the increasing
1849: automation of sequencing make it clear that GenBank is going to
1850: continue to grow rapidly. A successful GenBank requires that the data
1851: enter the data bank as soon as possible after publication, that the
1852: annotations be as complete as possible, and that the sequence and
1853: annotation data be accurate. All three of these requirements are best
1854: met if authors of sequence data submit their data directly to GenBank
1855: in a usable form. It is especially important that these submissions be
1856: in computer-readable form.
1857:
1858: GenBank must rely on direct author submission of data to ensure that
1859: it achieves its goals of complete, accurate, and timely data. To
1860: assist researchers in entering their own sequence data, GenBank has
1861: developed AUTHORIN, an easy-to-use program that enables authors to
1862: enter a sequence, annotate it, and submit it to GenBank or any of the
1863: other data banks. The IBM PC compatible and Macintosh versions of
1864: AUTHORIN may be obtained by completing the enclosed AUTHORIN request
1865: card or by contacting GenBank at the address shown on the front of
1866: these release notes. Versions for the VAX and Sun workstations are
1867: also planned and will be announced in future release notes as they
1868: become available.
1869:
1870: For those who are unable to use the Authorin program, GenBank has a
1871: printed data submission form. This form is now standardized among
1872: EMBL, DDBJ, GenBank, PIR, MIPS, and JIPID. GenBank also provides a
1873: corresponding computer-readable data submission form that can be used
1874: for electronic mail and floppy disk submissions. The GenBank Data
1875: Submission Form (located in the file GBDAT.FRM) can be used to submit
1876: your sequence and annotations. Electronic mail submissions should go
1877: to the address "GB-SUB%[email protected]"; direct mail should go to our
1878: postal address in Los Alamos, which is on the data submission form.
1879:
1880: 6.6 Request for Corrections and Comments
1881:
1882: We welcome your suggestions for improvements to GenBank. We are
1883: especially interested to learn of errors or inconsistencies in the
1884: data. Please use the GenBank Error/Suggestion Report Form, which is
1885: part of this distribution of GenBank (located in the file GBDAT.FRM),
1886: to send your suggestions and corrections to the address on the first
1887: page of these release notes. Please be certain to indicate the GenBank
1888: release number (e.g., Release 66.0) and the primary accession number
1889: of the entry to which your comments apply; it is helpful if you also
1890: give the entry name and the current contents of any data field for
1891: which you are recommending a change.
1892:
1893: 6.7 Disclaimer
1894:
1895: IntelliGenetics Inc., Los Alamos National Laboratory, and the United
1896: States Government make no representations or warranties regarding the
1897: content or accuracy of the information. IntelliGenetics Inc., Los
1898: Alamos National Laboratory, and the United States Government also make
1899: no representations or warranties of merchantibility or fitness for a
1900: particular purpose and accept no responsibility for any consequences
1901: of the receipt or use of the information.
1902:
1903: APPENDIX A. Statistical Summary
1904:
1905: Division Entries Bases Reports
1906:
1907: PRIMATE 7511 9003383 9493
1908: RODENT 7652 7841099 9176
1909: OTHER MAMMALIAN 1552 1935603 1817
1910: OTHER VERTEBRATE 1876 2142926 2263
1911: INVERTEBRATE 3195 4005462 3811
1912: PLANT 2976 4659180 3636
1913: ORGANELLE 1271 1848854 1569
1914: BACTERIAL 4293 6992664 5528
1915: STRUCTURAL RNA 1647 445723 1946
1916: VIRAL 3707 6439492 4751
1917: PHAGE 593 682556 880
1918: SYNTHETIC 1028 516186 1129
1919: UNANNOTATED 3756 4792964 4909
1920: Total (13 divisions) 41057 51306092 50908
1921:
1922:
1923: Sequences with greater than 30,000 bp
1924:
1925: Locus Div Accession Length
1926:
1927: ADBCG VRL J01917 35937bp
1928: CHKMYHE VRT J02714 31111bp
1929: HS11UL VRL D00317 108360bp
1930: HS4 VRL V01555 172282bp
1931: HS5HCMVU VRL X04650 43275bp
1932: HUMADAG PRI M13792 36741bp
1933: HUMFIXG PRI K02402 38059bp
1934: HUMGHCSA PRI J03071 66495bp
1935: HUMHBB PRI J00179 73326bp
1936: HUMHPRTB PRI M26434 56736bp
1937: HUMTPA PRI K03021 36594bp
1938: LAMCG PHG J02459 48502bp
1939: MPOCPCG ORG X04465 121024bp
1940: MUSBGCXD ROD X14061 55856bp
1941: PT7CG PHG J02518 39936bp
1942: RABBGLOB MAM M18818 44594bp
1943: RATCRYG ROD M19359 54670bp
1944: TOBCPCG ORG Z00044 155844bp
1945: VACCG VRL M35027 191737bp
1946: VAZCG VRL X04370 124884bp
1947: X14112 UNA X14112 152260bp
1948: X14720 UNA X14720 35100bp
1949: X15423 UNA X15423 47081bp
1950: X15917 UNA X15917 40469bp
1951: X17012 UNA X17012 30000bp
1952: X17403 UNA X17403 229354bp
1953:
1954:
1955: APPENDIX B. Entries with a change in locus name
1956:
1957: Accession Rel 65.0 Rel 66.0
1958: -------- --------- ---------
1959: J03132 HUMICAM1 HUMICAM1M
1960: J04134 MUSCNR MUSCNRA
1961: K00319 BEACPTRMF PHVCPTRMF
1962: K00336 BEACPTRF PHVCPTRF
1963: M22244 BOVSVSP BOVSVSPA
1964: M24637 CHKCEK CHKCEK1
1965: M29035 BSUPEPF BSUPEPFA
1966: M29579 RATGLUSA RATGLUS
1967: M30953 ECOPANA ECOPANF
1968: M31077 HUMSTATHG1 HUMSTATH1
1969: M31612 TRBESAGC TRBESAGCA
1970: M31628 STMHISOP STMHISOPA
1971: M32331 HUMICAM1AA HUMICAM1
1972: M32639 HUMSTATHG2 HUMSTATH2
1973: M36473 ACLP322P ACCP322P
1974: X02297 PARSP51A3 PARSP51A4
1975: X03499 DDITRNV DDITGNV
1976: X03500 DDITRNE DDITGNE
1977: X04083 TVMXGG TVMXCG
1978: X12646 HUMRPHO2A HUMAPP2A
1979: X12656 HUMPP2A HUMBPP2A
1980: X17615 ECOFHUE X17615
1981: Y00448 ECOK2KORB RK2KORB
1982:
1983:
1984: APPENDIX C. Number of entries, reports, and bases by organism
1985:
1986: PRIMATE
1987:
1988: Key Name Reports Entries Bases
1989: -------------------------------------------------------------------------------
1990: 1. AGM Cercopithecus aethiops 61 42 31391
1991: 2. ATR Aotus trivirgatus 7 7 7557
1992: 3. BAB Papio anubis 3 3 2653
1993: 4. BAB Papio doguera 1 1 2000
1994: 5. BAB Papio hamadryas 7 7 11052
1995: 6. BAB Papio papio 1 1 343
1996: 7. BAB Papio sp. 3 3 1601
1997: 8. CEB Cebus sp. 2 2 190
1998: 9. CEP Cebus apella 7 7 1819
1999: 10. CHP Pan paniscus 1 1 1683
2000: 11. CHP Pan troglodytes 68 53 72174
2001: 12. COL Colobus polykomos 2 2 1494
2002: 13. GCR Galago crassicaudatus 38 38 16345
2003: 14. GIB Hylobates lar 8 6 17024
2004: 15. GOR Gorilla gorilla 19 11 19990
2005: 16. GSE Galago senegalensis 1 1 369
2006: 17. HUM Homo sapiens 9156 7235 8687688
2007: 18. LEM Cheirogaleus medius 1 1 1899
2008: 19. LEM Lemur albifrons 1 1 1786
2009: 20. LEM Lemur macaco 3 3 5590
2010: 21. LEM Lemur sp. 1 1 1380
2011: 22. MAC Macaca fascicularis 8 7 6355
2012: 23. MAC Macaca mulatta 40 28 33283
2013: 24. MAC Macaca nemestrina 1 1 1115
2014: 25. MAC Macaca radiata 2 2 342
2015: 26. MAC Macaca sp. 7 6 7502
2016: 27. MNK Ateles geoffroyi 4 4 13966
2017: 28. MNK Monkey 11 11 2081
2018: 29. ORA Pongo pygmaeus 19 17 36667
2019: 30. SOE Saguinus oedipus 4 4 5207
2020: 31. TAR Tarsius sp. 6 5 10837
2021:
2022: Total 9493 7511 9003383
2023:
2024: RODENT
2025:
2026: Key Name Reports Entries Bases
2027: -------------------------------------------------------------------------------
2028: 1. DIP Dipodomys ordii 1 1 3318
2029: 2. GPI Cavia cobaya 1 1 491
2030: 3. GPI Cavia cutleri 3 2 4959
2031: 4. GPI Cavia porcellus 23 22 36118
2032: 5. GPI Cavia sp. 2 2 3226
2033: 6. HAM Cricetulus griseus 32 28 38686
2034: 7. HAM Cricetulus longicaudatus 44 31 20752
2035: 8. HAM Cricetulus sp. 37 33 31583
2036: 9. HAM Cricetus cricetus 6 6 11446
2037: 10. HAM Mesocricetus auratus 81 58 77935
2038: 11. HAM Mesocricetus sp. 94 62 47934
2039: 12. MAR Marmota monax 8 6 8563
2040: 13. MUS Mus caroli 18 17 13904
2041: 14. MUS Mus domesticus 24 20 12906
2042: 15. MUS Mus muscaris 56 56 28343
2043: 16. MUS Mus musculus 5821 4884 4299073
2044: 17. MUS Mus pahari 7 7 7763
2045: 18. MUS Mus platythrix 1 1 315
2046: 19. MUS Mus sp. 11 11 9130
2047: 20. MUS Mus spretus 9 7 7454
2048: 21. PER Peromyscus leucopus 3 3 3640
2049: 22. PER Peromyscus maniculatus 11 11 1791
2050: 23. RAT Rattus colletti 4 4 7849
2051: 24. RAT Rattus fuscipes 1 1 1161
2052: 25. RAT Rattus leucopus 3 3 3481
2053: 26. RAT Rattus norvegicus 2599 2144 2797456
2054: 27. RAT Rattus rattus 209 177 284465
2055: 28. RAT Rattus sordidus 1 1 1161
2056: 29. RAT Rattus sp. 56 45 61254
2057: 30. RAT Rattus tunneyi 1 1 1161
2058: 31. RAT Rattus villosissimus 2 2 3369
2059: 32. SEH Spalax ehrenbergi 7 5 10412
2060:
2061: Total 9176 7652 7841099
2062:
2063: OTHER MAMMALIAN
2064:
2065: Key Name Reports Entries Bases
2066: -------------------------------------------------------------------------------
2067: 1. AXI Axis axis 2 2 1758
2068: 2. BOV Bos bovis 1 1 2363
2069: 3. BOV Bos taurus 755 642 741804
2070: 4. CAT Felis catus 40 40 23293
2071: 5. CAT Felis domesticus 3 3 4748
2072: 6. CAT Felis silvestris 8 6 9516
2073: 7. CAT Felis sp. 1 1 3534
2074: 8. DAV Dasyurus viverrinus 2 2 939
2075: 9. DOG Canis familiaris 55 45 52602
2076: 10. DOG Canis lupus 8 8 7867
2077: 11. DOG Canis sp. 19 18 22470
2078: 12. GOT Capra hircus 46 41 42286
2079: 13. HRS Equus caballus 66 38 32167
2080: 14. LEE Lepus capensis 1 1 434
2081: 15. LEE Lepus europaeus 1 1 3646
2082: 16. MMU Muntiacus muntjak 1 1 807
2083: 17. MVI Mustela vison 3 3 1909
2084: 18. OPO Didelphis virginiana 9 9 7139
2085: 19. ORC Orcinus orca 1 1 1579
2086: 20. PIG Sus scrofa 190 155 242401
2087: 21. RAB Basilea sp. 1 1 377
2088: 22. RAB Oryctolagus cuniculus 420 363 525433
2089: 23. RAB Oryctolagus sp. 57 57 70289
2090: 24. RAB Sylvilagus floridanus 1 1 1065
2091: 25. SEA Halichoerus grypus 3 3 2288
2092: 26. SHP Ovis aries 69 61 87157
2093: 27. SHP Ovis sp. 41 38 35507
2094: 28. SUN Suncus murinus 8 6 6231
2095: 29. VMP Desmodus rotundus 2 1 1725
2096: 30. WAL Macropus eugenii 1 1 754
2097: 31. WAL Macropus robustus 1 1 1465
2098: 32. WAL Macropus rufus 1 1 50
2099:
2100: Total 1817 1552 1935603
2101:
2102: OTHER VERTEBRATE
2103:
2104: Key Name Reports Entries Bases
2105: -------------------------------------------------------------------------------
2106: 1. ANG Anguilla australis 1 1 2191
2107: 2. APT Ascaphus truei 2 1 1897
2108: 3. BUJ Bufo japonicus 2 2 1116
2109: 4. CHK Gallus domesticus 158 120 174064
2110: 5. CHK Gallus gallus 1040 851 1029779
2111: 6. CPL Carcharhinus plumbeus 4 4 1821
2112: 7. CRC Caiman crocodylus 5 5 2902
2113: 8. DUK Aix sp. 1 1 165
2114: 9. DUK Anas platyrhynchos 15 14 21280
2115: 10. DUK Cairina moschata 30 22 25269
2116: 11. FAL Falco columbarius 1 1 174
2117: 12. FSA Myxine glutinosa 1 1 915
2118: 13. FSA Petromyzon marinus 5 5 9661
2119: 14. FSB Acipenser transmontano 2 1 1140
2120: 15. FSB Anarhichas lupus 1 1 3395
2121: 16. FSB Carassius auratus 12 11 9222
2122: 17. FSB Catostomus commersoni 4 3 1936
2123: 18. FSB Ctenopharyngobon idella 1 1 4243
2124: 19. FSB Cyprinus carpio 18 17 16913
2125: 20. FSB Electrophorus electricus 5 5 8692
2126: 21. FSB Elops saurus 9 5 3870
2127: 22. FSB Fundulus heteroclitus 1 1 1673
2128: 23. FSB Ictalurus punctatus 7 6 5679
2129: 24. FSB Limanda ferruginea 1 1 416
2130: 25. FSB Lophius americanus 8 5 2942
2131: 26. FSB Macrozoarces americanus 3 3 2657
2132: 27. FSB Misgurnus fossilis 2 2 1697
2133: 28. FSB Oncorhynchus keta 30 26 23515
2134: 29. FSB Oncorhynchus kisutch 2 2 3221
2135: 30. FSB Oncorhynchus tschawytscha 3 3 1862
2136: 31. FSB Paralichthys olivaceus 7 5 4859
2137: 32. FSB Pseudopleuronectus americanus 8 6 3002
2138: 33. FSB Salmo gairdneri 52 48 40813
2139: 34. FSB Salmo irideus 1 1 1278
2140: 35. FSB Salmo salar 8 6 6764
2141: 36. FSB Thunnus thynnus 1 1 911
2142: 37. FSC Torpedo californica 21 13 23035
2143: 38. FSC Torpedo marmorata 7 7 9992
2144: 39. GOO Anser anser 2 2 4906
2145: 40. GRE Geoclemys reevessi 1 1 239
2146: 41. HEF Heterodontus francisci 34 29 19173
2147: 42. KRY Kryptophaneron alfredi 1 1 1230
2148: 43. LSE Laticauda semifasciata 1 1 483
2149: 44. LSL Laticauda laticaudata 2 1 632
2150: 45. MRG Mergus serrator 1 1 2574
2151: 46. NEW Cynops pyrrhogaster 1 1 629
2152: 47. NVI Notophthalmus viridescens 10 10 1458
2153: 48. ORN Oreochromis mossambicus 2 1 237
2154: 49. ORN Oreochromis niloticus 1 1 847
2155: 50. PAG Pagrus major 2 1 906
2156: 51. PGN Columba sp. 2 2 1665
2157: 52. PHS Phasianus colchicus 1 1 739
2158: 53. PHU Phyllomedusa bicolor 1 1 781
2159: 54. PHU Phyllomedusa sauvagei 2 2 1315
2160: 55. PLS Phylloscopus trochilus 6 3 2593
2161: 56. PYU Pyura stolonifera 7 7 1029
2162: 57. QUL Coturnix coturnix 50 35 36670
2163: 58. QUL Coturnix japonica 1 1 311
2164: 59. RAN Rana catesbeiana 10 8 6949
2165: 60. RAN Rana pipiens 4 3 1625
2166: 61. RAN Rana temporaria 19 14 7422
2167: 62. SCC Scyliorhinus caniculus 2 2 667
2168: 63. SEQ Seriola quinqueradiata 1 1 879
2169: 64. SKT Raja erinacea 8 8 10209
2170: 65. SMD Triturus vulgaris 4 4 901
2171: 66. SMR Pleurodeles waltlii 5 4 2305
2172: 67. SNK Aipysurus laevis 6 3 1332
2173: 68. SNK Bothrops atrox 8 7 11423
2174: 69. SNK Crotalus durissus 3 2 1263
2175: 70. SNK Elaphe radiata 1 1 2483
2176: 71. SNK Naja naja 1 1 312
2177: 72. SNK Natrix tessellata 1 1 312
2178: 73. SNK Notechis scutatus 2 1 621
2179: 74. SRA Hemitripterus americanus 2 2 3294
2180: 75. TKY Meleagris gallopavo 16 14 6836
2181: 76. VIA Vipera ammodytes 2 1 607
2182: 77. XEB Xenopus borealis 27 26 19249
2183: 78. XEC Xenopus clivii 6 6 1406
2184: 79. XEL Xenopus laevis 503 440 505221
2185: 80. XET Xenopus tropicalis 11 8 14850
2186: 81. XIP Xiphophorus maculatus 4 2 983
2187: 82. XIP Xiphophorus sp. 2 1 4135
2188: 83. ZEF Brachydanio rerio 8 6 4264
2189:
2190: Total 2263 1876 2142926
2191:
2192: INVERTEBRATE
2193:
2194: Key Name Reports Entries Bases
2195: -------------------------------------------------------------------------------
2196: 1. ACA Acanthamoeba castellanii 32 28 28156
2197: 2. ACP Acropora formosa 2 2 236
2198: 3. ACP Acropora latistella 3 3 354
2199: 4. ADO Acheta domesticus 1 1 541
2200: 5. AEI Aequipecten irradians 2 2 1253
2201: 6. AEV Aequorea victoria 4 4 2595
2202: 7. AME Apis melifica 2 2 897
2203: 8. AMF Apis mellifera 8 8 2082
2204: 9. APL Aplysia californica 35 32 28418
2205: 10. APL Aplysia sp. 9 9 9086
2206: 11. APO Antheraea polyphemus 33 33 16883
2207: 12. APY Antheraea yamamai 1 1 1200
2208: 13. ARB Arbacia punctulata 3 2 5078
2209: 14. BBO Babesia bovis 4 4 2482
2210: 15. BBO Babesia rodhaini 1 1 3238
2211: 16. BMO Bombyx mandarina 3 3 2712
2212: 17. BMO Bombyx mori 152 120 87988
2213: 18. BPL Brachionus plicatilis 1 1 120
2214: 19. BRP Brugia malayi 9 8 8730
2215: 20. BRP Brugia pahangi 2 2 4893
2216: 21. BUG Bugula neritina 1 1 120
2217: 22. CAF Calanus finmarchicus 1 1 1487
2218: 23. CAR Carcinoscorpius rotundicauda 1 1 983
2219: 24. CAV Calliphora vicina 11 7 16088
2220: 25. CBL Trichoplusia ni 1 1 2475
2221: 26. CCA Caledia captiva 12 12 11713
2222: 27. CEI Ceratitis capitata 1 1 1115
2223: 28. CEL Caenorhabditis briggsae 7 4 5300
2224: 29. CEL Caenorhabditis elegans 139 108 228239
2225: 30. CER Calliphora erythrocephala 6 6 1677
2226: 31. CHI Chironomus pallidivittatus 21 16 5529
2227: 32. CHI Chironomus tentans 22 20 20059
2228: 33. CHI Chironomus thummi 32 30 32150
2229: 34. CLM Clam sp. 1 1 2163
2230: 35. CLM Spisula solidissima 2 2 3577
2231: 36. CPD Colpidium campylum 1 1 76
2232: 37. CPD Colpidium colpoda 1 1 76
2233: 38. CRB Cardisoma guanhumi 3 3 1992
2234: 39. CRB Gecarcinus lateralis 4 4 6714
2235: 40. CRB Geryon quinquedens 1 1 85
2236: 41. CRB Limulus polyphemus 4 4 4886
2237: 42. CRB Paguras pollicaris 4 4 419
2238: 43. CUL Culex pipiens 1 1 3105
2239: 44. DDI Dictyostelium discoideum 282 215 215990
2240: 45. DDI Dictyostelium sp. 2 1 4439
2241: 46. DEM Dermasterias imbricata 2 2 1016
2242: 47. DEP Dermatophagoides pteronyssinus 2 1 824
2243: 48. DIA Diadromus pulchellus 1 1 324
2244: 49. DIC Dicyema misakiense 1 1 116
2245: 50. DIR Dirofilaria immitis 2 2 2791
2246: 51. DRE Drosophila erecta 4 4 4847
2247: 52. DRF Drosophila funebris 2 1 3002
2248: 53. DRG Drosophila gymnobasis 10 10 1793
2249: 54. DRH Drosophila heteroneura 1 1 922
2250: 55. DRH Drosophila hydei 18 15 26936
2251: 56. DRH Drosophila irilis 1 1 1602
2252: 57. DRI Drosophila grimshawi 3 3 3860
2253: 58. DRL Drosophila silvarentis 2 2 356
2254: 59. DRM Drosophila mauritiana 6 4 8528
2255: 60. DRN Drosophila navojoa 2 1 2334
2256: 61. DRN Drosophila nebulosa 2 2 2991
2257: 62. DRO Drosophila melanogaster 1242 1001 1581360
2258: 63. DRO Drosophila subobscura 2 2 4827
2259: 64. DRP Drosophila pseudoobscura 14 7 38083
2260: 65. DRQ Drosophila sechellia 3 2 5559
2261: 66. DRR Drosophila orena 7 4 7293
2262: 67. DRS Drosophila simulans 23 18 21926
2263: 68. DRS Drosophila sp. 2 2 5761
2264: 69. DRT Drosophila teissieri 4 4 7787
2265: 70. DRU Drosophila mulleri 1 1 6778
2266: 71. DRV Drosophila virilis 47 37 72725
2267: 72. DRW Drosophila mojavensis 6 5 21612
2268: 73. DRY Drosophila yakuba 1 1 1853
2269: 74. ECC Echinococcus granulosus 2 2 846
2270: 75. EIM Eimeria acervulina 4 2 1764
2271: 76. EIM Eimeria tenella 5 4 4629
2272: 77. ENH Entamoeba histolytica 15 14 10247
2273: 78. EPH Ephydatia mulleri 1 1 2895
2274: 79. EUC Eurypelma californicum 1 1 1579
2275: 80. EWA Euplotes aediculatus 1 1 1882
2276: 81. EWC Euplotes crassus 2 2 1427
2277: 82. EWE Euplotes eurystomus 2 1 930
2278: 83. EWR Euplotes raikovi 1 1 593
2279: 84. FFL Luciola cruciata 1 1 1985
2280: 85. FHE Fasciola hepatica 1 1 894
2281: 86. GCH Glaucoma chattoni 3 3 564
2282: 87. GLA Giardia lamblia 14 10 9487
2283: 88. GLA Giardia sp. 1 1 831
2284: 89. GLY Glycera dibranchiata 1 1 745
2285: 90. GMO Glossina austeni 1 1 653
2286: 91. GMO Glossina fuscipes 1 1 239
2287: 92. GMO Glossina morsitans 7 7 3244
2288: 93. GMO Glossina palpalis 1 1 236
2289: 94. HAE Haemonchus contortus 4 4 3557
2290: 95. HCE Hyalophora cecropia 11 11 6608
2291: 96. HEL Heliothis virescens 1 1 2977
2292: 97. HIR Hirudo medicinalis 1 1 379
2293: 98. HLT Haliotis corrugata 1 1 650
2294: 99. HLT Haliotis rufescens 1 1 642
2295: 100. HOL Holothuria polii 5 5 1964
2296: 101. HOL Holothuria tubulosa 1 1 441
2297: 102. HYD Hydra sp. 2 2 4555
2298: 103. LAN Lingula anatina 1 1 120
2299: 104. LAR Lampetra reissneri 1 1 120
2300: 105. LEI Leishmania donovani 10 6 10013
2301: 106. LEI Leishmania enriettae 4 4 1562
2302: 107. LEI Leishmania enriettii 4 4 4213
2303: 108. LEI Leishmania major 10 8 9788
2304: 109. LEI Leishmania sp. 4 4 8884
2305: 110. LEI Leishmania tropica 1 1 1851
2306: 111. LIT Litomosoides carinii 2 2 214
2307: 112. LMI Locusta migratoria 5 5 3039
2308: 113. LOA Loa loa 1 1 839
2309: 114. LUM Lumbricus terrestris 2 2 5061
2310: 115. LYM Lymnaea stagnalis 2 2 1764
2311: 116. MDO Musca domestica 3 2 2916
2312: 117. MOT Manduca sexta 25 22 40693
2313: 118. MSQ Aedes aegypti 4 4 8350
2314: 119. MSQ Anopheles gambiae 21 21 7322
2315: 120. NEM Ascaris lumbricoides 35 34 17038
2316: 121. NEM Ascaris suum 2 2 6079
2317: 122. NEP Nephila clavipes 1 1 2336
2318: 123. NGR Naegleria gruberi 4 4 6389
2319: 124. OCT Octopus dofleini 1 1 1315
2320: 125. OCT Paroctopus defleini 1 1 1675
2321: 126. OFA Oxytricha fallax 34 13 9421
2322: 127. OMM Ommastrephes sloanei 2 2 1825
2323: 128. ONG Onchocerca sp. 2 2 214
2324: 129. ONG Onchocerca volvulus 25 24 15477
2325: 130. ONO Oxytricha nova 21 21 20172
2326: 131. OWE Owenia fusiformis 1 1 1548
2327: 132. PAA Parascaris sp. 2 2 215
2328: 133. PAL Paracentrotus lividus 11 9 13481
2329: 134. PAR Paramecium aurelia 5 5 1190
2330: 135. PAR Paramecium primaurelia 8 8 12822
2331: 136. PAR Paramecium tetraurelia 20 17 9174
2332: 137. PBA Plasmodium gallinaceum 1 1 799
2333: 138. PBE Plasmodium berghei 9 9 11176
2334: 139. PBS Plasmodium brasilianum 2 2 3010
2335: 140. PCH Plasmodium chabaudi 11 9 17151
2336: 141. PCR Philosamia cynthia ricini 2 2 240
2337: 142. PCY Plasmodium cynomolgi 6 6 7875
2338: 143. PFA Plasmodium falciparum 168 138 216729
2339: 144. PFA Plasmodium fragile 2 1 2307
2340: 145. PIO Pisaster ochraceus 5 5 9699
2341: 146. PKN Plasmodium knowlesi 11 9 8880
2342: 147. PLM Plasmodium malariae 1 1 1545
2343: 148. PLM Plasmodium reichenowi 1 1 654
2344: 149. PLO Plasmodium lophurae 6 5 6087
2345: 150. PMC Pneumocystis carinii 10 6 5670
2346: 151. PMI Prorocentrum micans 2 1 2451
2347: 152. PNG Panagrellus redivivus 2 2 322
2348: 153. PNG Panagrellus silusiae 1 1 682
2349: 154. PPY Photinus pyralis 1 1 2387
2350: 155. PVI Plasmodium vivax 13 7 8966
2351: 156. PYO Plasmodium yoelii 15 11 21793
2352: 157. SAR Sarcocystis gigantea 3 3 473
2353: 158. SCA Schistocerca americana 3 3 3458
2354: 159. SCA Schistocerca nitans 2 2 711
2355: 160. SCI Sciara coprophila 2 2 822
2356: 161. SCM Schistosoma japonicum 10 9 9090
2357: 162. SCM Schistosoma mansoni 58 52 46518
2358: 163. SCR Androctonus australis 7 7 2563
2359: 164. SEM Parastichopus parvimensis 1 1 1458
2360: 165. SHR Artemia salina 21 20 14517
2361: 166. SHR Artemia sp. 11 8 6407
2362: 167. SLE Stylonychia lemnae 4 4 7237
2363: 168. SLU Stylonychia pustulata 5 5 2820
2364: 169. SPE Sarcophaga peregrina 7 7 6405
2365: 170. SPF Spodoptera frugiperda 5 5 7382
2366: 171. SQD Loligo pealii 1 1 3693
2367: 172. STF Asterina pectinifera 1 1 2180
2368: 173. SUH Hemicentrotus pulcherrimus 1 1 2413
2369: 174. SUL Lytechinus pictus 14 13 10476
2370: 175. SUL Lytechinus variegatus 16 16 12102
2371: 176. SUP Psammechinus miliaris 50 38 25964
2372: 177. SUS Strongylocentrotus drobachiensis 4 4 1256
2373: 178. SUS Strongylocentrotus franciscanus 2 2 3832
2374: 179. SUS Strongylocentrotus purpuratus 167 158 114993
2375: 180. SUT Tripneustes gratilla 13 7 10207
2376: 181. SUU Sea urchin 12 12 2632
2377: 182. TAE Taenia solium 3 2 3737
2378: 183. TAT Tachypleus tridentatus 1 1 946
2379: 184. TCA Tribolium castaneum 1 1 707
2380: 185. TCK Boophilus microplus 2 1 2225
2381: 186. TCS Trichostrongylus colubriformis 2 2 1987
2382: 187. TEC Tetrahymena cosmopolitanis 1 1 511
2383: 188. TEH Tetrahymena hyperangularis 2 2 767
2384: 189. TEL Tetrahymena leucophrys 1 1 75
2385: 190. TEM Tetrahymena malaccensis 1 1 507
2386: 191. TEN Tenebrio molitor 23 23 5207
2387: 192. TEP Tetrahymena paravorax 1 1 74
2388: 193. TEP Tetrahymena pigmentosa 4 4 3072
2389: 194. TES Tetrahymena sonneborni 1 1 511
2390: 195. TET Tetrahymena thermophila 64 61 49075
2391: 196. TEU Tetrahymena patula 1 1 75
2392: 197. TEX Tetrahymena vorax 1 1 75
2393: 198. TEY Tetrahymena pyriformis 13 11 10560
2394: 199. THE Theileria annulata 2 2 2859
2395: 200. THE Theileria parva 3 3 6729
2396: 201. TOX Toxoplasma gondii 10 10 12582
2397: 202. TRB Trypanosoma brucei 252 223 288704
2398: 203. TRC Trypanosoma cruzi 35 33 31153
2399: 204. TRE Trypanosoma equiperdum 6 4 3816
2400: 205. TRF Crithidia fasciculata 16 12 20239
2401: 206. TRI Trichomonas vaginalis 1 1 582
2402: 207. TRL Leptomonas collosoma 1 1 154
2403: 208. TRL Leptomonas seymouri 6 4 2130
2404: 209. TRO Trypanosoma congolense 9 9 8879
2405: 210. TRV Trypanosoma vivax 2 2 857
2406: 211. TRY Trypanosoma rangeli 1 1 153
2407: 212. TSR Trichinella spiralis 3 2 2138
2408: 213. UCA Urechis caupo 1 1 718
2409: 214. VAH Vargula hilgendorfii 1 1 1818
2410: 215. VAI Vairimorpha necatrix 1 1 1244
2411: 216. VUI Eupelmus vuilleti 1 1 106
2412: 217. WSP Dolichovespula maculata 2 2 1367
2413: 218. WUC Wuchereria bancrofti 1 1 1323
2414:
2415: Total 3811 3195 4005462
2416:
2417: PLANT
2418:
2419: Key Name Reports Entries Bases
2420: -------------------------------------------------------------------------------
2421: 1. ABG Absidia glauca 2 1 1011
2422: 2. ACK Achlya ambisexualis 1 1 1121
2423: 3. ACK Achlya bisexualis 1 1 1809
2424: 4. ACK Achlya klebsiana 1 1 1254
2425: 5. ACT Actinidia chinensis 8 5 5231
2426: 6. ACT Actinidia deliciosa 1 1 634
2427: 7. AEG Aegilops tauschii 1 1 421
2428: 8. ALC Allium cepa 3 3 1135
2429: 9. ALF Medicago sativa 29 18 26748
2430: 10. AMA Antirrhinum majus 11 7 11860
2431: 11. APE Acremonium chrysogenum 2 2 2410
2432: 12. ASA Aspergillus awamori 4 3 7873
2433: 13. ASG Aspergillus niger 12 9 18564
2434: 14. ASL Aessosporon salmonicolor 1 1 119
2435: 15. ASN Aspergillus nidulans 49 42 89815
2436: 16. ASO Aspergillus oryzae 13 10 14743
2437: 17. AST Avena sativa 8 8 22641
2438: 18. ATH Arabidopsis thaliana 94 76 136042
2439: 19. AVO Persea americana 3 3 4672
2440: 20. BJE Bjerkandera adusta 1 1 118
2441: 21. BLY Hordeum vulgare 91 72 86901
2442: 22. BNA Brassica napus 25 21 23597
2443: 23. BOL Brassica campestris 8 5 5320
2444: 24. BOL Brassica juncea 3 2 356
2445: 25. BOL Brassica oleracea 12 9 3859
2446: 26. BRM Bremia lactucae 1 1 2869
2447: 27. BRN Bertholletia excelsa 1 1 621
2448: 28. CAG Canavalia gladiata 3 2 4797
2449: 29. CCI Coprinus cinereus 2 2 6091
2450: 30. CEN Canavalia ensiformis 1 1 1027
2451: 31. CFU Caldariomyces fumago 2 1 2787
2452: 32. CHE Chenopodium rubrum 6 3 1865
2453: 33. CHL Chlorella protothecoides 2 1 1332
2454: 34. CHL Chlorella sp. 6 6 1019
2455: 35. CIP Mesembryanthemum crystallinum 11 8 25401
2456: 36. CLC Claviceps purpurea 2 2 758
2457: 37. CLI Citrus limon 3 3 4958
2458: 38. CLR Clarkia unguiculata 2 1 2040
2459: 39. CNA Citrullus vulgaris 1 1 1334
2460: 40. COA Convolvulus arvensis 4 4 4549
2461: 41. COC Cochliobolus heterostrophus 2 2 2634
2462: 42. COG Colletotrichum capsici 1 1 2557
2463: 43. COG Colletotrichum gloeosporioides 1 1 1749
2464: 44. COG Colletotrichum graminicola 2 2 5286
2465: 45. COT Gossypium hirsutum 9 9 13403
2466: 46. CPA Carica papaya 3 3 2037
2467: 47. CPA Cyanophora paradoxa 1 1 528
2468: 48. CRE Chlamydomonas reinhardtii 49 38 52939
2469: 49. CTR Catharanthus roseus 1 1 1740
2470: 50. CUC Cucurbita maxima 5 3 15360
2471: 51. CUC Cucurbita moschata 2 1 1781
2472: 52. CUC Cucurbita pepo 8 8 7363
2473: 53. CUS Cucumis melo 1 1 1137
2474: 54. CUS Cucumis sativus 13 12 12742
2475: 55. DAR Daucus carota 11 9 11820
2476: 56. DBI Dolichos biflorus 3 3 5384
2477: 57. DUN Dunaliella salina 3 2 2541
2478: 58. EGR Euglena gracilis 9 6 9011
2479: 59. EPA Endothia parasitica 5 3 3809
2480: 60. EPK Ephedra kokanica 2 1 120
2481: 61. ERG Erysiphe graminis 1 1 2475
2482: 62. FIL Filobasidium capsuligenum 1 1 118
2483: 63. FIL Filobasidium floriforme 1 1 118
2484: 64. FLX Linum usitatissimum 5 5 1726
2485: 65. FSO Fusarium oxysporum 2 2 1347
2486: 66. FSO Fusarium solani 3 3 5508
2487: 67. FSO Fusarium sporotrichioides 2 1 1908
2488: 68. FTR Flaveria trinervia 1 1 752
2489: 69. GCO Gracilaria tikvahiae 1 1 1771
2490: 70. GCO Gracilaria verrucosa 1 1 1771
2491: 71. GNG Gnetum gnemon 2 1 120
2492: 72. GRO Gracilariopsis sp. 1 1 1782
2493: 73. HAS Hansenula anomala 2 1 2132
2494: 74. HAS Hansenula polymorpha 2 1 3637
2495: 75. HEV Hevea brasiliensis 1 1 1008
2496: 76. HNN Helianthus annuus 6 4 5752
2497: 77. HRA Armoracia rusticana 2 2 5828
2498: 78. IPB Ipomoea batatas 9 7 11859
2499: 79. LGI Lemna gibba 6 6 5024
2500: 80. LGI Lemna minor 1 1 119
2501: 81. LIL Lilium henryi 1 1 9345
2502: 82. LOL Lolium perenne 2 2 2082
2503: 83. LUP Lupinus luteus 20 15 5280
2504: 84. MAQ Marsilia quadrifolia 2 1 155
2505: 85. MIN Matthiola incana 1 1 509
2506: 86. MRA Mucor racemosus 8 7 6601
2507: 87. MRM Mucor circinelloides 1 1 4399
2508: 88. MRM Mucor miehei 2 2 3316
2509: 89. MRP Mucor pusillus 1 1 1965
2510: 90. MZE Zea mays 235 193 295885
2511: 91. MZE Zea mexicana 2 2 360
2512: 92. NAN Nanochlorum eucaryotum 2 1 1796
2513: 93. NEU Neurospora crassa 116 102 168075
2514: 94. OCH Ochromonas danica 1 1 1789
2515: 95. PAN Podospora anserina 3 3 1901
2516: 96. PAP Papaver somniferum 2 2 864
2517: 97. PBL Phycomyces blakesleeanus 3 3 545
2518: 98. PCP Physcomitrella patens 1 1 2544
2519: 99. PEA Pisum sativum 81 72 85166
2520: 100. PEC Penicillium chrysogenum 7 6 20758
2521: 101. PEP Penicillium patulum 1 1 6357
2522: 102. PET Petunia hybrida 37 31 28787
2523: 103. PET Petunia sp. 25 24 13154
2524: 104. PHA Phanerochaete chrysosporium 14 10 18832
2525: 105. PHN Pharbitis nil 10 5 534
2526: 106. PHO Petroselinum hortense 1 1 1431
2527: 107. PHT Phytophthora megasperma 4 2 3031
2528: 108. PHV Phaseolus lunatus 1 1 926
2529: 109. PHV Phaseolus vulgaris 48 37 43358
2530: 110. PIN Pinus contorta 1 1 745
2531: 111. PIN Pinus sylvestris 2 1 583
2532: 112. PIN Pinus thunbergii 4 2 1889
2533: 113. PMI Prorocentrum micans 2 1 3408
2534: 114. POA Polytomella agilis 3 3 6616
2535: 115. POM Polystichum munitum 4 4 4645
2536: 116. POP Populus sp. 17 10 3776
2537: 117. POT Solanum tuberosum 59 52 81544
2538: 118. PSJ Psathyrostachys juncea 2 2 2035
2539: 119. PTE Porphyra umbilicalis 2 1 121
2540: 120. PUM Petroselinum crispum 10 10 6270
2541: 121. PYL Pylaiella littoralis 2 1 1644
2542: 122. RAD Raphanus sativus 9 6 2470
2543: 123. RCC Ricinus communis 6 6 10485
2544: 124. RCH Rhizopus chinensis 1 1 1133
2545: 125. RCH Rhizopus niveus 2 2 3448
2546: 126. RCH Rhizopus oryzae 1 1 2290
2547: 127. RDT Rhodotorula rubra 2 1 3586
2548: 128. RHD Rhodosporidium toruloides 2 2 3181
2549: 129. RHP Parasponia andersonii 1 1 1520
2550: 130. RHP Parasponia rhizobium 2 2 5530
2551: 131. RIC Oryza sativa 81 56 78071
2552: 132. RYE Secale cereale 3 3 5870
2553: 133. SAL Sinapis alba 12 8 8316
2554: 134. SCN Schwanniomyces occidentalis 2 1 2292
2555: 135. SCO Schizophyllum commune 5 5 4836
2556: 136. SES Sesbania rostrata 6 5 3948
2557: 137. SIP Silene pratensis 4 4 3165
2558: 138. SLM Physarum polycephalum 59 49 52689
2559: 139. SOY Glycine max 144 111 165611
2560: 140. SPI Spinacia oleracea 37 30 34346
2561: 141. SRG Sorghum bicolor 9 5 6336
2562: 142. SRG Sorghum sp. 2 1 4638
2563: 143. SSI Scilla siberica 4 4 204
2564: 144. SSY Sisymbrium irio 2 1 433
2565: 145. TDA Thaumatococcus daniellii 1 1 931
2566: 146. TFR Trifolium repens 2 1 1268
2567: 147. THI Thinopyrum elongatum 1 1 1375
2568: 148. TLA Thermomyces lanuginosus 6 5 3804
2569: 149. TOB Nicotiana alata 1 1 804
2570: 150. TOB Nicotiana plumbaginifolia 12 11 22286
2571: 151. TOB Nicotiana rustica 2 2 593
2572: 152. TOB Nicotiana sylvestris 4 4 1382
2573: 153. TOB Nicotiana tabacum 67 51 71765
2574: 154. TOM Lycopersicon esculentum 91 72 94920
2575: 155. TOM Lycopersicon peruvianum 1 1 480
2576: 156. TRD Tripsacum dactyloides 6 6 1528
2577: 157. TRH Trichosanthes kirilowii 2 2 2237
2578: 158. TRR Trichoderma reesei 4 4 8343
2579: 159. TRT Trema tomentosa 2 1 1727
2580: 160. URO Uromyces appendiculatus 1 1 1449
2581: 161. USM Ustilago maydis 3 2 2656
2582: 162. VFA Vicia faba 39 33 29444
2583: 163. VIR Vigna mungo 2 1 1314
2584: 164. VIR Vigna radiata 5 4 4354
2585: 165. VVC Volvox carteri 11 9 17792
2586: 166. WHT Triticum aestivum 100 79 115801
2587: 167. WHT Triticum durum 1 1 898
2588: 168. WHT Triticum sp. 8 8 8441
2589: 169. WHT Triticum vulgare 1 1 965
2590: 170. YS1 Zygosaccharomyces fermentati 1 1 5416
2591: 171. YS2 Saccharomycopsis fibuligera 3 3 9339
2592: 172. YS4 Candida boidinii 2 2 1863
2593: 173. YS5 Candida glabrata 3 3 2758
2594: 174. YSA Candida albicans 11 8 13498
2595: 175. YSB Candida tropicalis 16 13 23751
2596: 176. YSC Saccharomyces cerevisiae 1131 933 1716468
2597: 177. YSCTY Transposable element TY1 53 44 52430
2598: 178. YSD Saccharomyces diastaticus 4 4 4319
2599: 179. YSD Saccharomyces douglassi 1 1 4072
2600: 180. YSE Candida pelliculosa 1 1 5327
2601: 181. YSF Candida maltosa 5 4 8167
2602: 182. YSG Saccharomyces carlsbergensis 23 20 38053
2603: 183. YSH Hansenula wingei 3 3 720
2604: 184. YSI Saccharomyces fibuligera 2 2 6761
2605: 185. YSJ Yarrowia lipolytica 8 7 17412
2606: 186. YSK Kluyveromyces lactis 41 32 86137
2607: 187. YSM Hansenula polymorpha 3 3 8018
2608: 188. YSN Kluyveromyces fragilis 1 1 4193
2609: 189. YSO Zygosaccharomyces rouxii 5 3 15025
2610: 190. YSP Schizosaccharomyces japonicus 1 1 108
2611: 191. YSP Schizosaccharomyces malidevorans 1 1 107
2612: 192. YSP Schizosaccharomyces octosporus 1 1 109
2613: 193. YSP Schizosaccharomyces pombe 135 114 206826
2614: 194. YSQ Pichia pastoris 3 3 899
2615: 195. YSS Cephalosporium acremonium 5 5 2757
2616: 196. YST Yeast sp. 38 37 17556
2617: 197. YSU Candida utilis 4 4 7578
2618: 198. YSV Saccharomyces uvarum 1 1 2001
2619: 199. YSW Kluyveromyces drosophilarum 1 1 4757
2620: 200. YSX Saccharomyces rosei 1 1 278
2621: 201. YSY Saccharomyces kluyveri 5 4 2875
2622: 202. YSZ Zygosaccharomyces bailii 1 1 5415
2623: 203. ZAM Zamia pumila 1 1 1813
2624:
2625: Total 3636 2976 4659180
2626:
2627: ORGANELLE
2628:
2629: Key Name Reports Entries Bases
2630: -------------------------------------------------------------------------------
2631: 1. ABGMT Mitochondrion Absidia glauca 1 1 596
2632: 2. ACUCP Chloroplast Acorus calamus 1 1 103
2633: 3. AEGCP Chloroplast Aegilops crassa 2 1 1436
2634: 4. AEGCP Chloroplast Aegilops squarrosa 1 1 203
2635: 5. AKOMT Mitochondrion Akodon aerosus 6 6 2406
2636: 6. AKOMT Mitochondrion Akodon andinus 1 1 401
2637: 7. AKOMT Mitochondrion Akodon boliviensis 2 2 802
2638: 8. AKOMT Mitochondrion Akodon jelskii 3 3 1203
2639: 9. AKOMT Mitochondrion Akodon juninensis 1 1 401
2640: 10. AKOMT Mitochondrion Akodon kofordi 1 1 401
2641: 11. AKOMT Mitochondrion Akodon mollis 1 1 401
2642: 12. AKOMT Mitochondrion Akodon puer 1 1 401
2643: 13. AKOMT Mitochondrion Akodon subfuscus 3 3 1203
2644: 14. AKOMT Mitochondrion Akodon torques 3 3 1203
2645: 15. ALFCP Chloroplast Medicago sativa 3 3 3460
2646: 16. AMDCP Chloroplast Acetabularia mediterranea 1 1 1175
2647: 17. AMFMT Mitochondrion Apis mellifera 1 1 2949
2648: 18. AMHCP Chloroplast Amaranthus hybridus 1 1 1187
2649: 19. AMTMT Mitochondrion Ambystoma tigrinum 2 1 225
2650: 20. ASIMT Mitochondrion Ascobolus immersus 2 1 5142
2651: 21. ASNMT Mitochondrion Aspergillus amstelodami 1 1 624
2652: 22. ASNMT Mitochondrion Aspergillus nidulans 22 18 32837
2653: 23. ASTCP Chloroplast Avena sativa 1 1 1623
2654: 24. ATHCP Chloroplast Arabidopsis thaliana 3 2 1499
2655: 25. ATHMT Mitochondrion Arabidopsis thaliana 2 1 880
2656: 26. ATPCP Chloroplast Atriplex patula 1 1 1786
2657: 27. ATPCP Chloroplast Atriplex rosea 1 1 1790
2658: 28. BETMT Mitochondrion Beta vulgaris 4 4 5919
2659: 29. BLSMT Mitochondrion Boletus satanas 2 1 341
2660: 30. BLYCP Chloroplast Hordeum vulgare 18 13 23006
2661: 31. BNACP Chloroplast Brassica napus 1 1 1633
2662: 32. BOLCP Chloroplast Brassica oleracea 1 1 543
2663: 33. BOLMT Mitochondrion Brassica oleracea 1 1 549
2664: 34. BOLMT Mitochondrion Brassica sp. 2 2 770
2665: 35. BOMMT Mitochondrion Bolomys amoenus 2 2 802
2666: 36. BOVMT Mitochondrion Bos taurus 6 5 19563
2667: 37. CEUMT Mitochondrion Cervus unicolor 1 1 2682
2668: 38. CHECP Chloroplast Chenopodium album 1 1 207
2669: 39. CHKMT Mitochondrion Gallus domesticus 1 1 3571
2670: 40. CHKMT Mitochondrion Gallus gallus 4 3 1153
2671: 41. CHLCP Chloroplast Chlorella ellipsoidea 12 9 15063
2672: 42. CHPMT Mitochondrion Pan troglodytes 1 1 896
2673: 43. CNAMT Mitochondrion Citrullus lanatus 1 1 4512
2674: 44. COCMT Mitochondrion Cochliobolus heterostrophus 1 1 1827
2675: 45. CODCP Chloroplast Codium fragile 4 4 304
2676: 46. COOCP Chloroplast Coleochaete orbicularis 1 1 1587
2677: 47. CPACP Chloroplast Cyanophora paradoxa 11 8 2748
2678: 48. CPACY Cyanelle Cyanophora paradoxa 5 5 5317
2679: 49. CRECP Chloroplast Chlamydomonas moewusii 3 2 8107
2680: 50. CRECP Chloroplast Chlamydomonas reinhardtii 42 33 37766
2681: 51. CRECP Chloroplast Chlamydomonas sp. 3 2 2346
2682: 52. CREMT Mitochondrion Chlamydomonas reinhardtii 26 22 21535
2683: 53. CRRMT Mitochondrion Corcorax melanorhamphos 1 1 239
2684: 54. CRYCP Chloroplast Cryptomonas phi 2 1 715
2685: 55. DARMT Mitochondrion Daucus carota 1 1 690
2686: 56. DIPMT Mitochondrion Dipodomys californicus 1 1 239
2687: 57. DIPMT Mitochondrion Dipodomys heermanni 1 1 239
2688: 58. DIPMT Mitochondrion Dipodomys panamintinus 2 2 478
2689: 59. DRMMT Mitochondrion Drosophila mauritania 1 1 976
2690: 60. DROMT Mitochondrion Drosophila melanogaster 9 6 16133
2691: 61. DRSMT Mitochondrion Drosophila simulans 1 1 975
2692: 62. DRVMT Mitochondrion Drosophila virilis 1 1 191
2693: 63. DRYMT Mitochondrion Drosophila yakuba 9 3 19938
2694: 64. EGRCP Chloroplast Euglena gracilis 35 24 62419
2695: 65. EQQMT Mitochondrion Equus quagga 2 2 229
2696: 66. FHEMT Mitochondrion Fasciola hepatica 2 1 708
2697: 67. FRGMT Mitochondrion Rana catesbeiana 9 5 12109
2698: 68. FSBMT Mitochondrion Acipenser transmontano 2 1 156
2699: 69. FSBMT Mitochondrion Cichlosoma centrarchus 1 1 239
2700: 70. FSBMT Mitochondrion Cichlosoma citrinellum 1 1 239
2701: 71. FSBMT Mitochondrion Cichlosoma labiatum 1 1 239
2702: 72. FSBMT Mitochondrion Cichlosoma nicaraguense 1 1 239
2703: 73. FSBMT Mitochondrion Cyprinus carpio 3 3 873
2704: 74. FSBMT Mitochondrion Julidochromis regani 1 1 239
2705: 75. FSBMT Mitochondrion Salmo gairdneri 2 2 855
2706: 76. FTRCP Chloroplast Flaveria bidentis 1 1 1839
2707: 77. FTRCP Chloroplast Flaveria pringlei 1 1 1842
2708: 78. GCOCP Chloroplast Gracilaria tenuistipitata 1 1 1930
2709: 79. GIBMT Mitochondrion Hylobates lar 1 1 896
2710: 80. GORMT Mitochondrion Gorilla gorilla 1 1 896
2711: 81. HAMMT Mitochondrion Cricetulus sp. 1 1 880
2712: 82. HNNMT Mitochondrion Helianthus annuus 1 1 1336
2713: 83. HUMMT Mitochondrion Homo sapiens 43 36 37585
2714: 84. HYRMT Mitochondrion Hydropotes inermis 1 1 2680
2715: 85. IPBCP Chloroplast Ipomoea batatas 1 1 2004
2716: 86. LEIKP Kinetoplast Leishmania aethiopica 1 1 376
2717: 87. LEIKP Kinetoplast Leishmania major 2 1 1031
2718: 88. LEIKP Kinetoplast Leishmania mexicana 3 3 2134
2719: 89. LEIKP Kinetoplast Leishmania tarentolae 22 16 28301
2720: 90. LEIMT Mitochondrion Leishmania tarentolae 1 1 189
2721: 91. LEMMT Mitochondrion Lemur catta 1 1 895
2722: 92. LIGCP Chloroplast Ligularia calthifolia 1 1 103
2723: 93. LMIMT Mitochondrion Locusta migratoria 3 3 5118
2724: 94. LUAMT Mitochondrion Lupinus angustifolius 2 2 1330
2725: 95. LUPMT Mitochondrion Lupinus luteus 2 1 630
2726: 96. MACMT Mitochondrion Macaca fascicularis 2 2 1598
2727: 97. MACMT Mitochondrion Macaca fuscata 1 1 896
2728: 98. MACMT Mitochondrion Macaca mulatta 1 1 896
2729: 99. MACMT Mitochondrion Macaca sylvanus 1 1 896
2730: 100. MCXMT Mitochondrion Microxus mimus 2 2 802
2731: 101. MMUMT Mitochondrion Muntiacus reevesi 1 1 2682
2732: 102. MPOCP Chloroplast Marchantia polymorpha 10 1 121024
2733: 103. MSQMT Mitochondrion Aedes albopictus 9 9 3448
2734: 104. MUSMT Mitochondrion Mus musculus 22 15 20678
2735: 105. MZECP Chloroplast Zea mays 72 55 54150
2736: 106. MZECP Chloroplast Zea perennis 2 2 1456
2737: 107. MZEMT Mitochondrion Zea mays 53 45 80743
2738: 108. NEUMT Mitochondrion Neurospora crassa 44 39 55773
2739: 109. NEUMT Mitochondrion Neurospora intermedia 5 4 10804
2740: 110. NRACP Chloroplast Neurachne munroi 1 1 1990
2741: 111. NRACP Chloroplast Neurachne tenuifolia 1 1 2010
2742: 112. OBECP Chloroplast Oenothera berteriana 6 4 1813
2743: 113. OBEMT Mitochondrion Oenothera berteriana 18 15 25036
2744: 114. OBOCP Chloroplast Oenothera odorata 2 2 964
2745: 115. ODOMT Mitochondrion Odocoileus virginianus 1 1 2677
2746: 116. OHOCP Chloroplast Oenothera hookeri 2 2 2132
2747: 117. OLICP Chloroplast Olisthodiscus luteus 1 1 714
2748: 118. ORAMT Mitochondrion Pongo pygmaeus 1 1 895
2749: 119. OSPMT Mitochondrion Oenothera sp. 2 2 1635
2750: 120. PALMT Mitochondrion Paracentrotus lividus 17 17 21974
2751: 121. PANMT Mitochondrion Podospora anserina 21 20 65750
2752: 122. PARMT Mitochondrion Paramecium aurelia 9 9 8110
2753: 123. PARMT Mitochondrion Paramecium primaurelia 4 3 5645
2754: 124. PARMT Mitochondrion Paramecium sp. 34 17 12563
2755: 125. PARMT Mitochondrion Paramecium tetraurelia 4 4 5844
2756: 126. PEACP Chloroplast Pisum sativum 36 30 55118
2757: 127. PEAMT Mitochondrion Pisum sativum 8 6 10694
2758: 128. PENCP Chloroplast Pennisetum americanum 2 1 325
2759: 129. PETCP Chloroplast Petunia hybrida 7 7 5806
2760: 130. PETMT Mitochondrion Petunia hybrida 4 3 2954
2761: 131. PETMT Mitochondrion Petunia parodii 1 1 1774
2762: 132. PFAMT Mitochondrion Plasmodium falciparum 1 1 935
2763: 133. PGYMT Mitochondrion Paragyrodon sphaerosporus 2 1 337
2764: 134. PHVMT Mitochondrion Phaseolus vulgaris 1 1 88
2765: 135. PIGMT Mitochondrion Sus scrofa 2 2 686
2766: 136. PILCP Chloroplast Pilayella littoralis 1 1 353
2767: 137. PMGMT Mitochondrion Placopecten magellanicus 5 5 4580
2768: 138. POGMT Mitochondrion Thomomys townsendi 1 1 239
2769: 139. PZOCP Chloroplast Pelargonium zonale 2 2 463
2770: 140. RADMT Mitochondrion Raphanus sativus 2 2 5752
2771: 141. RATMT Mitochondrion Rattus norvegicus 35 26 23507
2772: 142. RATMT Mitochondrion Rattus rattus 4 4 4388
2773: 143. RHS Mitochondrion Rhizopogon achraeceorubens 2 1 341
2774: 144. RHS Mitochondrion Rhizopogon subcaerulescens 2 1 341
2775: 145. RICCP Chloroplast Oryza sativa 11 9 12553
2776: 146. RICMT Mitochondrion Oryza sativa 5 5 8084
2777: 147. RYECP Chloroplast Secale cereale 9 7 9269
2778: 148. SAIMT Mitochondrion Saimiri sciureus 1 1 893
2779: 149. SALCP Chloroplast Sinapis alba 8 6 9874
2780: 150. SAOCP Chloroplast Saponaria officinalis 1 1 1252
2781: 151. SCOMT Mitochondrion Schizophyllum commune 1 1 1120
2782: 152. SLMMT Mitochondrion Physarum polycephalum 1 1 1536
2783: 153. SNICP Chloroplast Solanum nigrum 1 1 1501
2784: 154. SOLCP Chloroplast Spirodela oligorhiza 9 9 6538
2785: 155. SOYCP Chloroplast Glycine max 14 9 16045
2786: 156. SOYMT Mitochondrion Glycine max 5 5 8683
2787: 157. SPFMT Mitochondrion Spodoptera frugiperda 1 1 446
2788: 158. SPICP Chloroplast Spinacia oleracea 47 38 79017
2789: 159. SRGCP Chloroplast Sorghum bicolor 1 1 862
2790: 160. SRGMT Mitochondrion Sorghum sp. 4 2 4768
2791: 161. STFMT Mitochondrion Asterina pectinifera 1 1 3849
2792: 162. SUIMT Mitochondrion Suillus cavipes 2 1 339
2793: 163. SUSMT Mitochondrion Strongylocentrotus drobachiensis
2794: 2 2 965
2795: 164. SUSMT Mitochondrion Strongylocentrotus franciscanus
2796: 3 3 1276
2797: 165. SUSMT Mitochondrion Strongylocentrotus intermedius
2798: 2 2 960
2799: 166. SUSMT Mitochondrion Strongylocentrotus pallidus 2 2 961
2800: 167. SUSMT Mitochondrion Strongylocentrotus purpuratus
2801: 5 4 16929
2802: 168. TARMT Mitochondrion Tarsius syrichta 1 1 895
2803: 169. TETMT Mitochondrion Tetrahymena thermophila 1 1 53
2804: 170. TEYMT Mitochondrion Tetrahymena pyriformis 14 13 12462
2805: 171. TOBCP Chloroplast Nicotiana acuminata 1 1 2052
2806: 172. TOBCP Chloroplast Nicotiana debneyi 3 3 4016
2807: 173. TOBCP Chloroplast Nicotiana otophora 1 1 2052
2808: 174. TOBCP Chloroplast Nicotiana plumbaginifolia 6 4 4169
2809: 175. TOBCP Chloroplast Nicotiana tabacum 47 40 200231
2810: 176. TOBMT Mitochondrion Nicotiana plumbaginifolia 2 1 1740
2811: 177. TOBMT Mitochondrion Nicotiana tabacum 4 3 4074
2812: 178. TOMCP Chloroplast Lycopersicon esculentum 1 1 103
2813: 179. TOMMT Mitochondrion Lycopersicon esculentum 2 1 558
2814: 180. TRBKP Kinetoplast Trypanosoma brucei 27 21 37228
2815: 181. TRBMT Mitochondrion Trypanosoma brucei 4 4 2285
2816: 182. TRCKP Kinetoplast Trypanosoma cruzi 27 27 11864
2817: 183. TREKP Kinetoplast Trypanosoma equiperdum 2 2 2017
2818: 184. TREKP Kinetoplast Trypanosoma evansi 3 2 1998
2819: 185. TRFKP Kinetoplast Crithidia fasciculata 19 18 12549
2820: 186. TRFKP Kinetoplast Crithidia oncopelti 6 3 1231
2821: 187. TRFMT Mitochondrion Crithidia fasciculata 2 1 2034
2822: 188. TRFMT Mitochondrion Crithidia oncopelti 1 1 149
2823: 189. TRLKP Kinetoplast Leptomonas sp. 1 1 2568
2824: 190. TRWKP Kinetoplast Trypanosoma lewisi 2 2 2036
2825: 191. VFACP Chloroplast Vicia faba 6 6 9547
2826: 192. VFAMT Mitochondrion Vicia faba 4 4 9356
2827: 193. WARMT Mitochondrion Pomatostomus isidori 1 1 239
2828: 194. WARMT Mitochondrion Pomatostomus ruficeps 1 1 239
2829: 195. WARMT Mitochondrion Pomatostomus superciliosus 1 1 239
2830: 196. WARMT Mitochondrion Pomatostomus temporalis 1 1 239
2831: 197. WHTCP Chloroplast Triticum aestivum 28 26 26229
2832: 198. WHTMT Mitochondrion Triticum aestivum 33 24 28296
2833: 199. XELMT Mitochondrion Xenopus laevis 6 5 25196
2834: 200. XERMT Mitochondrion Xerocomus chrysenteron 2 1 339
2835: 201. YSCMT Mitochondrion Saccharomyces cerevisiae 191 171 142613
2836: 202. YSGMT Mitochondrion Saccharomyces carlsbergensis
2837: 1 1 149
2838: 203. YSKMT Mitochondrion Kluyveromyces lactis 78 39 3699
2839: 204. YSKMT Mitochondrion Kluyveromyces thermotolerans
2840: 5 3 1287
2841: 205. YSLMT Mitochondrion Torulopsis glabrata 10 9 6200
2842: 206. YSPMT Mitochondrion Schizosaccharomyces pombe 10 9 13361
2843: 207. YSSMT Mitochondrion Cephalosporium acremonium 2 2 3029
2844: 208. YSTMT Mitochondrion Yeast sp. 4 4 5196
2845: 209. YSUMT Mitochondrion Candida utilis 1 1 306
2846: 210. YSVMT Mitochondrion Saccharomyces uvarum 3 3 2296
2847:
2848: Total 1569 1271 1848854
2849:
2850: BACTERIAL
2851:
2852: Key Name Reports Entries Bases
2853: -------------------------------------------------------------------------------
2854: 1. ABC Acetobacter aceti 1 1 1624
2855: 2. ABC Acetobacter xylinum 1 1 9540
2856: 3. AC2 Plasmid pAC27 1 1 803
2857: 4. ACC Acinetobacter calcoaceticus 13 10 27286
2858: 5. ACC Acinetobacter sp. 4 3 5193
2859: 6. ACH Achromobacter sp. 1 1 2414
2860: 7. ACL Acholeplasma laidlawii 3 2 1858
2861: 8. ACN Actinobacillus actinomycetemcomitans 1 1 3842
2862: 9. ACN Actinobacillus pleuropneumoniae 1 1 3831
2863: 10. ACO Acetogenium kivui 1 1 2477
2864: 11. ACY Actinomyces naeslundii 1 1 2160
2865: 12. ACY Actinomyces viscosus 2 1 1850
2866: 13. AER Aeromicrobium erythreus 1 1 1463
2867: 14. AFA Alcaligenes eutrophus 9 9 23911
2868: 15. AFA Alcaligenes faecalis 4 4 8591
2869: 16. AFA Plasmid pJP4 7 5 11105
2870: 17. AHA Aphanocapsa sp. 1 1 1920
2871: 18. AMC Acidaminococcus fermentans 2 1 3245
2872: 19. AMS Ampullariella sp. 1 1 1892
2873: 20. ANA Anabaena 7120 8 6 15235
2874: 21. ANA Anabaena sp. 27 19 26386
2875: 22. ANI Anacystis nidulans 32 26 36901
2876: 23. ANN Actinoplanes missouriensis 2 1 1639
2877: 24. APM Anaplasma marginale 5 4 11487
2878: 25. AQU Agmenellum quadruplicatum 3 3 5497
2879: 26. ARF Archaeoglobus fulgidus 3 2 1727
2880: 27. ARG Arthrobacter sp. 1 1 2075
2881: 28. ATU Agrobacterium rhizogenes 8 6 14645
2882: 29. ATU Agrobacterium sp. 1 1 1599
2883: 30. ATU Agrobacterium tumefaciens 38 31 62767
2884: 31. AVH Azotobacter chroococcum 1 1 1654
2885: 32. AVI Azotobacter vinelandii 24 20 80709
2886: 33. AZS Azospirillum brasilense 1 1 1910
2887: 34. BAD Bacillus caldolyticus 1 1 1147
2888: 35. BAL Bacillus caldotenax 5 5 5550
2889: 36. BAM Bacillus amyloliquefaciens 19 16 17715
2890: 37. BAN Bacillus anthracis 4 4 14401
2891: 38. BBR Bacillus brevis 11 10 25879
2892: 39. BC1 Plasmid pBC16 1 1 257
2893: 40. BCC Bacillus coagulans 2 2 2433
2894: 41. BCE Bacillus cereus 18 14 17016
2895: 42. BCI Bacillus circulans 9 6 12823
2896: 43. BCQ Bacillus Q 5 5 786
2897: 44. BFI Bacillus firmus 1 1 1434
2898: 45. BIF Bifidobacterium longum 1 1 1767
2899: 46. BLI Bacillus licheniformis 21 18 19815
2900: 47. BLL Bacillus lautus 1 1 2323
2901: 48. BMA Bacillus macerans 1 1 2744
2902: 49. BME Bacillus megaterium 24 19 31050
2903: 50. BNG Bacteroides gingivalis 1 1 1420
2904: 51. BNO Bacteroides nodosus 6 5 4828
2905: 52. BNR Bacteroides fragilis 6 5 8556
2906: 53. BOR Borrelia burgdorferei 7 6 3181
2907: 54. BPE Bordetella bronchiseptica 1 1 4936
2908: 55. BPE Bordetella parapertussis 3 3 4749
2909: 56. BPE Bordetella pertussis 21 15 42985
2910: 57. BPO Bacillus polymyxa 4 3 8764
2911: 58. BPU Bacillus pumilus 12 9 8915
2912: 59. BRL Brevibacterium epidermidis 2 1 1721
2913: 60. BRL Brevibacterium lactofermentum 9 8 22049
2914: 61. BRU Brucella abortus 2 2 5253
2915: 62. BS2 Plasmid pBS2 1 1 2279
2916: 63. BSN Bacillus natto 1 1 676
2917: 64. BSP Bacillus sp. 22 21 48210
2918: 65. BSS Bacillus sphaericus 16 12 30172
2919: 66. BST Bacillus stearothermophilus 41 39 62425
2920: 67. BSU Bacillus subtilis 274 216 357850
2921: 68. BTH Bacillus thuringiensis 67 52 149803
2922: 69. BTT Thermoactinomyces thalpophilus 2 2 1036
2923: 70. BUT Butyrivibrio fibrisolvens 2 2 4411
2924: 71. C11 Plasmid pColBM-C1139 1 1 2149
2925: 72. C1B Plasmid Colicin B4 3 3 1561
2926: 73. CAJ Campylobacter coli 3 3 4985
2927: 74. CAJ Campylobacter fetus 1 1 3974
2928: 75. CAJ Campylobacter jejuni 4 4 5410
2929: 76. CB2 Plasmid Colicin B2 1 1 360
2930: 77. CCR Caulobacter crescentus 24 24 15105
2931: 78. CD1 Plasmid Colicin D 1 1 1099
2932: 79. CDC Caldocellum saccharolyticum 6 5 16338
2933: 80. CE1 Plasmid Colicin E1 42 31 17722
2934: 81. CE2 Plasmid Colicin E2 5 5 4629
2935: 82. CE3 Plasmid Colicin E3 1 1 392
2936: 83. CE5 Plasmid Colicin E5-099 2 2 2401
2937: 84. CE8 Plasmid Colicin E8 1 1 1268
2938: 85. CE9 Plasmid Colicin E9 2 2 2148
2939: 86. CEC Plasmid Colicin E3-CA38 8 3 4883
2940: 87. CEC Plasmid Colicin E6-CT14 2 2 4675
2941: 88. CFI Cellulomonas fimi 7 7 5787
2942: 89. CFI Cellulomonas uda 1 1 1828
2943: 90. CFR Citrobacter freundii 6 5 4522
2944: 91. CFX Chloroflexus aurantiacus 4 3 3568
2945: 92. CGF Chlorogloeopsis fritschii 1 1 210
2946: 93. CH1 Plasmid pCHL1 2 2 8101
2947: 94. CHT Chlamydia psittaci 5 5 6932
2948: 95. CHT Chlamydia trachomatis 36 22 61362
2949: 96. CIA Plasmid Colicin Ia 1 1 3727
2950: 97. CIB Plasmid Colicin Ib 5 5 10934
2951: 98. CIB Plasmid Colicin Ib-P9 2 2 2373
2952: 99. CLA Plasmid Colicin A 4 4 3950
2953: 100. CLD Plasmid CloDF13 13 1 9957
2954: 101. CLK Plasmid Colicin K 2 2 815
2955: 102. CLN Plasmid Colicin V 1 1 412
2956: 103. CLN Plasmid Colicin V-K30 1 1 1465
2957: 104. CLN2 Plasmid Colicin V2-K94 1 1 550
2958: 105. CLO Clostridium acetobutylicum 9 7 14479
2959: 106. CLO Clostridium acidiurici 1 1 2266
2960: 107. CLO Clostridium botulinum 1 1 4835
2961: 108. CLO Clostridium cellulolyticum 1 1 2405
2962: 109. CLO Clostridium difficile 4 3 10451
2963: 110. CLO Clostridium innocuum 1 1 1544
2964: 111. CLO Clostridium pasteurianum 25 17 18163
2965: 112. CLO Clostridium perfringens 10 9 7118
2966: 113. CLO Clostridium sordellii 1 1 1504
2967: 114. CLO Clostridium tetani 3 3 10529
2968: 115. CLO Clostridium thermoaceticum 1 1 1965
2969: 116. CLO Clostridium thermocellum 10 8 16656
2970: 117. CLO Clostridium thermohydrosulfuricum 1 1 4839
2971: 118. CLO Clostridium thermosulfurogenes 2 2 4258
2972: 119. CLT Calothrix sp. 12 12 1936
2973: 120. CLV Plasmid ColVBtrp 1 1 441
2974: 121. CN2 Plasmid pCN2 1 1 117
2975: 122. CN3 Plasmid pCN3 1 1 114
2976: 123. COR Corynebacterium diphtheriae 2 1 2529
2977: 124. COR Corynebacterium glutamicum 9 6 16234
2978: 125. COR Corynebacterium nephridii 1 1 615
2979: 126. COR Corynebacterium sp. 2 2 2512
2980: 127. COX Coxiella burnetii 3 3 5956
2981: 128. CPC Cryptococcus albidus 2 1 2984
2982: 129. CPC Cryptococcus neoformans 1 1 2029
2983: 130. CYA Cyanobacterium nostoc 2 2 4220
2984: 131. CYT Cytophaga lytica 1 1 1509
2985: 132. DCG Dictyoglomus thermophilum 3 3 6900
2986: 133. DEI Deinococcus radiodurans 2 2 4970
2987: 134. DMO Desulfurococcus mobilis 7 7 9730
2988: 135. DSB Desulfobacterium autotrophicum 1 1 1376
2989: 136. DSB Desulfobacterium niacini 1 1 1375
2990: 137. DSB Desulfobacterium vacuolatum 1 1 1383
2991: 138. DSF Desulfococcus multivorans 1 1 1372
2992: 139. DSI Desulfomicrobium baculatus 1 1 1379
2993: 140. DSL Desulfomonas pigra 1 1 1381
2994: 141. DSO Desulfotomaculum orientis 1 1 1402
2995: 142. DSO Desulfotomaculum ruminis 1 1 1368
2996: 143. DSP Desulfobacter curvatus 1 1 1396
2997: 144. DSP Desulfobacter hydrogenophilus 1 1 1390
2998: 145. DSP Desulfobacter latus 1 1 1373
2999: 146. DSP Desulfobacter sp. 2 2 2869
3000: 147. DSU Desulfobulbus propionicus 1 1 1371
3001: 148. DSU Desulfobulbus sp. 1 1 1365
3002: 149. DSV Desulfosarcina variabilis 1 1 1527
3003: 150. DVU Desulfovibrio africanus 1 1 1382
3004: 151. DVU Desulfovibrio baarsii 2 2 1589
3005: 152. DVU Desulfovibrio baculatus 2 1 2589
3006: 153. DVU Desulfovibrio desulfuricans 5 5 4678
3007: 154. DVU Desulfovibrio fructosovorans 1 1 3180
3008: 155. DVU Desulfovibrio gigas 2 2 4126
3009: 156. DVU Desulfovibrio multispirans 1 1 186
3010: 157. DVU Desulfovibrio salexigens 2 2 2107
3011: 158. DVU Desulfovibrio sapovorans 1 1 1395
3012: 159. DVU Desulfovibrio vulgaris 9 9 11185
3013: 160. EAM Erwinia amylovora 2 2 1641
3014: 161. ECA Erwinia carotovora 14 12 17877
3015: 162. ECB Erwinia herbicola 1 1 4902
3016: 163. ECH Erwinia chrysanthemi 19 13 27706
3017: 164. ECO Escherichia coli 1716 1188 1848929
3018: 165. ECO Escherichia fergusonii 1 1 3133
3019: 166. ECO F sex factor plasmid 3 3 5370
3020: 167. ECO Plasmid Colicin BM-Cl139 3 3 3707
3021: 168. ECO Plasmid pCU1 1 1 2056
3022: 169. ECO Plasmid pF166 1 1 2133
3023: 170. EHP Ectothiorhodospira halophila 1 1 121
3024: 171. EHR Ehrlichia risticii 2 1 1498
3025: 172. EHV Ectothiorhodospira vacuolata 1 1 120
3026: 173. ENC Enterococcus faecium 1 1 1900
3027: 174. ENR Plasmid ENTR 2 2 1273
3028: 175. ENS Plasmid ENT 1 1 866
3029: 176. ENT Enterobacter aerogenes 6 6 5330
3030: 177. ENT Enterobacter agglomerans 3 3 1272
3031: 178. ENT Enterobacter cloacae 8 8 9262
3032: 179. ETA Edwardsiella tarda 2 1 306
3033: 180. EUB Eubacterium sp. 6 5 10586
3034: 181. FA3 Plasmid pFA3 1 1 1597
3035: 182. FDI Fremyella diplosiphon 26 20 29933
3036: 183. FIB Fibrobacter succinogenes 2 2 3736
3037: 184. FPL Plasmid F 30 23 37662
3038: 185. FRA Frankia sp. 3 2 3758
3039: 186. FRN Francisella tularensis 1 1 1233
3040: 187. FVB Flavobacterium heparinum 1 1 1528
3041: 188. FVB Flavobacterium okeanokoites 8 8 9873
3042: 189. FVB Flavobacterium sp. 4 4 6028
3043: 190. GS5 Plasmid pGS05 1 1 1357
3044: 191. HAF Hafnia alvei 1 1 2961
3045: 192. HAL Halobacterium cutirubrum 7 5 8712
3046: 193. HAL Halobacterium halobium 36 26 45031
3047: 194. HAL Halobacterium salinarium 1 1 606
3048: 195. HAL Halobacterium sp. 14 13 22487
3049: 196. HAL Haloferax volcanii 1 1 3566
3050: 197. HCL Heliobacterium chlorum 1 1 1512
3051: 198. HCU Halobacterium cutirubrum 2 1 3116
3052: 199. HEC Helicobacter felis 3 2 2887
3053: 200. HEC Helicobacter mustelae 2 1 1435
3054: 201. HEH Haemophilus haemolyticus 2 2 3186
3055: 202. HEI Haemophilus influenzae 96 41 17798
3056: 203. HEP Haemophilus parainfluenza 5 5 853
3057: 204. HLF Haloferax sp. 4 3 1187
3058: 205. HMO Halococcus morrhuae 2 2 4402
3059: 206. HPT Herpetosiphon aurantiacus 1 1 1484
3060: 207. HV2 Plasmid pHV2 1 1 6354
3061: 208. IM13 Plasmid pIM13 1 1 2246
3062: 209. INC Plasmid incB 1 1 352
3063: 210. INC Plasmid incI-1 1 1 418
3064: 211. INC Plasmid incI-gamma 1 1 417
3065: 212. INS Insertion sequence 10 10 4266
3066: 213. INS Insertion sequence IS1 5 4 3243
3067: 214. INS Insertion sequence IS150 2 1 1443
3068: 215. INS Insertion sequence IS186 2 2 2677
3069: 216. INS Insertion sequence IS2 4 4 517
3070: 217. INS Insertion sequence IS26 1 1 859
3071: 218. INS Insertion sequence IS30 1 1 1221
3072: 219. INS Insertion sequence IS4 1 1 1426
3073: 220. INS Insertion sequence IS476 1 1 1225
3074: 221. INS Insertion sequence IS493 1 1 1641
3075: 222. INS Insertion sequence IS5 3 2 1570
3076: 223. INS Insertion sequence IS891 1 1 1351
3077: 224. INS Insertion sequence ISHS1 1 1 1449
3078: 225. JD1 Plasmid pJD1 2 1 4207
3079: 226. JS3 Plasmid pJS37 3 3 252
3080: 227. KAE Klebsiella aerogenes 18 16 23367
3081: 228. KCI Kluyvera citrophila 1 1 2734
3082: 229. KPN Klebsiella pneumoniae 71 55 109716
3083: 230. KPN Plasmid pJHC-MW1 1 1 1352
3084: 231. KPO Klebsiella oxytoca 2 2 4901
3085: 232. KY1 Plasmid pKY1 1 1 3022
3086: 233. KYM Plasmid pKYM 1 1 2083
3087: 234. LAC Lactococcus lactis 13 12 24807
3088: 235. LAE Listonella ordalii 2 1 120
3089: 236. LAE Listonella tubiashii 2 1 120
3090: 237. LB1 Plasmid p1 1 1 533
3091: 238. LB3 Lactobacillus 30a 3 2 2189
3092: 239. LBA Lactobacillus acidophilus 1 1 400
3093: 240. LBB Lactobacillus bulgaricus 1 1 536
3094: 241. LBD Lactobacillus delbrueckii 7 4 5405
3095: 242. LBH Lactobacillus helveticus 1 1 3292
3096: 243. LBP Lactobacillus plantarum 3 2 3664
3097: 244. LBP Plasmid pC30il 1 1 2140
3098: 245. LBP Plasmid pLP1 1 1 2093
3099: 246. LCA Lactobacillus casei 6 6 9787
3100: 247. LCO Lactobacillus confusus 1 1 1320
3101: 248. LEP Leptospira biflexa 2 2 4788
3102: 249. LEP Leptospira interrogans 2 1 3244
3103: 250. LIS Listeria monocytogenes 3 2 3940
3104: 251. LM0 Plasmid pLM020 1 1 2330
3105: 252. LPN Legionella pneumophila 2 2 2005
3106: 253. LS1 Plasmid pLS11 1 1 253
3107: 254. MBA Methanobacterium ivanovii 2 1 1353
3108: 255. MBF Methanobacterium formicicum 1 1 3597
3109: 256. MBH Methanobrevibacter smithii 3 3 7221
3110: 257. MBI Methanobacterium thermoautotrophicum 10 7 26621
3111: 258. MBI Plasmid pME2001 1 1 1440
3112: 259. MBO Moraxella bovis 2 2 5044
3113: 260. MBO Moraxella lacunata 1 1 969
3114: 261. MBO Moraxella sp. 2 1 3034
3115: 262. MCL Mastigocladus laminosus 1 1 1701
3116: 263. MEC Micromonospora echinospora 1 1 398
3117: 264. MEF Methanothermus fervidus 9 8 18721
3118: 265. MEH Methanospirillum hungatei 1 1 295
3119: 266. MEN Methanolobus tindarius 1 1 128
3120: 267. MES Methanosarcina barkeri 5 3 13117
3121: 268. MLC Methylococcus capsulatus 1 1 2463
3122: 269. MLU Micrococcus luteus 10 9 19465
3123: 270. MLY Micrococcus lysodeikticus 1 1 166
3124: 271. MPL Mycoplasma-like organism 1 1 1535
3125: 272. MSG Mycobacterium bovis 7 6 7320
3126: 273. MSG Mycobacterium leprae 7 5 11027
3127: 274. MSG Mycobacterium tuberculosis 15 9 18387
3128: 275. MSG Plasmid pAL5000 1 1 4837
3129: 276. MTB Methylobacterium extorquens 1 1 4500
3130: 277. MTB Methylobacterium sp. 1 1 2791
3131: 278. MTB Methylobacterium specialis 2 1 2211
3132: 279. MTF Methylobacillus flagellatum 1 1 1349
3133: 280. MV1 Plasmid pMV158 1 1 2436
3134: 281. MVA Methanococcus vannielii 19 17 28753
3135: 282. MVO Methanococcus voltae 11 10 15241
3136: 283. MVT Methanococcus thermolithotrophicus 4 3 2820
3137: 284. MXA Myxococcus xanthus 16 15 27127
3138: 285. MXB Lysobacter enzymogenes 3 2 3218
3139: 286. MYC Mycoplasma capricolum 14 13 21175
3140: 287. MYC Mycoplasma hyopneumoniae 3 3 1928
3141: 288. MYC Mycoplasma mycoides 4 4 2716
3142: 289. MYC Mycoplasma sp. 41 37 51236
3143: 290. MYC Plasmid pADB201 1 1 1717
3144: 291. NAH Plasmid NAH7 (from P. putida) 6 5 3771
3145: 292. NAT Natronobacterium pharaonis 1 1 1015
3146: 293. NG2 Plasmid pNG2 1 1 1810
3147: 294. NGO Neisseria flavescens 1 1 1228
3148: 295. NGO Neisseria gonorrhoeae 63 55 50544
3149: 296. NGO Neisseria meningitidis 12 8 9940
3150: 297. NOC Nocardia mediterranei 3 3 450
3151: 298. NOS Nostoc commune 1 1 4241
3152: 299. NR1 Plasmid NR1 4 3 6463
3153: 300. NT1 Plasmid NTP1 2 2 1440
3154: 301. NT1 Plasmid NTP16 1 1 2730
3155: 302. P15 Plasmid P15A 2 2 1226
3156: 303. P18X Plasmid pACYC184 2 2 171
3157: 304. P23 Plasmid pMM2-3 2 2 182
3158: 305. P307 Plasmid P307 3 3 4629
3159: 306. P53 Plasmid pMM5-3 4 4 429
3160: 307. P55 Plasmid pMM5-5 4 4 420
3161: 308. PAC Plasmid P177 1 1 345
3162: 309. PAM Plasmid PAM177 1 1 1443
3163: 310. PAS Pasteurella haemolytica 4 3 15958
3164: 311. PAZ Plasmid pAZ1 1 1 808
3165: 312. PB0 Plasmid pUB110 9 8 12606
3166: 313. PB2 Plasmid pUB112 1 1 901
3167: 314. PBF4 Plasmid pBF4 1 1 1041
3168: 315. PBW Plasmid pBWH77 2 2 1623
3169: 316. PC1 Plasmid pC194 2 2 3946
3170: 317. PC2 Plasmid pC221 2 1 4555
3171: 318. PDE Paracoccus denitrificans 10 7 17422
3172: 319. PDG Plasmid pDGO100 2 2 3683
3173: 320. PDU Plasmid pDU1358 2 2 5076
3174: 321. PE1 Plasmid pE194 7 3 5039
3175: 322. PE2 Plasmid pED208 2 2 5640
3176: 323. PHL Plasmid pHly152 1 1 8215
3177: 324. PI25 Plasmid pI258 5 4 12140
3178: 325. PIJ Plasmid pIJ101 2 2 9188
3179: 326. PIP Plasmid pIP401 2 2 383
3180: 327. PIP Plasmid pIP630 1 1 1883
3181: 328. PIP11 Plasmid pIP1100 1 1 1386
3182: 329. PIP404 Plasmid pIP404 4 3 15188
3183: 330. PJH Plasmid pJH1 1 1 1489
3184: 331. PJM1 Plasmid pJM1 1 1 3581
3185: 332. PJR Plasmid PJR225 1 1 1527
3186: 333. PKL Plasmid pKLH1 1 1 160
3187: 334. PKL Plasmid pKLH102 2 2 351
3188: 335. PKL Plasmid pKLH104 1 1 131
3189: 336. PKL Plasmid pKLH2 2 2 674
3190: 337. PKL Plasmid pKLH201 1 1 153
3191: 338. PKM Plasmid pKM101 1 1 1797
3192: 339. PLB Plasmid pLB1 1 1 2190
3193: 340. PLM Plasmid pAA3.7X 3 1 9583
3194: 341. PLP Plasmid pSa 1 1 1447
3195: 342. PME Plasmid pMEA100 1 1 150
3196: 343. PMM Plasmid pMM110 1 1 240
3197: 344. PMO Plasmid pMON234 1 1 997
3198: 345. PNE Plasmid pNE131 2 1 2355
3199: 346. PNS Plasmid pNS1 1 1 3879
3200: 347. PNS Plasmid pNS1981 4 3 1819
3201: 348. PO2 Plasmid pOAD2 2 2 2914
3202: 349. PR1 Plasmid R1 13 10 7500
3203: 350. PR2 Plasmid R1126 1 1 428
3204: 351. PR6 Plasmid R6-5 2 1 858
3205: 352. PRC Plasmid R 1 1 1487
3206: 353. PRI Plasmid PRI13 2 1 2234
3207: 354. PRM Morganella morganii 2 2 1831
3208: 355. PRM Proteus mirabilis 7 6 16319
3209: 356. PRM Proteus vulgaris 13 8 14186
3210: 357. PRO Providencia sp. 1 1 1135
3211: 358. PRO Providencia stuartii 1 1 3889
3212: 359. PRS Propionibacterium shermanii 3 2 4951
3213: 360. PS1 Streptomyces lividans plasmid pS1 1 1 75
3214: 361. PSA Plasmid pSA2100 1 1 98
3215: 362. PSC Plasmid pSC101 16 10 16807
3216: 363. PSE Plasmid pCMS1 1 1 1322
3217: 364. PSE Pseudomonas aeruginosa 99 77 110225
3218: 365. PSE Pseudomonas amyloderamosa 5 2 4488
3219: 366. PSE Pseudomonas cepacia 2 2 5867
3220: 367. PSE Pseudomonas fluorescens 10 8 17694
3221: 368. PSE Pseudomonas fragi 2 2 1682
3222: 369. PSE Pseudomonas paucimobilis 1 1 1080
3223: 370. PSE Pseudomonas pseudoalcaligenes 1 1 2040
3224: 371. PSE Pseudomonas putida 28 26 68302
3225: 372. PSE Pseudomonas sp. 25 22 39293
3226: 373. PSE Pseudomonas syringae 11 10 27701
3227: 374. PSE Pseudomonas testosteroni 2 2 2435
3228: 375. PSE TOL Plasmid (from Pseudomonas putida) 11 7 9788
3229: 376. PSE Zoogloea ramigera 1 1 1524
3230: 377. PSM SYM megaplasmid(from R. meliloti) 9 8 5150
3231: 378. PSN Plasmid pSN2 1 1 1288
3232: 379. PT1 Plasmid pT181 7 4 5479
3233: 380. PTB Plasmid pTB913 1 1 1200
3234: 381. PWM Plasmid pWM5 1 1 569
3235: 382. PWP Plasmid pWP7b 1 1 1370
3236: 383. PWR Plasmid PWR60 1 1 4832
3237: 384. PYR Pyrodictium occultum 4 4 2077
3238: 385. PYW Pyrococcus woesi 2 1 124
3239: 386. R10 Plasmid R100 25 17 26157
3240: 387. R11 Plasmid R1162 5 5 2389
3241: 388. R12 Plasmid R124 1 1 272
3242: 389. R14 Plasmid R144 1 1 801
3243: 390. R26 Plasmid R26 1 1 1541
3244: 391. R27 Plasmid R27 1 1 1507
3245: 392. R36 Plasmid R386 1 1 441
3246: 393. R37 Plasmid R387 1 1 1160
3247: 394. R38 Plasmid R388 2 2 3204
3248: 395. R41 Plasmid R401 1 1 1857
3249: 396. R45 Plasmid R485 1 1 591
3250: 397. R46 Plasmid R46 3 3 2859
3251: 398. R48 Plasmid R483 1 1 1618
3252: 399. R53 Plasmid R538 3 2 1712
3253: 400. R65 Plasmid R65 2 2 1380
3254: 401. R67 Plasmid R67 1 1 293
3255: 402. R6K Plasmid R6K 7 6 1894
3256: 403. R75 Plasmid R751 4 4 1697
3257: 404. R77 Plasmid R773 1 1 4347
3258: 405. RA1 Plasmid RA1 1 1 758
3259: 406. RBH Plasmid pRBH1 2 2 1521
3260: 407. RBL Rhodopseudomonas acidophila 1 1 1491
3261: 408. RBL Rhodopseudomonas blastica 1 1 12368
3262: 409. RCA Rhodobacter capsulatus 32 26 52831
3263: 410. RDC Rhodocyclus purpureus 1 1 1478
3264: 411. REI Plasmid pRE-I 1 1 439
3265: 412. RER Rhodococcus erythropolis 2 1 2070
3266: 413. RER Rhodococcus fascians 2 1 121
3267: 414. RGN Plasmid RGN238 2 1 2427
3268: 415. RHA Azorhizobium caulinodans 4 2 3849
3269: 416. RHB Bradyrhizobium japonicum 23 18 34554
3270: 417. RHF Rhizobium fredii 1 1 2862
3271: 418. RHH Rhizobium phaseoli 4 4 3681
3272: 419. RHI Bradyrhizobium sp. 2 2 6665
3273: 420. RHI Rhizobium sp. 8 7 10894
3274: 421. RHJ Rhizobium japonicum 10 8 10225
3275: 422. RHL Plasmid pRL1JI 5 1 12055
3276: 423. RHL Rhizobium leguminosarum 13 13 19168
3277: 424. RHM Rhizobium meliloti 52 45 93945
3278: 425. RHR Rhizobium IRc78 2 2 2199
3279: 426. RHT Rhizobium trifolii 5 5 6886
3280: 427. RIA Plasmid Ri 1 1 21126
3281: 428. RIR Rickettsia conorii 1 1 539
3282: 429. RIR Rickettsia prowazekii 5 4 9706
3283: 430. RIR Rickettsia rickettsii 4 4 8555
3284: 431. RIR Rickettsia tsutsugamushi 2 2 5186
3285: 432. RIR Rickettsia typhi 2 2 1067
3286: 433. RIR Rochalimaea quintana 1 1 1493
3287: 434. RK2 Plasmid RK2 10 7 9952
3288: 435. RMV Rhodomicrobium vannielii 1 1 1484
3289: 436. ROS Roseburia cecicola 1 1 1031
3290: 437. RP1 Plasmid RP1 1 1 2709
3291: 438. RP4 Plasmid RP4 5 5 2997
3292: 439. RSF Plasmid RSF1010 7 7 10521
3293: 440. RSP Rhodospirillum rubrum 11 11 21804
3294: 441. RSS Rhodobacter sphaeroides 17 17 14217
3295: 442. RTS Plasmid Rts1 2 1 1855
3296: 443. RUA Ruminobacter amylophilus 2 1 2867
3297: 444. RUM Ruminococcus albus 1 1 2180
3298: 445. RVI Rhodopseudomonas viridis 5 4 3885
3299: 446. SA2 Plasmid pSAM2 3 3 866
3300: 447. SAC Sulfolobus acidocaldarius 6 5 15339
3301: 448. SAU Stigmatella aurantiaca 2 1 1300
3302: 449. SB2 Plasmid pSB24.2 2 2 7412
3303: 450. SCP Plasmid SCP1 1 1 2513
3304: 451. SE2 Plasmid pSE211 2 2 3017
3305: 452. SER Saccharopolyspora erythraea 10 7 8075
3306: 453. SHD Shigella dysenteriae 7 7 11010
3307: 454. SHF Plasmid pMYSH6000 1 1 4472
3308: 455. SHF Shigella flexneri 12 10 24401
3309: 456. SHS Shigella sonnei 11 11 6738
3310: 457. SLP1 Plasmid SLP1 3 3 630
3311: 458. SMA Serratia marcescens 31 25 35311
3312: 459. SMA Serratia sp. 1 1 2570
3313: 460. SME Spiroplasma melliferum 1 1 1510
3314: 461. SME Spiroplasma sp. 1 1 5025
3315: 462. SMY Plasmid pSL2 1 1 345
3316: 463. SMY1 Plasmid pSL1 2 2 633
3317: 464. SPA Spirochaeta aurantia 1 1 1257
3318: 465. SPO Sporolactobacillus laevis 1 1 118
3319: 466. SPO Sporosarcina ureae 2 1 116
3320: 467. SPU Spirulina platensis 1 1 5273
3321: 468. SSO Sulfolobus shibatae 1 1 1495
3322: 469. SSO Sulfolobus solfataricus 6 6 5873
3323: 470. SSP Sulfolobus sp. 7 4 19617
3324: 471. STA Plasmid pT48 1 1 2475
3325: 472. STA Staphylococcus aureus 75 56 85940
3326: 473. STA Staphylococcus carnosus 1 1 720
3327: 474. STA Staphylococcus epidermidis 1 1 423
3328: 475. STA Staphylococcus haemolyticus 1 1 1087
3329: 476. STA Staphylococcus hyicus 1 1 2212
3330: 477. STA Staphylococcus mutans 1 1 2288
3331: 478. STA Staphylococcus simulans 1 1 1486
3332: 479. STA Staphylococcus staphylolyticus 1 1 1825
3333: 480. STM Streptomyces antibioticus 1 1 1567
3334: 481. STM Streptomyces avidinii 1 1 638
3335: 482. STM Streptomyces azureus 1 1 1521
3336: 483. STM Streptomyces clavuligerus 6 4 5745
3337: 484. STM Streptomyces coelicolor 15 12 17749
3338: 485. STM Streptomyces fradiae 6 6 11455
3339: 486. STM Streptomyces glaucescens 6 4 6431
3340: 487. STM Streptomyces griseus 16 12 25048
3341: 488. STM Streptomyces hygroscopicus 7 7 6374
3342: 489. STM Streptomyces lavendulae 2 2 2078
3343: 490. STM Streptomyces limosus 1 1 2291
3344: 491. STM Streptomyces lividans 25 20 14900
3345: 492. STM Streptomyces plicatus 2 2 1245
3346: 493. STM Streptomyces rochei 4 4 2390
3347: 494. STM Streptomyces sp. 47 39 49732
3348: 495. STM Streptomyces thermotolerans 1 1 1260
3349: 496. STM Streptomyces vinaceus 1 1 1119
3350: 497. STR Plasmid pAM-beta-1 3 3 6001
3351: 498. STR Plasmid pMK157 1 1 1920
3352: 499. STR Streptococcus equisimilis 1 1 2568
3353: 500. STR Streptococcus faecalis 6 6 8343
3354: 501. STR Streptococcus lactis 8 7 8222
3355: 502. STR Streptococcus mutans 15 13 46945
3356: 503. STR Streptococcus pneumoniae 49 34 55746
3357: 504. STR Streptococcus pyogenes 24 18 36619
3358: 505. STR Streptococcus sanguis 3 3 8094
3359: 506. STR Streptococcus sobrinus 1 1 4995
3360: 507. STR Streptococcus sp. 17 14 31406
3361: 508. STV Streptoverticillum sp. 1 1 1130
3362: 509. STY Plasmid R1767 1 1 1519
3363: 510. STY Plasmid R64 1 1 482
3364: 511. STY Salmonella infantis 1 1 3430
3365: 512. STY Salmonella potsdam 1 1 1727
3366: 513. STY Salmonella rubislaw 1 1 1479
3367: 514. STY Salmonella sp. 7 6 6050
3368: 515. STY Salmonella typhimurium 174 142 204225
3369: 516. SYC Synechocystis sp. 15 11 18768
3370: 517. SYN Synechococcus sp. 15 15 38283
3371: 518. TBA Thermophilic bacterium 6 3 11617
3372: 519. TDT Trichodesmium thiebautii 1 1 357
3373: 520. TFE Plasmid pTF-FC2 1 1 329
3374: 521. TFE Thiobacillus acidophilus 9 4 599
3375: 522. TFE Thiobacillus ferrooxidans 9 7 14624
3376: 523. TFE Thiobacillus sp. 1 1 2172
3377: 524. THA Thermoplasma acidophilum 3 3 7703
3378: 525. THC Thermococcus celer 2 2 1312
3379: 526. THF Thermomonospora fusca 1 1 264
3380: 527. THP Thermofilum pendens 1 1 240
3381: 528. THR Thermomicrobium roseum 1 1 1528
3382: 529. TIP Plasmid pTiAch5 1 1 1164
3383: 530. TIP Plasmid pTiAg162 1 1 420
3384: 531. TIP Plasmid pTiB6S3 1 1 4203
3385: 532. TIP Plasmid pTiC58 4 4 5214
3386: 533. TIP Plasmid Ti (from A. tumefaciens) 61 52 120107
3387: 534. TMO Thermotoga maritima 2 2 2763
3388: 535. TRN Transposon gamma-delta 6 3 1092
3389: 536. TRN Transposon Tn10 1 1 830
3390: 537. TRN Transposon Tn21 1 1 1333
3391: 538. TRN Transposon Tn2501 1 1 480
3392: 539. TRN Transposon Tn3 2 2 389
3393: 540. TRN Transposon Tn3411 2 1 2925
3394: 541. TRN Transposon Tn4521 1 1 1315
3395: 542. TRN Transposon Tn5 1 1 2040
3396: 543. TRN Transposon Tn501 1 1 86
3397: 544. TRN Transposon Tn602 4 4 639
3398: 545. TRN10 Transposon Tn10 11 4 6024
3399: 546. TRN15 Transposon Tn1525 1 1 1721
3400: 547. TRN16 Transposon Tn1681 1 1 658
3401: 548. TRN17 Transposon Tn1721 8 8 1797
3402: 549. TRN1771 Transposon Tn1771 3 3 348
3403: 550. TRN21 Transposon Tn21 4 4 6671
3404: 551. TRN25 Transposon Tn2501 1 1 1539
3405: 552. TRN26 Transposon Tn2680 1 1 194
3406: 553. TRN3 Transposon Tn3 10 8 6351
3407: 554. TRN34 Transposon Tn3411 1 1 1321
3408: 555. TRN43 Transposon Tn4351 2 1 1982
3409: 556. TRN431 Transposon Tn431 3 3 2405
3410: 557. TRN4551 Transposon Tn4551 1 1 2080
3411: 558. TRN4556 Transposon Tn4556 2 2 86
3412: 559. TRN5 Transposon Tn5 9 6 4978
3413: 560. TRN501 Transposon Tn501 4 4 7310
3414: 561. TRN554 Transposon Tn554 5 1 6691
3415: 562. TRN7 Transposon Tn7 7 7 7535
3416: 563. TRN9 Transposon Tn9 2 2 1362
3417: 564. TRN903 Transposon Tn903 6 6 6118
3418: 565. TRN916 Transposon Tn916 1 1 1740
3419: 566. TRN917 Transposon Tn917 3 3 6353
3420: 567. TRNCAM Transposon Tn-Cam204 1 1 921
3421: 568. TRP Treponema pallidum 7 6 8045
3422: 569. TTE Thermoproteus tenax 8 8 4399
3423: 570. TTH Thermus aquaticus 6 5 7004
3424: 571. TTH Thermus caldophilus 1 1 1229
3425: 572. TTH Thermus flavus 1 1 1771
3426: 573. TTH Thermus thermophilus 23 15 28221
3427: 574. TTV Thermoproteus tenax virus 1 2 1 13669
3428: 575. URE Ureaplasma urealyticum 2 2 3982
3429: 576. VCH Vibrio cholerae 17 15 18430
3430: 577. VI1 Plasmid pVI150 1 1 972
3431: 578. VIB Aeromonas hydrophila 7 6 7237
3432: 579. VIB Aeromonas sobria 2 1 2510
3433: 580. VIB Photobacterium leiognathi 5 4 10047
3434: 581. VIB Photobacterium sp. 5 5 8715
3435: 582. VIB Vibrio alginolyticus 5 4 12901
3436: 583. VIB Vibrio anguillarum 1 1 4379
3437: 584. VIB Vibrio fischeri 4 4 5791
3438: 585. VIB Vibrio harveyi 12 11 14283
3439: 586. VIB Vibrio parahaemolyticus 2 2 2861
3440: 587. VIB Vibrio sp. 3 2 1390
3441: 588. VIT Vitreoscilla sp. 2 1 689
3442: 589. VIT Vitreoscilla stercoraria 1 1 745
3443: 590. VVU Vibrio vulnificus 1 1 2237
3444: 591. W10 Plasmid pWR100 2 1 4761
3445: 592. WOL Wolinella succinogenes 1 1 91
3446: 593. WP1 Plasmid pWP113a 1 1 1316
3447: 594. WP1 Plasmid pWP116a 1 1 1336
3448: 595. WP1 Plasmid pWP14a 1 1 1336
3449: 596. XAA Xanthobacter autotrophicus 1 1 3041
3450: 597. XAN Xanthomonas campestris 3 3 4683
3451: 598. XEN Xenorhabdus luminescens 1 1 2553
3452: 599. YEP Plasmid pYV03 2 1 3316
3453: 600. YEP Yersinia bercovieri 2 2 257
3454: 601. YEP Yersinia enterocolitica 26 24 18913
3455: 602. YEP Yersinia pestis 4 4 4462
3456: 603. YEP Yersinia pseudotuberculosis 10 8 15439
3457: 604. ZMO Zymomonas mobilis 9 8 13565
3458:
3459: Total 5528 4293 6992664
3460:
3461: STRUCTURAL RNA
3462:
3463: Key Name Reports Entries Bases
3464: -------------------------------------------------------------------------------
3465: 1. AAU Auricularia auricula-judae 1 1 118
3466: 2. ABC Acetobacter sp. 2 2 236
3467: 3. ACA Acanthamoeba castellanii 3 3 400
3468: 4. ACC Acinetobacter calcoaceticus 2 2 1652
3469: 5. ACH Achromobacter cycloclastes 1 1 120
3470: 6. ACH Achromobacter xylosoxidans 1 1 114
3471: 7. ACL Acholeplasma entomophilum 1 1 1476
3472: 8. ACL Acholeplasma modicum 1 1 1473
3473: 9. ACN Actinobacillus actinomycetemcomitans 3 3 494
3474: 10. ACN Actinobacillus equuli 2 2 445
3475: 11. ACN Actinobacillus hominis 3 3 494
3476: 12. ACN Actinobacillus lignieresii 3 3 1931
3477: 13. ACS Avian sarcoma virus 1 1 75
3478: 14. ACY Actinomyces bovis 1 1 1368
3479: 15. ACY Actinomyces israelii 2 2 1879
3480: 16. ACY Actinomyces naeslundii 1 1 1378
3481: 17. ACY Actinomyces odontolyticus 1 1 1359
3482: 18. ACY Actinomyces pyogenes 2 1 1361
3483: 19. ACY Actinomyces viscosus 1 1 1351
3484: 20. AED Agaricus edulis 1 1 118
3485: 21. AEQ Actinia equina 2 1 120
3486: 22. AFA Alcaligenes eutrophus 1 1 1511
3487: 23. AFA Alcaligenes faecalis 6 6 3410
3488: 24. AKK Akkesiphycus lubricum 2 1 118
3489: 25. ALF Medicago sativa 1 1 119
3490: 26. ALL Asteroleplasma anaerobium 1 1 1471
3491: 27. ALR Rous sarcoma virus 1 1 75
3492: 28. AMG Acyrthosiphon magnoliae 2 2 281
3493: 29. AMO Amoebidium parasiticum 1 1 119
3494: 30. AMP Amoeba proteus 1 1 419
3495: 31. ANC Ancylobacter aquaticus 1 1 117
3496: 32. ANI Anacystis nidulans 4 4 371
3497: 33. ANM Anisodoris nobilis 6 3 994
3498: 34. ANP Anaeroplasma abactoclasticum 1 1 1453
3499: 35. ANP Anaeroplasma bactoclasticum 1 1 1436
3500: 36. ANP Anaeroplasma varium 1 1 1436
3501: 37. APE Acremonium persicinum 1 1 119
3502: 38. APL Aplysia kurodai 1 1 119
3503: 39. APN Anthoceros punctatus 1 1 118
3504: 40. APR Antheraea pernyi 1 1 120
3505: 41. APU Aeromonas punctata 1 1 109
3506: 42. AQU Agmenellum quadruplicatum 1 1 76
3507: 43. ARB Arbacia punctulata 9 3 1049
3508: 44. ARG Arthrobacter globiformis 4 3 1774
3509: 45. ARG Arthrobacter luteus 1 1 122
3510: 46. ARG Arthrobacter oxidans 2 1 121
3511: 47. ARG Arthrobacter sp. 2 1 121
3512: 48. ARN Argulus nobilis 1 1 1843
3513: 49. ARO Arhodomonas oleiferhydrans 1 1 1487
3514: 50. ARU Arundinaria gigantea 1 1 50
3515: 51. ASC Acinetospora crinita 2 1 118
3516: 52. ASE Aquaspirillum serpens 1 1 116
3517: 53. ASF Aspergillus flavus 1 1 119
3518: 54. ASG Aspergillus niger 1 1 119
3519: 55. ASN Aspergillus nidulans 4 4 476
3520: 56. AST Avena sativa 1 1 50
3521: 57. ATT Atractiella solani 1 1 119
3522: 58. ATU Agrobacterium tumefaciens 1 1 120
3523: 59. AUT Aureobacterium testaceum 1 1 120
3524: 60. AVI Azotobacter vinelandii 1 1 120
3525: 61. AXY Amphibacillus xylanus 2 1 116
3526: 62. BAC Bacillus acidocaldarius 1 1 117
3527: 63. BAE Batrachospermum ectocarpum 1 1 121
3528: 64. BAS Basidiobolus magnus 1 1 120
3529: 65. BBR Bacillus brevis 2 2 1674
3530: 66. BDE Bdellovibrio stolpii 1 1 1553
3531: 67. BEG Beggiatoa alba 1 1 120
3532: 68. BFI Bacillus firmus 1 1 116
3533: 69. BGA Blue Green Algae 1 1 76
3534: 70. BGL Bacillus globigii 1 1 116
3535: 71. BHA Beneckea harveyi 1 1 122
3536: 72. BJA Blepharisma japonicum 2 2 476
3537: 73. BLI Bacillus licheniformis 1 1 116
3538: 74. BLK Blakeslea trispora 1 1 120
3539: 75. BLT Blastobacter viscosus 1 1 118
3540: 76. BLY Hordeum vulgare 9 7 487
3541: 77. BME Bacillus megaterium 1 1 116
3542: 78. BMO Bombyx mori 14 13 1362
3543: 79. BNA Brassica napus 2 2 196
3544: 80. BNC Bacteroides asaccharolyticus 1 1 48
3545: 81. BNG Bacteroides gingivalis 1 1 53
3546: 82. BNI Bacteroides intermedius 1 1 52
3547: 83. BNO Bacteroides nodosus 1 1 1532
3548: 84. BOV Bos taurus 21 18 1426
3549: 85. BPA Bacillus pasteurii 1 1 117
3550: 86. BPL Brachionus plicatilis 1 1 121
3551: 87. BRA Branchiostoma belcheri 1 1 120
3552: 88. BRA Branchiostoma californiense 6 3 974
3553: 89. BRL Brevibacterium helvolum 2 1 120
3554: 90. BRL Brevibacterium linens 1 1 123
3555: 91. BRP Brugia pahangi 1 1 363
3556: 92. BRU Brucella abortus 2 1 1429
3557: 93. BSI Blastocladiella simplex 1 1 118
3558: 94. BST Bacillus stearothermophilus 10 10 845
3559: 95. BSU Bacillus subtilis 16 14 1153
3560: 96. BVO Bresslaua vorax 1 1 120
3561: 97. BVU Beta vulgaris 1 1 120
3562: 98. CAI Capniomyces stellatus 1 1 121
3563: 99. CAO Carpopeltis crispata 1 1 121
3564: 100. CAU Caulobacter spinosum 1 1 117
3565: 101. CBC Caseobacter polymorphus 1 1 121
3566: 102. CCI Coprinus cinereus 1 1 118
3567: 103. CCO Crypthecodinium cohnii 4 4 492
3568: 104. CDB Cardiobacterium hominis 1 1 1470
3569: 105. CEL Caenorhabditis elegans 9 7 713
3570: 106. CET Ceratobasidium cornigerum 1 1 118
3571: 107. CFI Cellulomonas biazotea 1 1 120
3572: 108. CHA Chaetopterus sp. 6 3 975
3573: 109. CHB Chlorobium limicola 2 2 1615
3574: 110. CHB Chlorobium phaeobacteroides 1 1 110
3575: 111. CHF Chordaria flagelliformis 2 1 118
3576: 112. CHH Chaetomorpha moniligera 1 1 120
3577: 113. CHK Gallus gallus 25 23 2774
3578: 114. CHL Chlorella pyrenoidosa 2 1 119
3579: 115. CHL Chlorella sp. 5 3 2082
3580: 116. CHO Chilomonas paramecium 1 1 124
3581: 117. CHR Chromobacterium fluviatile 1 1 1473
3582: 118. CHR Chromobacterium violaceum 1 1 1475
3583: 119. CHS Christiansenia pallida 1 1 120
3584: 120. CLL Callinectes sapidus 1 1 1861
3585: 121. CLM Spisula solidissima 6 3 937
3586: 122. CLO Clostridium aminovalericum 2 1 1554
3587: 123. CLO Clostridium barkeri 2 1 1527
3588: 124. CLO Clostridium bifermentans 1 1 117
3589: 125. CLO Clostridium butyricum 2 2 234
3590: 126. CLO Clostridium carnis 2 1 117
3591: 127. CLO Clostridium pasteurianum 4 3 1745
3592: 128. CLO Clostridium ramosum 1 1 1530
3593: 129. CLO Clostridium sticklandii 2 2 1501
3594: 130. CLO Clostridium tyrobutyricum 4 4 464
3595: 131. COE Coemansia mojavensis 1 1 120
3596: 132. COR Corynebacterium aquaticum 1 1 120
3597: 133. COR Corynebacterium glutamicum 1 1 121
3598: 134. COR Corynebacterium sp. 2 1 1366
3599: 135. COR Corynebacterium xerosis 2 2 243
3600: 136. COT Gossypium hirsutum 1 1 118
3601: 137. COX Coxiella burnetii 2 1 1484
3602: 138. CPA Cyanophora paradoxa 3 3 356
3603: 139. CRA Coprinus radiatus 1 1 118
3604: 140. CRB Limulus polyphemus 6 3 977
3605: 141. CRE Chlamydomonas reinhardtii 3 3 399
3606: 142. CRE Chlamydomonas sp. 1 1 118
3607: 143. CRS Cryptochiton stelleri 6 3 923
3608: 144. CTU Coleosporium tussilaginis 1 1 118
3609: 145. CUN Cunninghamella elegans 1 1 120
3610: 146. CUR Curtobacterium citreum 1 1 122
3611: 147. CVN Chromatium vinosum 1 1 1526
3612: 148. CYR Cycas revoluta 1 1 120
3613: 149. CYT Cytophaga aquatilis 1 1 111
3614: 150. CYT Cytophaga heparina 1 1 114
3615: 151. CYT Cytophaga johnsonae 1 1 116
3616: 152. DAC Dryopteris acuminata 1 1 121
3617: 153. DDE Dacrymyces deliquescens 1 1 118
3618: 154. DDI Dictyostelium discoideum 7 6 1170
3619: 155. DIT Diatoma tenue 1 1 118
3620: 156. DJA Dugesia japonica 1 1 120
3621: 157. DJA Dugesia tigrina 6 3 962
3622: 158. DOG Canis lupus 1 1 149
3623: 159. DOG Canis sp. 2 2 191
3624: 160. DPS Dipsacomyces acuminosporus 1 1 119
3625: 161. DRO Drosophila melanogaster 42 36 5318
3626: 162. DSA Desulfuromonas acetoxidans 1 1 1522
3627: 163. DSM Desulfomonile tiedjei 1 1 1505
3628: 164. DSP Desulfobacter postgatei 1 1 1519
3629: 165. DSV Desulfosarcina variabilis 1 1 1527
3630: 166. DUK Cairina moschata 1 1 78
3631: 167. DVU Desulfovibrio vulgaris 1 1 120
3632: 168. EAL Enchytraeus albidus 1 1 120
3633: 169. EAR Equisetum arvense 2 1 120
3634: 170. EBI Eisenia bicyclis 1 1 118
3635: 171. ECO Escherichia coli 148 116 14668
3636: 172. EFI Efibulobasidium albescens 1 1 118
3637: 173. EGR Euglena gracilis 4 4 391
3638: 174. EHP Ectothiorhodospira halophila 1 1 1494
3639: 175. EIK Eikenella corrodens 4 4 5933
3640: 176. EJA Entosphenus japonicus 2 2 241
3641: 177. EMP Emplectonema gracile 2 2 239
3642: 178. ERL Erythrobacter longus 1 1 119
3643: 179. ERP Protomonas extorquens 1 1 116
3644: 180. ERY Erysipelothrix rhusiopathiae 1 1 1487
3645: 181. ESE Endophyllum sempervivi 1 1 118
3646: 182. ESP Euphausia sperba 1 1 75
3647: 183. EUT Eucidaris tribuloides 6 3 923
3648: 184. EVA Exobasidium vaccinii 1 1 118
3649: 185. EWO Euplotes woodruffi 1 1 120
3650: 186. EXI Exidia glandulosa 1 1 118
3651: 187. FAE Faenia rectivirgula 1 1 1246
3652: 188. FBC Flexibacter sp. 1 1 117
3653: 189. FSB Misgurnus fossilis 3 3 399
3654: 190. FSB Oncorhynchus keta 1 1 75
3655: 191. FSB Salmo gairdneri 2 2 282
3656: 192. FSO Fusarium culmorum 2 2 387
3657: 193. FSO Fusarium decemcellulare 6 6 1161
3658: 194. FSO Fusarium graminearum 2 2 387
3659: 195. FSO Fusarium javanicum 4 4 768
3660: 196. FSO Fusarium moniliforme 6 6 1159
3661: 197. FSO Fusarium nivale 2 2 384
3662: 198. FSO Fusarium oxysporum 3 3 1010
3663: 199. FSO Fusarium solani 4 4 767
3664: 200. FVB Flavobacterium sp. 1 1 121
3665: 201. GBI Ginkgo biloba 1 1 120
3666: 202. GCL Gymnosporangium clavariaeforme 1 1 118
3667: 203. GCO Gracilaria compressa 3 2 242
3668: 204. GEA Gelidium amansii 2 2 241
3669: 205. GEM Gemmata obscuriglobus 1 1 108
3670: 206. GEN Genistelloides hibernus 1 1 122
3671: 207. GLA Giardia lamblia 1 1 127
3672: 208. GLC Gloiopeltis complanata 1 1 120
3673: 209. GOL Golfingia gouldii 6 3 973
3674: 210. GRA Graphiola phoenicis 1 1 118
3675: 211. HAL Halobacterium volcanii 56 46 5074
3676: 212. HAM Mesocricetus sp. 2 1 94
3677: 213. HAP Halichondria panicea 1 1 120
3678: 214. HAZ Haemophilus aphrophilus 3 3 494
3679: 215. HCU Halobacterium cutirubrum 15 13 1050
3680: 216. HDI Hymenolepis diminuta 2 2 215
3681: 217. HEA Haemophilus aegypticus 1 1 116
3682: 218. HEI Haemophilus influenzae 3 3 1917
3683: 219. HJA Halichondria japonica 1 1 120
3684: 220. HLF Haloferax mediterranei 2 1 123
3685: 221. HMO Halococcus morrhuae 2 2 309
3686: 222. HOC Haliclona oculata 1 1 120
3687: 223. HPT Herpetosiphon aurantiacus 1 1 117
3688: 224. HRO Halocynthia roretzi 2 1 119
3689: 225. HSA Hymeniacidon sanguinea 2 2 276
3690: 226. HUM Homo sapiens 51 45 6394
3691: 227. HYD Hydra sp. 6 3 966
3692: 228. HYF Hydrurus foetidus 1 1 118
3693: 229. HYV Hyphomicrobium sp. 1 1 119
3694: 230. HYV Hyphomicrobium vulgare 1 1 119
3695: 231. IGU Iguana iguana 1 1 120
3696: 232. ISO Isosphaera pallida 1 1 111
3697: 233. JLA Aurelia aurita 2 2 240
3698: 234. JLC Chrysaora quinquecirrha 1 1 120
3699: 235. JLN Nemopsis dofleini 1 1 120
3700: 236. JLS Spirocodon saltatrix 1 1 121
3701: 237. KAB Kabatiella microsticta 1 1 120
3702: 238. KIN Kingella denitrificans 1 1 1475
3703: 239. KIN Kingella indologenes 1 1 1474
3704: 240. KIN Kingella kingae 1 1 1476
3705: 241. LAE Listonella aestuarianus 1 1 119
3706: 242. LAN Lingula anatina 1 1 119
3707: 243. LAN Lingula reevi 6 3 919
3708: 244. LAP Lamprometra palmata 9 3 1044
3709: 245. LBK Lactobacillus kandleri 2 1 1528
3710: 246. LBM Lactobacillus minor 2 1 1524
3711: 247. LBR Lactobacillus brevis 1 1 117
3712: 248. LBT Lactobacillus halotolerans 2 1 1529
3713: 249. LCA Lactobacillus casei 2 1 1574
3714: 250. LCA Lactobacillus catenaforme 1 1 1549
3715: 251. LCO Lactobacillus confusus 2 1 1525
3716: 252. LEI Leishmania enriettii 1 1 68
3717: 253. LEU Leuconostoc cremoris 2 1 1493
3718: 254. LEU Leuconostoc lactis 2 1 1499
3719: 255. LEU Leuconostoc mesenteroides 2 1 1554
3720: 256. LEU Leuconostoc oenos 2 1 1510
3721: 257. LEU Leuconostoc paramesenteroides 2 1 1524
3722: 258. LGE Lineus geniculatus 1 1 120
3723: 259. LHE Lophocolea heterophylla 1 1 119
3724: 260. LND Linderina macrospora 1 1 119
3725: 261. LPN Fluoribacter bozemanae 6 6 796
3726: 262. LPN Fluoribacter dumoffii 6 6 825
3727: 263. LPN Fluoribacter gormanii 3 3 385
3728: 264. LPN Legionella pneumophila 11 10 1252
3729: 265. LSY Leptosynapta inhaerens 6 3 1051
3730: 266. LTT Leptothrix discophora 1 1 117
3731: 267. LUM Lumbricus sp. 6 3 976
3732: 268. LUP Lupinus luteus 5 5 380
3733: 269. LVI Lactobacillus viridescens 4 3 1816
3734: 270. LVI Lactobacillus vitulinus 1 1 1477
3735: 271. LYC Lycopodium clavatum 1 1 121
3736: 272. LYO Lycoperdon pyriforme 1 1 118
3737: 273. MAG Methylomonas agile 1 1 119
3738: 274. MAG Methylomonas methanica 2 2 1400
3739: 275. MAG Methylomonas rubra 1 1 119
3740: 276. MBF Methanobacterium formicicum 1 1 1476
3741: 277. MBI Methanobacterium thermoautotrophicum 4 4 415
3742: 278. MES Methanosarcina barkeri 1 1 130
3743: 279. MET Metridium senile 6 3 963
3744: 280. MGL Metasequoia glyptostroboides 1 1 120
3745: 281. MJU Microstroma juglandis 1 1 121
3746: 282. MLC Methylococcus capsulatus 3 3 1469
3747: 283. MLM Moloney murine leukemia virus 1 1 74
3748: 284. MLU Micrococcus luteus 2 2 238
3749: 285. MLY Micrococcus lysodeikticus 1 1 120
3750: 286. MNI Mnium rugicum 2 1 157
3751: 287. MOR Mortierella formosensis 1 1 120
3752: 288. MPO Marchantia polymorpha 1 1 119
3753: 289. MSE Megasphaera elsdenii 1 1 1567
3754: 290. MSG Mycobacterium asiaticum 2 1 1368
3755: 291. MSG Mycobacterium aurum 2 1 1349
3756: 292. MSG Mycobacterium avium 4 2 2735
3757: 293. MSG Mycobacterium chelonei 2 1 1355
3758: 294. MSG Mycobacterium chitae 2 1 1359
3759: 295. MSG Mycobacterium fallax 2 1 1348
3760: 296. MSG Mycobacterium flavescens 2 1 1357
3761: 297. MSG Mycobacterium gordonae 2 1 1373
3762: 298. MSG Mycobacterium kansasii 2 1 1369
3763: 299. MSG Mycobacterium leprae 1 1 313
3764: 300. MSG Mycobacterium neoaurum 2 1 1354
3765: 301. MSG Mycobacterium nonchromogenicum 2 1 1376
3766: 302. MSG Mycobacterium paratuberculosis 2 1 1367
3767: 303. MSG Mycobacterium phlei 2 1 1357
3768: 304. MSG Mycobacterium senegalense 2 1 1356
3769: 305. MSG Mycobacterium smegmatis 1 1 77
3770: 306. MSG Mycobacterium sp. 4 2 2715
3771: 307. MSG Mycobacterium terrae 2 1 1363
3772: 308. MSG Mycobacterium thermoresistible 2 1 1359
3773: 309. MSG Mycobacterium triviale 2 1 1351
3774: 310. MSG Mycobacterium tuberculosis 1 1 116
3775: 311. MSL Mytilus edulis 1 1 119
3776: 312. MTB Methylobacterium extorquens 2 2 1471
3777: 313. MTB Methylobacterium organophilum 2 2 1431
3778: 314. MTB Methylobacterium sp. 1 1 1052
3779: 315. MTE Methylosporovibrio methanica 1 1 1306
3780: 316. MUS Mus musculus 40 40 4555
3781: 317. MYA Mya arenaria 6 3 927
3782: 318. MYC Mycoplasma capricolum 3 3 259
3783: 319. MYC Mycoplasma hyopneumoniae 3 3 1799
3784: 320. MYC Mycoplasma mycoides 7 6 1885
3785: 321. MYC Mycoplasma sp. 24 24 34006
3786: 322. MYL Methylosinus trichosporium 2 2 1575
3787: 323. MYM Methylophilus methylotrophus 2 2 1619
3788: 324. MYP Methylocystis parvus 2 2 1433
3789: 325. MZE Zea mays 1 1 50
3790: 326. NDU Nematospiroides dubius 1 1 360
3791: 327. NEC Nectria haematococca 6 6 1152
3792: 328. NEM Ascaris suum 22 22 1251
3793: 329. NEU Neurospora crassa 6 6 1100
3794: 330. NGO Neisseria denitrificans 1 1 1478
3795: 331. NGO Neisseria gonorrhoeae 1 1 1486
3796: 332. NIF Nitella flexilis 1 1 121
3797: 333. NIT Nitrobacter winogradskyi 1 1 117
3798: 334. OCE Oceanospirillum linum 1 1 1542
3799: 335. ONG Onchocerca gibsoni 1 1 363
3800: 336. OPW Ophiocoma wendtii 9 3 1036
3801: 337. PAE Palaemonetes kadiakensis 1 1 1877
3802: 338. PAR Paramecium caudatum 1 1 366
3803: 339. PAR Paramecium primaurelia 1 1 366
3804: 340. PAR Paramecium tetraurelia 1 1 120
3805: 341. PAS Pasteurella multocida 3 3 1926
3806: 342. PBL Phycomyces blakesleeanus 1 1 120
3807: 343. PBR Perinereis brevicirris 1 1 120
3808: 344. PCL Prochloron sp. 1 1 122
3809: 345. PCR Philosamia cynthia ricini 2 2 289
3810: 346. PDE Paracoccus denitrificans 1 1 117
3811: 347. PEA Pisum sativum 8 8 824
3812: 348. PEC Penicillium chrysogenum 1 1 119
3813: 349. PEP Penicillium patulum 1 1 119
3814: 350. PEU Penaeus aztecus 1 1 1902
3815: 351. PFA Plasmodium falciparum 1 1 78
3816: 352. PGO Phascolopsis gouldii 1 1 120
3817: 353. PHS Phasianus colchicus 1 1 95
3818: 354. PHV Phaseolus vulgaris 1 1 75
3819: 355. PHY Pythium hydnosporum 1 1 118
3820: 356. PIL Pilayella littoralis 1 1 118
3821: 357. PIR Phlyctochytrium irregulare 1 1 118
3822: 358. PIS Pimelobacter simplex 1 1 120
3823: 359. PIV Pivellula marina 2 1 2885
3824: 360. PLA Platygloea peniophorae 1 1 119
3825: 361. PLC Planococcus citreus 2 1 116
3826: 362. PLC Planococcus kocurii 2 1 116
3827: 363. PLE Phleogena faginea 1 1 119
3828: 364. PLL Pirella marina 1 1 110
3829: 365. PLL Pirella sp. 2 2 222
3830: 366. PLT Planctomyces brasiliensis 1 1 110
3831: 367. PLT Planctomyces limnophilus 1 1 111
3832: 368. PLT Planctomyces staleyi 1 1 1525
3833: 369. PMC Pneumocystis carinii 1 1 120
3834: 370. PMI Prorocentrum micans 1 1 364
3835: 371. PNC Pseudonocardia thermophila 1 1 1246
3836: 372. PNU Psilotum nudum 2 2 171
3837: 373. POC Procaris ascensionis 1 1 1874
3838: 374. POO Prosthecochloris aestuarii 1 1 110
3839: 375. POR Porocephalus crotali 1 1 1830
3840: 376. POS Pleurotus ostreatus 1 1 118
3841: 377. PPO Puccinia poarum 1 1 118
3842: 378. PRA Procambarus leonensis 1 1 1869
3843: 379. PRE Planocera reticulata 1 1 120
3844: 380. PRM Proteus vulgaris 4 4 1925
3845: 381. PSE Pseudomonas aeruginosa 2 2 1637
3846: 382. PSE Pseudomonas cepacia 2 2 1589
3847: 383. PSE Pseudomonas fluorescens 2 1 120
3848: 384. PSE Pseudomonas sp. 1 1 118
3849: 385. PT4 Bacteriophage T4 17 12 979
3850: 386. PT5 Bacteriophage T5 9 9 711
3851: 387. PTE Porphyra tenera 1 1 121
3852: 388. PTR Plagiomnium trichomanes 1 1 119
3853: 389. PYE Porphyra yezoensis 1 1 121
3854: 390. QUL Coturnix coturnix 1 1 136
3855: 391. RAB Oryctolagus cuniculus 15 11 4955
3856: 392. RAT Rattus norvegicus 60 45 6084
3857: 393. RAT Rattus rattus 4 4 230
3858: 394. RCA Rhodobacter capsulatus 2 2 235
3859: 395. RCY Russula cyanoxantha 1 1 119
3860: 396. REC Renobacter vacuolatum 1 1 116
3861: 397. RER Rhodococcus equi 2 1 1360
3862: 398. RER Rhodococcus erythropolis 1 1 121
3863: 399. RHC Rhizoctonia crocorum 1 1 119
3864: 400. RHZ Rhizoctonia hiemalis 1 1 119
3865: 401. RIC Oryza sativa 2 2 417
3866: 402. RIF Riftia pachyptila 6 3 929
3867: 403. RIR Rickettsia rickettsii 2 1 1443
3868: 404. RIR Rickettsia typhi 2 1 1444
3869: 405. RMA Rhodopseudomonas marina 1 1 1417
3870: 406. RPA Rhodopseudomonas palustris 1 1 119
3871: 407. RRU Rhodospirillum rubrum 2 2 161
3872: 408. RSP Rhodospirillum rubrum 1 1 1446
3873: 409. RSS Rhodobacter sphaeroides 1 1 115
3874: 410. RTO Rhabditis tokai 1 1 119
3875: 411. RYE Secale cereale 3 3 356
3876: 412. SAC Sulfolobus acidocaldarius 2 2 204
3877: 413. SAG Schizochytrium aggregatum 1 1 119
3878: 414. SAH Saccharum officinarum 1 1 50
3879: 415. SAU Stigmatella aurantiaca 2 2 239
3880: 416. SCC Scyliorhinus caniculus 1 1 120
3881: 417. SCL Styela clava 6 3 967
3882: 418. SCM Schistosoma mansoni 2 2 215
3883: 419. SCS Saccharopolyspora hirsuta 1 1 1284
3884: 420. SCU Thyone briareus 6 3 1059
3885: 421. SEP Septobasidium carestianum 1 1 119
3886: 422. SFE Saprolegnia ferax 1 1 118
3887: 423. SFU Sargassum fulvellum 1 1 118
3888: 424. SHE Shewanella hanedai 2 1 120
3889: 425. SHP Ovis sp. 1 1 76
3890: 426. SHR Artemia salina 2 2 282
3891: 427. SJA Sabellastarte japonica 1 1 120
3892: 428. SLI Synechococcus lividus 1 1 120
3893: 429. SLM Physarum polycephalum 6 5 845
3894: 430. SME Spiroplasma sp. 11 11 14826
3895: 431. SMI Smittium culisetae 1 1 121
3896: 432. SNL Arion rufus 3 2 276
3897: 433. SNL Helix pomatia 1 1 119
3898: 434. SOB Scenedesmus obliquus 5 5 407
3899: 435. SOF Sepia officinalis 2 1 120
3900: 436. SOS Stichopus oshimae 1 1 120
3901: 437. SPG Saprospira grandis 1 1 121
3902: 438. SPI Spinacia oleracea 3 2 205
3903: 439. SPL Spirillum volutans 1 1 1492
3904: 440. SPM Spirobolus marginatus 6 3 977
3905: 441. SPO Sporolactobacillus inulinus 1 1 117
3906: 442. SPS Spirogyra sp. 1 1 120
3907: 443. SQD Illex illecebrosus 1 1 120
3908: 444. SRG Sorghum bicolor 1 1 50
3909: 445. SSO Sulfolobus solfataricus 1 1 126
3910: 446. SSP Sulfolobus sp. 1 1 131
3911: 447. STA Staphylococcus aureus 1 1 115
3912: 448. STA Staphylococcus epidermidis 5 3 264
3913: 449. STC Stentor coeruleus 1 1 353
3914: 450. STE Stella humosa 1 1 117
3915: 451. STF Asteria amurensis 2 2 195
3916: 452. STF Asterias forbesi 9 3 1045
3917: 453. STF Asterina pectinifera 1 1 120
3918: 454. STM Streptomyces griseus 1 1 120
3919: 455. STN Stenopus hispidus 1 1 1885
3920: 456. STR Streptococcus cremoris 1 1 117
3921: 457. STR Streptococcus faecalis 1 1 117
3922: 458. STR Streptococcus sp. 1 1 1577
3923: 459. STY Salmonella typhimurium 7 6 459
3924: 460. SUD Pseudocentrotus depressus 1 1 120
3925: 461. SUE Heliocidaris erythrogramma 6 3 1043
3926: 462. SUE Heliocidaris tuberculata 6 3 910
3927: 463. SUH Hemicentrotus pulcherrimus 1 1 120
3928: 464. SUL Lytechinus pictus 6 3 1046
3929: 465. SUP Psammechinus miliaris 8 8 422
3930: 466. SUS Strongylocentrotus purpuratus 6 3 988
3931: 467. SYB Syntrophospora bryantii 1 1 1532
3932: 468. SYC Synechocystis sp. 1 1 76
3933: 469. SYN Synechococcus lividus 1 1 119
3934: 470. SYW Syntrophomonas wolfei 1 1 1532
3935: 471. TAM Tatlockia maceachernii 3 3 458
3936: 472. TAM Tatlockia micdadei 9 9 1286
3937: 473. TAN Tilletiaria anomala 1 1 118
3938: 474. TAP Taphrina deformans 1 1 119
3939: 475. TCO Tilletiaria controversa 1 1 118
3940: 476. TET Tetrahymena thermophila 7 7 615
3941: 477. TEY Tetrahymena pyriformis 3 3 623
3942: 478. TFE Acidiphilium cryptum 1 1 122
3943: 479. TFE Thiobacillus acidophilus 1 1 120
3944: 480. TFE Thiobacillus ferrooxidans 2 2 240
3945: 481. TFE Thiobacillus intermedius 1 1 117
3946: 482. TFE Thiobacillus neapolitanus 1 1 119
3947: 483. TFE Thiobacillus novellus 1 1 120
3948: 484. TFE Thiobacillus perometabolis 1 1 116
3949: 485. TFE Thiobacillus sp. 1 1 117
3950: 486. TFE Thiobacillus thiooxidans 1 1 121
3951: 487. TFE Thiobacillus thioparus 1 1 118
3952: 488. TFE Thiobacillus versutus 1 1 116
3953: 489. TFE Thiomicrospira pelophila 1 1 118
3954: 490. TFE Thiomicrospira sp. 1 1 117
3955: 491. THA Artificial gene 3 3 276
3956: 492. THC Thermococcus celer 2 2 1611
3957: 493. THR Thermomicrobium roseum 1 1 127
3958: 494. THT Thiothrix nivea 1 1 122
3959: 495. THT Thiothrix sp. 1 1 120
3960: 496. THV Thiovulum sp. 1 1 123
3961: 497. TLA Thermomyces lanuginosus 2 2 276
3962: 498. TLP Torulopsis utilis 1 1 121
3963: 499. TOB Nicotiana tabacum 2 2 152
3964: 500. TOR Trichosporon oryzae 1 1 118
3965: 501. TRB Trypanosoma brucei 2 2 106
3966: 502. TRD Tripsacum dactyloides 1 1 50
3967: 503. TRF Crithidia fasciculata 7 7 1305
3968: 504. TRI Trichomonas vaginalis 1 1 341
3969: 505. TTE Thermoproteus tenax 1 1 1504
3970: 506. TTH Thermus aquaticus 2 2 243
3971: 507. TTH Thermus sp. 2 2 243
3972: 508. TTH Thermus thermophilus 5 4 354
3973: 509. TUL Tulasnella violea 1 1 118
3974: 510. TUM Tuberoidobacter mutans 1 1 116
3975: 511. TVI Thraustochytrium visurgense 1 1 119
3976: 512. UPE Ulva pertusa 1 1 120
3977: 513. URE Ureaplasma urealyticum 1 1 1464
3978: 514. UTH Uthatobasidium fusisporum 1 1 118
3979: 515. UUN Urechis unicinctus 1 1 120
3980: 516. VCH Vibrio cholerae 1 1 119
3981: 517. VER Verrucomicrobium spinosum 1 1 116
3982: 518. VFA Vicia faba 2 2 327
3983: 519. VIB Aeromonas hydrophila 1 1 118
3984: 520. VIB Aeromonas media 1 1 119
3985: 521. VIB Aeromonas salmonicida 1 1 119
3986: 522. VIB Alteromonas colwelliana 2 1 120
3987: 523. VIB Alteromonas putrifaciens 1 1 120
3988: 524. VIB Photobacterium angustum 1 1 120
3989: 525. VIB Photobacterium leiognathi 1 1 120
3990: 526. VIB Photobacterium sp. 1 1 120
3991: 527. VIB Plesiomonas shigelloides 1 1 120
3992: 528. VIB Vibrio alginolyticus 1 1 121
3993: 529. VIB Vibrio anguillarum 1 1 120
3994: 530. VIB Vibrio carchariae 1 1 120
3995: 531. VIB Vibrio cincinnatii 1 1 120
3996: 532. VIB Vibrio damsela 1 1 120
3997: 533. VIB Vibrio fischeri 1 1 120
3998: 534. VIB Vibrio fluvialis 2 2 240
3999: 535. VIB Vibrio gazogenes 1 1 120
4000: 536. VIB Vibrio logei 1 1 120
4001: 537. VIB Vibrio marinus 2 2 236
4002: 538. VIB Vibrio metschnitovii 1 1 120
4003: 539. VIB Vibrio mimicus 1 1 120
4004: 540. VIB Vibrio natriegens 1 1 121
4005: 541. VIB Vibrio nereis 2 1 121
4006: 542. VIB Vibrio parahaemolyticus 2 2 239
4007: 543. VIB Vibrio pelagius 1 1 120
4008: 544. VIB Vibrio proteolyticus 1 1 120
4009: 545. VIB Vibrio psychroerythus 1 1 119
4010: 546. VIB Vibrio sp. 5 4 478
4011: 547. VIT Vitreoscilla sp. 2 2 234
4012: 548. VIT Vitreoscilla stercoraria 2 2 1608
4013: 549. VVU Vibrio vulnificus 1 1 120
4014: 550. WHT Triticum aestivum 17 14 1729
4015: 551. WHT Triticum sp. 2 2 152
4016: 552. WHT Triticum vulgare 2 2 282
4017: 553. WLB Wolbachia persica 2 1 1475
4018: 554. WOL Wolinella succinogenes 1 1 1503
4019: 555. XEB Xenopus borealis 3 3 477
4020: 556. XEL Xenopus laevis 20 17 4158
4021: 557. XET Xenopus tropicalis 2 2 242
4022: 558. XYL Xylella fastidiosa 1 1 1493
4023: 559. YSA Candida albicans 1 1 121
4024: 560. YSC Saccharomyces cerevisiae 62 47 4920
4025: 561. YSG Saccharomyces carlsbergensis 4 2 242
4026: 562. YSK Kluyveromyces lactis 1 1 121
4027: 563. YSP Schizosaccharomyces pombe 8 8 630
4028: 564. YSR Pichia membranaefaciens 1 1 120
4029: 565. YST Yeast sp. 7 6 492
4030: 566. YSU Candida utilis 12 10 815
4031: 567. Unidentified 177 169 19082
4032:
4033: Total 1946 1647 445723
4034:
4035: VIRAL
4036:
4037: Key Name Reports Entries Bases
4038: -------------------------------------------------------------------------------
4039: 1. AA2 Adeno associated virus 9 6 7879
4040: 2. AAF Avian musculoaponeurotic fibrosarcoma virus
4041: 2 1 3171
4042: 3. AC2 Avian carcinoma virus 14 11 18641
4043: 4. ACB Avian erythroblastosis virus 19 15 18700
4044: 5. ACE Avian endogenous virus 5 5 2772
4045: 6. ACF Fujinami sarcoma virus 3 2 7503
4046: 7. ACM Avian myelocytomatosis retrovirus 10 8 11975
4047: 8. ACR Avian reticuloendotheliosis virus 12 8 8401
4048: 9. ACS Avian sarcoma virus 15 13 16357
4049: 10. AD4 Mastadenovirus h40 4 4 10795
4050: 11. AD4 Mastadenovirus h41 3 3 8920
4051: 12. ADA Mastadenovirus s30 5 5 1318
4052: 13. ADB Mastadenovirus 2 82 5 36399
4053: 14. ADB Mastadenovirus c2 1 1 196
4054: 15. ADC Mastadenovirus h3 12 11 9026
4055: 16. ADD Mastadenovirus h4 7 5 5078
4056: 17. ADE Mastadenovirus 7 1 1 2718
4057: 18. ADE Mastadenovirus h5 35 11 30276
4058: 19. ADG Mastadenovirus h7 15 6 13245
4059: 20. ADG Mastadenovirus s7 6 5 4931
4060: 21. ADI Mastadenovirus 9 2 2 332
4061: 22. ADJ Mastadenovirus 10 1 1 135
4062: 23. ADL Mastadenovirus 2 2 1 430
4063: 24. ADL Mastadenovirus h12 41 23 19901
4064: 25. ADR Mastadenovirus 18 2 2 364
4065: 26. ADT Tupaia adenovirus 4 4 3784
4066: 27. ADU Mastadenovirus 19 1 1 154
4067: 28. ADV Adenovirus VA 8 7 11146
4068: 29. ADV Mastadenovirus 2 2 1549
4069: 30. ADV Mastadenovirus h40 1 1 1849
4070: 31. ADV Mastadenovirus h41 2 1 1939
4071: 32. ADX Mastadenovirus bos1 1 1 159
4072: 33. ADX Mastadenovirus mus 7 5 9449
4073: 34. ADY Eggdrop syndrome-1976 virus 1 1 52
4074: 35. ADZ Mastadenovirus 31 2 2 300
4075: 36. ADZ Mastadenovirus bos3 1 1 2849
4076: 37. ADZ Mastadenovirus c2 2 2 3689
4077: 38. AEA Avian adenovirus 3 3 576
4078: 39. AEC Canine adenovirus 4 4 805
4079: 40. AEE Equine adenovirus 4 4 617
4080: 41. AIN Aino virus 1 1 850
4081: 42. ALE Rous associated virus 9 8 5119
4082: 43. ALK Avian leukemia virus 2 2 454
4083: 44. ALM Avian myeloblastosis virus 6 6 6465
4084: 45. ALR Rous sarcoma virus 162 136 65936
4085: 46. ALV Avian leukosis virus 12 12 5400
4086: 47. APH Foot and mouth disease virus 125 120 77985
4087: 48. ARE Avian retrovirus 2 2 698
4088: 49. ARE Avian retrovirus IC10 3 2 6013
4089: 50. ARR Adult diarrhea rotavirus 2 2 1445
4090: 51. ASB Avocado sunblotch viroid 20 19 4715
4091: 52. ASS Apple scar skin viroid 1 1 329
4092: 53. ASV African swine fever virus 4 3 6376
4093: 54. BBM Broad bean mottle virus 3 3 680
4094: 55. BBV Black beetle virus 4 3 4893
4095: 56. BCT Beet curly top virus 1 1 2993
4096: 57. BEC Bovine enteritic coronavirus 1 1 1710
4097: 58. BEV Bovine enterovirus 1 1 7414
4098: 59. BIM Bovine immunodeficiency-like virus 1 1 8482
4099: 60. BLC Bunyamwera virus 3 3 12294
4100: 61. BLC Bunyavirus La Crosse 19 18 9070
4101: 62. BLC Germiston bunyavirus 2 2 5514
4102: 63. BLV Bovine leukemia virus 20 20 31133
4103: 64. BNY Beet necrotic yellow vein mosaic virus 6 6 17031
4104: 65. BOO Boolarra virus 2 1 1305
4105: 66. BRV Berne virus 2 2 376
4106: 67. BTV Bluetongue virus 31 23 43490
4107: 68. BVD Bovine viral diarrhea virus 1 1 12573
4108: 69. BWY Beet western yellow virus 5 3 7958
4109: 70. BYD Barley yellow dwarf virus 3 2 6280
4110: 71. CAD Canine distemper virus 4 4 6857
4111: 72. CAN Carnation etched ring virus 1 1 7932
4112: 73. CAP Capripoxvirus 4 4 10417
4113: 74. CAS Cassava latent virus 2 2 5503
4114: 75. CASNS Cas NS1 retrovirus 1 1 2711
4115: 76. CBV Choristoneura biennis virus 1 1 1173
4116: 77. CCC Cadang-cadang coconut viroid 3 3 779
4117: 78. CCP Cricket paralysis virus 1 1 1594
4118: 79. CEA Caprine arthritis encephalitis virus 9 8 11854
4119: 80. CEV Citrus exocortis viroid 4 4 1484
4120: 81. CFD Coconut foliar decay virus 1 1 1291
4121: 82. CHM Chloris striate mosaic virus 1 1 2750
4122: 83. CHV Chlorella virus 2 2 3727
4123: 84. CMV Carnation mottle virus 1 1 4003
4124: 85. CNV Cucumber necrosis virus 1 1 4701
4125: 86. CO4 Coliphage N4 2 2 1759
4126: 87. COB Bovine coronavirus 8 5 15070
4127: 88. CPB Chlorella PBCV-1 virus 1 1 4265
4128: 89. CPE Euxoa scandens cytoplasmic polyhedrosis virus
4129: 1 1 882
4130: 90. CPF Cucumber pale fruit viroid 3 2 604
4131: 91. CPR Chandipura virus 1 1 1751
4132: 92. CPV Cowpox virus 20 20 17648
4133: 93. CRV Cymbidium ringspot virus 2 1 4733
4134: 94. CSO Campoletis sonorensis virus 8 6 9418
4135: 95. CSV Chrysanthemum stunt viroid 4 3 1040
4136: 96. CTN Coconut tinangaja viroid 1 1 254
4137: 97. CXB Coxsackievirus B1 2 2 8844
4138: 98. CXB Coxsackievirus B3 5 5 20481
4139: 99. CXB Coxsackievirus B4 1 1 7395
4140: 100. CYM Clover yellow mosaic potexvirus 1 1 1051
4141: 101. CYS Lymphocystis disease virus of fish 3 3 5310
4142: 102. DEN Dengue virus 105 103 90462
4143: 103. DHV Dhori virus 1 1 1479
4144: 104. DMB Thymotropic retrovirus type B 1 1 285
4145: 105. DNV Densonucleosis virus 1 1 4277
4146: 106. DPF Dapple peach fruit disease viroid 1 1 297
4147: 107. DPP Dapple plum and peach fruit disease viroid
4148: 1 1 297
4149: 108. DUG Dugbe nairovirus 1 1 1712
4150: 109. EAE Equine arthritis encephalitis virus 1 1 2580
4151: 110. EBO Ebola virus 2 2 3178
4152: 111. ECV Echo 11 virus 1 1 98
4153: 112. ECV Echo 6 virus 1 1 99
4154: 113. ECV Echo 9 virus 2 2 615
4155: 114. EEE Eastern equine encephalomyelitis virus 5 5 5163
4156: 115. EEV Venezuelan equine encephalitis virus 7 6 8300
4157: 116. EEW Western equine encephalitis virus 2 2 4521
4158: 117. EIA Equine infectious anemia virus 18 11 24136
4159: 118. EMC Encephalomyocarditis virus 10 9 27249
4160: 119. FCG Gardner-Arnstein Feline Leukemia oncovirus B
4161: 2 2 3863
4162: 120. FCL Feline calicivirus 2 2 6358
4163: 121. FCR RD114 retrovirus 1 1 126
4164: 122. FCS Feline sarcoma virus 7 7 14248
4165: 123. FCV Feline leukemia virus 17 15 38510
4166: 124. FHV Flock house virus 2 1 1400
4167: 125. FIP Feline infectious peritonitis virus 1 1 4500
4168: 126. FIV Feline immunodeficiency virus 6 3 19318
4169: 127. FLA Influenza virus type A 504 412 430270
4170: 128. FLB Influenza virus type B 85 72 102701
4171: 129. FLC Influenza virus type C 29 29 46959
4172: 130. FMV Figwort mosaic virus 1 1 7743
4173: 131. FPV Fowlpox virus 6 6 25819
4174: 132. FV3 Frog virus 3 3 2 2273
4175: 133. GPB Granulosis virus 1 1 999
4176: 134. GPR Gottfried porcine rotavirus 1 1 3302
4177: 135. GSB GS virus 1 1 307
4178: 136. GSH Ground squirrel hepatitis virus 1 1 3311
4179: 137. GVI Grapevine chrome mosaic virus 4 2 11653
4180: 138. GVI Grapevine viroid 3 2 665
4181: 139. GVT Trichoplusia ni granulosis virus 1 1 998
4182: 140. GYS Grapevine yellow speckle viroid 3 2 730
4183: 141. HAN Hantaan virus 6 5 9735
4184: 142. HBD Duck hepatitis B virus 6 5 12249
4185: 143. HBH Heron hepatitis B virus 1 1 3027
4186: 144. HCV Hog cholera virus 3 2 24567
4187: 145. HIV Human immunodeficiency virus type 1 101 53 187276
4188: 146. HIV Human immunodeficiency virus type 2 13 9 75733
4189: 147. HIV Human lymphotropic virus type III 1 1 261
4190: 148. HIV Human T-cell lymphotropic virus type II 7 3 10520
4191: 149. HJV Highlands J virus 2 2 505
4192: 150. HL1 Human lymphotropic virus type I 15 13 26925
4193: 151. HL2 Human lymphotropic virus type II 4 4 5400
4194: 152. HLV Hop latent viroid 2 1 256
4195: 153. HOB Human coronavirus 4 3 3550
4196: 154. HOJ HoJo virus 1 1 3613
4197: 155. HOM Mus hortulanus virus 3 3 4668
4198: 156. HOP Hop Stunt Viroid 10 7 2094
4199: 157. HPA Hepatitis A virus 13 11 42881
4200: 158. HPB Hepatitis B virus 74 70 76139
4201: 159. HPC Hepatitis C virus 3 2 7893
4202: 160. HPD Hepatitis delta virus 4 3 3523
4203: 161. HPE Hepatitis E virus 2 1 2570
4204: 162. HPU Duck hepatitis virus 2 1 3021
4205: 163. HPV Hepatitis virus 2 2 4553
4206: 164. HRD Human retrovirus type D 1 1 8785
4207: 165. HRV Human rhinovirus 11 9 31251
4208: 166. HS1 Herpes simplex virus type 1 148 109 315961
4209: 167. HS2 Herpes simplex virus type 2 37 31 54029
4210: 168. HS4 Epstein-Barr virus 88 68 310429
4211: 169. HS5 Human cytomegalovirus 44 40 136382
4212: 170. HS5 Murine cytomegalovirus 4 3 6921
4213: 171. HS5 Simian cytomegalovirus 1 1 880
4214: 172. HS6 Human herpesvirus type 6 2 2 26298
4215: 173. HSB Bovine herpesvirus type 1 9 9 8227
4216: 174. HSC Simian cytomegalovirus 2 2 2294
4217: 175. HSE Equine herpesvirus type 1 19 17 45132
4218: 176. HSG Gallid herpesvirus type 1 5 5 10607
4219: 177. HSK Gallid herpesvirus type 2 4 4 11325
4220: 178. HSL Feline herpesvirus 1 1 1619
4221: 179. HSL Herpesvirus ateles 1 1 2577
4222: 180. HSM Gallid herpesvirus type 1 2 2 3367
4223: 181. HSO Herpesvirus papio 2 2 806
4224: 182. HSP Human spumaretrovirus 3 3 12095
4225: 183. HSS Herpesvirus saimiri 30 29 22133
4226: 184. HSS Pseudorabies virus 18 14 38325
4227: 185. HST Herpesvirus tamarinus 2 1 2556
4228: 186. HSU Herpesvirus tupaia 1 1 863
4229: 187. HSV Herpes simplex virus 1 1 501
4230: 188. HSY Herpesvirus sylvilagus 1 1 559
4231: 189. HTV Human adult T-cell leukemia virus 3 2 3556
4232: 190. IBA Avian infectious bronchitis virus 24 24 31163
4233: 191. IBB Infectious bronchitis virus 3 2 7215
4234: 192. IBD Infectious bursal disease virus of chickens
4235: 4 3 9138
4236: 193. IHN Infectious hematopoietic necrosis virus 2 2 2961
4237: 194. INS Insect iridescent virus type 22 1 1 2183
4238: 195. IPN Infectious pancreatic necrosis virus 2 1 3097
4239: 196. IRI Iridescent virus type 1 1 1 2461
4240: 197. IRI Iridescent virus type 6 3 3 10327
4241: 198. JEV Japanese encephalitis virus 4 4 18496
4242: 199. KUN Kunjin virus 1 1 10664
4243: 200. KVS Killer virus of S.cerevisiae 8 6 1872
4244: 201. LCV Lymphocytic choriomeningitis virus 11 11 19716
4245: 202. LDV Lactate dehydrogenase-elevating virus 6 6 1684
4246: 203. LEE Lee virus 1 1 3616
4247: 204. LSV Lassa virus 4 4 10307
4248: 205. MAA Alfalfa mosaic virus 28 19 15793
4249: 206. MAV Myeloblastosis-associated virus 1 1 1173
4250: 207. MBG Bean golden mosaic virus 4 4 10465
4251: 208. MBG Bean yellow mosaic virus 1 1 1015
4252: 209. MBR Brome mosaic virus 16 12 9903
4253: 210. MBS Barley stripe mosaic virus 11 8 13655
4254: 211. MBV Middleburg virus 4 3 3394
4255: 212. MCA Cauliflower mosaic virus 28 25 41207
4256: 213. MCC Cowpea chlorotic mottle virus 7 6 6379
4257: 214. MCF Mink cell focus-forming virus 8 8 8202
4258: 215. MCG Cucumber green mottle mosaic virus 2 2 2421
4259: 216. MCM Maize chlorotic mottle virus 2 1 4437
4260: 217. MCP Cowpea mosaic virus 8 8 10170
4261: 218. MCV Cucumber mosaic virus 53 52 34142
4262: 219. MDP Aleutian mink disease parvovirus 4 2 8255
4263: 220. MEA Measles virus 30 25 83843
4264: 221. MEV Maus-Elberfeld virus 1 1 54
4265: 222. MGR Maguari bunyavirus 1 1 945
4266: 223. MHV Murine hepatitis virus 39 33 39489
4267: 224. MLA Abelson murine leukemia virus 8 6 10848
4268: 225. MLE Mouse RFV endogenous retrovirus 2 2 684
4269: 226. MLF Friend mink cell focus-inducing virus 5 4 7000
4270: 227. MLF Friend murine leukemia virus 2 2 4170
4271: 228. MLF Friend spleen focus-forming virus 9 9 13488
4272: 229. MLG Gross passage A murine leukemia virus 2 2 1220
4273: 230. MLK Kirsten murine leukemia virus 1 1 1335
4274: 231. MLM Moloney murine leukemia virus 56 42 35727
4275: 232. MLN Murine non-leukeminogenic retrovirus 1 1 529
4276: 233. MLO AKV murine leukemia virus 7 2 9000
4277: 234. MLR Rauscher spleen focus-forming virus 2 2 2244
4278: 235. MLS Soule murine leukemia virus 2 2 1310
4279: 236. MLT Tikaut murine leukemia virus 1 1 641
4280: 237. MLV Murine leukemia virus 53 44 48928
4281: 238. MLX Xenotropic murine leukemia virus 1 1 3060
4282: 239. MMT Mouse mammary tumor virus 29 28 37314
4283: 240. MNC Narcissus mosaic potexvirus 1 1 6955
4284: 241. MOK Mokola lyssavirus 2 2 152
4285: 242. MOP Mopeia virus 1 1 3419
4286: 243. MPM Mouse polyomavirus 1 1 1155
4287: 244. MPV Monkeypox virus 1 1 1276
4288: 245. MRV Marburg virus 1 1 59
4289: 246. MSB Southern bean mosaic virus 2 2 793
4290: 247. MSC Sugarcane mosaic virus 1 1 1782
4291: 248. MSH Harvey murine sarcoma virus 4 4 3226
4292: 249. MSJ FBJ murine osteosarcoma virus 1 1 4226
4293: 250. MSK Kirsten murine sarcoma virus 2 2 1933
4294: 251. MSN Solanum nodiflorum mottle virus 1 1 377
4295: 252. MSR FBR murine osteosarcoma virus 1 1 3811
4296: 253. MSV Murine sarcoma virus 5 5 5020
4297: 254. MSY Myeloproliferative sarcoma virus 3 3 5305
4298: 255. MTG Tomato golden mosaic virus 3 3 6342
4299: 256. MTR Tobacco rattle virus 7 7 20386
4300: 257. MTS Lucerne transient streak virus 3 3 970
4301: 258. MTV Tobacco mosaic virus 22 8 16050
4302: 259. MTV Velvet tobacco mottle virus 1 1 366
4303: 260. MTY Andean potato latent virus 1 1 96
4304: 261. MTY Clitoria yellow vein virus 1 1 120
4305: 262. MTY Eggplant mosaic virus 3 3 6469
4306: 263. MTY Kennedya yellow mosaic virus 1 1 83
4307: 264. MTY Ononis yellow mosaic virus 2 2 6342
4308: 265. MTY Turnip yellow mosaic virus 20 16 15958
4309: 266. MUM Mumps virus 11 9 13622
4310: 267. MUR Murine retrovirus SL3-2 1 1 492
4311: 268. MVE Murray Valley encephalitis virus 2 2 5994
4312: 269. MVM Minute virus of mice 9 7 16222
4313: 270. MYX Myxoma virus 3 2 4473
4314: 271. MZS Maize streak virus 5 4 8139
4315: 272. NDV Newcastle disease virus 49 47 98864
4316: 273. NEV Nephropathia epidemica 2 2 5466
4317: 274. NOD Nodamura virus 2 1 1335
4318: 275. NPA Antheraea pernyi nuclear polyhedrosis virus
4319: 1 1 285
4320: 276. NPA Autographa californica nuclear polyhedrosis virus
4321: 45 41 67447
4322: 277. NPB Bombyx mori nuclear polyhedrosis virus 3 3 3931
4323: 278. NPG Galleria mellonella nuclear polyhedrosis virus
4324: 5 5 2556
4325: 279. NPM Mamestra brassicae nuclear polyhedrosis virus
4326: 1 1 2598
4327: 280. NPO Orgyia pseudotsugata polyhedrosis virus 8 8 16937
4328: 281. NPS Spodoptera frugiperda nuclear polyhedrosis virus
4329: 1 1 1557
4330: 282. OLV Ovine lentivirus 2 2 18512
4331: 283. ONN O'Nyong-nyong virus 2 1 11835
4332: 284. ORF Orf virus 2 1 5003
4333: 285. PCB Baboon endogenous virus 8 7 20105
4334: 286. PCC Colobus type C cpc-1 endogenous retrovirus
4335: 2 2 373
4336: 287. PCE Chimpanzee type C endogenous retrovirus 2 2 430
4337: 288. PCG Gibbon leukemia virus 5 4 9202
4338: 289. PCM Macaca endogenous retrovirus 1 1 126
4339: 290. PCM Macaca mulatta type C retrovirus 4 4 938
4340: 291. PCS Simian sarcoma virus 12 9 10868
4341: 292. PEB Pea early browning virus 2 1 7073
4342: 293. PEV Subacute sclerosing panencephalitis virus 3 3 3444
4343: 294. PIB Bovine parainfluenza virus type 3 3 1 8700
4344: 295. PIC Pichinde Arenavirus 8 8 12637
4345: 296. PIF Human parainfluenza virus type 3 29 27 46139
4346: 297. PLV Potato leaf roll virus 5 3 6650
4347: 298. PLY Budgerigar fledgling disease virus 1 1 4980
4348: 299. PLY Polyomavirus 131 39 35440
4349: 300. PLY Polyomavirus BK 2 2 799
4350: 301. PLY Polyomavirus JC 3 3 765
4351: 302. PMP Papaya mosaic potexvirus 2 2 1039
4352: 303. PMS Simian paramyxovirus (SV5) 1 1 1382
4353: 304. PMV Pepper mottle virus 1 1 1480
4354: 305. POL Poliovirus 120 106 78406
4355: 306. POV Porcine parvovirus 1 1 3670
4356: 307. PPA Avian papillomavirus 2 2 786
4357: 308. PPB Bovine papillomavirus 14 14 32821
4358: 309. PPC Hamster papovavirus 2 2 10672
4359: 310. PPD Deer papillomavirus 1 1 8374
4360: 311. PPE European Elk papillomavirus 4 3 8842
4361: 312. PPH Human papillomavirus 40 38 101284
4362: 313. PPI Micromys minutus papillomavirus 3 3 487
4363: 314. PPL Lymphotropic papovavirus 1 1 5270
4364: 315. PPM Monkey B-lymphotropic papovavirus 4 4 10920
4365: 316. PPR Reindeer papillomavirus 2 2 930
4366: 317. PPV Plum pox potyvirus 3 3 13827
4367: 318. PRH Prospect Hill virus 1 1 1675
4368: 319. PRV Porcine rotavirus 11 9 12190
4369: 320. PSV Peanut stunt virus 1 1 393
4370: 321. PTP Punta toro phlebovirus 6 6 7130
4371: 322. PTV Potato spindle tuber viroid 5 5 1795
4372: 323. PV1 Parvovirus H1 3 2 5302
4373: 324. PV3 Parvovirus H3 1 1 125
4374: 325. PVA Raccoon parvovirus 2 1 2410
4375: 326. PVB Papovavirus BKV 38 25 18937
4376: 327. PVB Parvovirus B19 4 4 11325
4377: 328. PVC Canine parvovirus 4 4 10016
4378: 329. PVD Bovine parvovirus 2 1 5517
4379: 330. PVF Feline panleukopenia virus 10 6 16703
4380: 331. PVM Mink enteritis virus 4 2 4888
4381: 332. PVR Kilham rat virus 1 1 125
4382: 333. PVR Parvovirus R1 2 2 548
4383: 334. PVS Potato virus S 1 1 3552
4384: 335. PVX Potato virus X 8 6 22573
4385: 336. PVY Potato virus Y 8 5 17509
4386: 337. RAV Rabies virus 21 20 32337
4387: 338. RBF Malignant rabbit fibroma virus 3 3 446
4388: 339. RBF Rabbit fibroma virus 15 15 28212
4389: 340. RBV Rabbit rotavirus 1 1 1036
4390: 341. RCM Red clover mottle virus 1 1 3543
4391: 342. RDV Rice dwarf virus 5 3 5209
4392: 343. REO Reovirus sp. 2 2 2903
4393: 344. REO Reovirus type 1 21 19 14348
4394: 345. REO Reovirus type 2 14 12 6823
4395: 346. REO Reovirus type 3 48 34 25499
4396: 347. RML Rauscher murine leukemia virus 3 3 395
4397: 348. RNM Red clover necrotic mosaic virus 3 2 5338
4398: 349. RO1 Rotavirus sp. 3 3 4074
4399: 350. RO1 Rotavirus subgroup 1 3 2 2712
4400: 351. RO2 Rotavirus subgroup 2 6 6 5955
4401: 352. ROB Bovine rotavirus 21 16 24214
4402: 353. ROH Human rotavirus 6 6 8164
4403: 354. ROR Rhesus rotavirus 2 2 3424
4404: 355. ROT Simian rotavirus SA11 19 15 20730
4405: 356. RPF Rinderpest virus 5 5 10323
4406: 357. RPV Raccoonpox virus 1 1 2195
4407: 358. RRV Ross river virus 4 3 19686
4408: 359. RSB Bovine syncytial virus 1 1 1201
4409: 360. RSH Human respiratory syncytial virus 33 21 19332
4410: 361. RSV Rat sarcoma virus 1 1 1380
4411: 362. RUB Rubella virus 12 7 26119
4412: 363. RVF Rift Valley fever virus 1 1 3884
4413: 364. SAM Satellite arabis mosaic virus 1 1 300
4414: 365. SAP Satellite panicum mosaic virus 1 1 826
4415: 366. SFS Sandfly fever Sicilian virus 2 1 1747
4416: 367. SFV Semliki forest virus 12 3 15380
4417: 368. SHV Simian hepatitis A virus 6 4 4331
4418: 369. SIG Sigma virus 1 1 1718
4419: 370. SIN Sindbis virus 16 6 18450
4420: 371. SIV Simian immunodeficiency virus 31 23 106170
4421: 372. SIV Simian immunodeficiency virus 1 1 7759
4422: 373. SIV Simian immunodeficiency virus 3 3 9130
4423: 374. SLO St. Louis encephalitis virus 8 8 5391
4424: 375. SMF Simian foamy virus 1 1 3534
4425: 376. SND Parainfluenza virus 39 31 59944
4426: 377. SND Parainfluenza virus type 4A 2 2 3767
4427: 378. SNV Spleen necrosis virus 9 8 3909
4428: 379. SPV Spiroplasma virus 1 1 4421
4429: 380. SRV Sapporo rat virus 2 2 5420
4430: 381. SSH Snowshoe hare bunyavirus 11 10 6726
4431: 382. STL Simian T-cell lymphotropic virus type I 3 3 8227
4432: 383. STT St. Thomas 3 rotavirus 1 1 1062
4433: 384. SUV Subterranean clover mottle virus 2 2 720
4434: 385. SV4 Rhesus macaque polyomavirus 180 42 15361
4435: 386. SV5 Simian virus 5 4 4 5586
4436: 387. SVC Spring viremia of carp virus 2 2 778
4437: 388. SVD Swine vesicular disease virus 2 2 7475
4438: 389. SYE Sonchus yellow net virus 3 3 2822
4439: 390. TAC Tacaribe virus 4 3 10607
4440: 391. TAS Tomato apical stunt viroid 3 2 723
4441: 392. TBE Tick-borne encephalitis virus 7 5 32678
4442: 393. TBR Tomato black ring virus 11 11 18222
4443: 394. TBS Tomato bushy stunt virus 3 2 5172
4444: 395. TBV Tick-borne virus 1 1 1586
4445: 396. TCV Turnip crinkle virus 5 5 6433
4446: 397. TEV Tobacco etch virus 4 3 21315
4447: 398. TGE Transmissible gastroenteritis virus 14 9 20969
4448: 399. TME Theiler's murine encephalomyelitis virus 4 4 26220
4449: 400. TMG Tobacco mild green mosaic virus 3 2 7768
4450: 401. TNC Tobacco necrosis virus 1 1 3660
4451: 402. TNS Satellite tobacco necrosis virus 4 2 1380
4452: 403. TOA Tomato aspermy virus 5 5 943
4453: 404. TOS Tomato ringspot virus 2 2 3096
4454: 405. TPM Tomato plant macho viroid 1 1 360
4455: 406. TRS Tobacco ringspot virus 3 3 790
4456: 407. TSV Tobacco streak virus 3 3 2525
4457: 408. TVM Tobacco vein mottling virus 4 2 9892
4458: 409. UST Ustilago maydis P6 virus 1 1 1234
4459: 410. UUK Uukuniemi virus 2 2 4951
4460: 411. VAC Vaccinia virus 97 88 389431
4461: 412. VAR Variola virus 1 1 1274
4462: 413. VAZ Varicella-zoster virus 11 7 131533
4463: 414. VLV Visna virus 2 2 9690
4464: 415. VSV Vesicular stomatitis virus 169 146 192746
4465: 416. VYS Saccharomyces cerevisiae virus ScV1 1 1 819
4466: 417. WCP White clover mosaic virus 4 3 13303
4467: 418. WDV Wheat dwarf virus 2 2 2829
4468: 419. WHV Woodchuck hepatitis virus 7 7 19239
4469: 420. WMS Woolly monkey sarcoma virus 2 1 1431
4470: 421. WNF West Nile virus 7 4 11434
4471: 422. WTV Wound tumor virus 11 9 14409
4472: 423. YFV Flavivirus febricis 5 3 22289
4473: 424. ZYM Zucchini yellow mosaic virus 1 1 1374
4474:
4475: Total 4751 3707 6439492
4476:
4477: PHAGE
4478:
4479: Key Name Reports Entries Bases
4480: -------------------------------------------------------------------------------
4481: 1. AL3 Bacteriophage alpha3 7 5 1299
4482: 2. BAZ Bacteriophage Z 1 1 370
4483: 3. BBF Bacteriophage BF23 3 3 1434
4484: 4. BEO Corynebacteriophage omega 1 1 1880
4485: 5. BET Corynebacteriophage beta 3 2 4162
4486: 6. BEU Corynebacteriophage gamma 2 2 139
4487: 7. BFR Bacteriophage fr 4 4 2053
4488: 8. BM2 Bacteriophage PM2 3 3 1025
4489: 9. BNF Bacteriophage NF 6 5 3258
4490: 10. BO1 Bacteriophage Bo1 1 1 205
4491: 11. BP2 Bacteriophage P21 2 2 191
4492: 12. BPH Bacteriophage phi-11 2 2 2041
4493: 13. BT1 Bacteriophage T1 1 1 1091
4494: 14. BU3 Bacteriophage U3 1 1 201
4495: 15. BZ3 Bacteriophage Bz13 1 1 218
4496: 16. C31 Bacteriophage phi-c31 1 1 3413
4497: 17. CF1 Bacteriophage Cf16 1 1 500
4498: 18. CP1 Bacteriophage Cp-1 5 3 3364
4499: 19. CP5 Bacteriophage Cp-5 4 2 1850
4500: 20. CP7 Bacteriophage Cp-7 5 3 4792
4501: 21. CP9 Bacteriophage Cp-9 1 1 1253
4502: 22. CPT Bacteriophage Cp-T1 1 1 730
4503: 23. D18 Bacteriophage D108 9 7 3935
4504: 24. F1C Bacteriophage f1 16 13 16373
4505: 25. F2C Bacteriophage f2 1 1 58
4506: 26. FR1 Bacteriophage fr1 1 1 205
4507: 27. G14 Bacteriophage G14 1 1 113
4508: 28. H19B Bacteriophage H19B 2 2 3301
4509: 29. H30 Bacteriophage H30 1 1 1905
4510: 30. H44 Bacteriophage H4489A 1 1 3222
4511: 31. HB3 Bacteriophage HB-3 1 1 1319
4512: 32. HP1 Bacteriophage HP1 4 2 10673
4513: 33. IKE Bacteriophage Ike 3 2 7200
4514: 34. J93 Bacteriophage 933J 2 1 1499
4515: 35. JP3 Bacteriophage Jp34 2 2 1070
4516: 36. JP5 Bacteriophage Jp501 1 1 205
4517: 37. K5T Bacteriophage BK5-T 5 5 2070
4518: 38. KU1 Bacteriophage Ku1 1 1 220
4519: 39. L17 Bacteriophage L17 2 2 240
4520: 40. L54 Bacteriophage L54a 1 1 1626
4521: 41. LAM Bacteriophage lambda 120 24 55603
4522: 42. LP7 Bacteriophage LP7 1 1 2110
4523: 43. M13 Bacteriophage M13 12 8 8218
4524: 44. M13MP7 Bacteriophage M13mp7 1 1 60
4525: 45. M13MP8 Bacteriophage M13mp8 3 3 240
4526: 46. M13MP9 Bacteriophage M13mp9 2 2 318
4527: 47. M2Y Bacteriophage M2Y 2 2 336
4528: 48. MS2 Bacteriophage MS2 16 8 4679
4529: 49. OX2 Bacteriophage Ox2 2 2 2641
4530: 50. P15 Bacteriophage phi-105 1 1 1306
4531: 51. P16 Bacteriophage 16-3 1 1 720
4532: 52. P18 Bacteriophage 186 1 1 3561
4533: 53. P21 Bacteriophage phi-21 2 2 949
4534: 54. P22 Bacteriophage P22 17 15 18461
4535: 55. P29 Bacteriophage phi-29 18 15 29805
4536: 56. P42 Bacteriophage 42D 2 2 1986
4537: 57. P434 Bacteriophage 434 7 5 2933
4538: 58. P80 Bacteriophage phi-80 7 6 4714
4539: 59. P82 Bacteriophage 82 1 1 1200
4540: 60. P93 Bacteriophage 933W 2 1 1661
4541: 61. PA2 Bacteriophage PA-2 1 1 2816
4542: 62. PF1D Bacteriophage Pf1 1 1 435
4543: 63. PF3 Bacteriophage Pf3 4 4 12981
4544: 64. PFD Bacteriophage fd 12 7 7334
4545: 65. PFI Bacteriophage Fi 1 1 78
4546: 66. PG4 Bacteriophage G4 12 8 7247
4547: 67. PGA Bacteriophage Ga 4 4 4022
4548: 68. PH1 Bacteriophage H1 1 1 98
4549: 69. PH15 Bacteriophage phi-15 3 3 2352
4550: 70. PH2 Bacteriophage 21 1 1 1688
4551: 71. PH3 Bacteriophage phi-3T 2 2 3422
4552: 72. PH5 Bacteriophage phi-105 6 5 3851
4553: 73. PH6 Bacteriophage phi-6 7 7 13619
4554: 74. PHC Lactococcus 1 bacteriophage 1 1 1654
4555: 75. PHI Bacteriophage phi-H 1 1 2465
4556: 76. PHK Bacteriophage phi-K 3 2 426
4557: 77. PHM Bacteriophage phi-vML3 1 1 1208
4558: 78. PK3 Bacteriophage K3 7 6 6630
4559: 79. PM1 Bacteriophage M1 1 1 1714
4560: 80. PM2 Bacteriophage M2 1 1 1820
4561: 81. PMU Bacteriophage Mu 48 37 18049
4562: 82. PP1 Bacteriophage P1 43 41 20939
4563: 83. PP2 Bacteriophage P2 11 10 7614
4564: 84. PP4 Bacteriophage P4 9 8 14159
4565: 85. PP7 Bacteriophage P7 6 5 3315
4566: 86. PQB Bacteriophage Q-beta 14 13 1866
4567: 87. PR4 Bacteriophage PR4 2 2 240
4568: 88. PR5 Bacteriophage PR5 2 2 238
4569: 89. PR722 Bacteriophage PR722 2 2 240
4570: 90. PRD1 Bacteriophage PRD1 9 9 9061
4571: 91. PS2 Bacteriophage PBS2 1 1 720
4572: 92. PSP Bacteriophage Sp 3 2 4542
4573: 93. PST Bacteriophage ST 1 1 246
4574: 94. PT2 Bacteriophage T2 9 6 8743
4575: 95. PT3 Bacteriophage T3 21 18 16851
4576: 96. PT4 Bacteriophage T4 120 67 128557
4577: 97. PT5 Bacteriophage T5 29 26 26837
4578: 98. PT6 Bacteriophage T6 4 3 2938
4579: 99. PT7 Bacteriophage T7 41 18 47176
4580: 100. PVK Bacteriophage VK 1 1 246
4581: 101. PX1 Bacteriophage phi-X174 40 15 7239
4582: 102. PZA Bacteriophage PZA 3 1 19366
4583: 103. R17 Bacteriophage R17 9 7 463
4584: 104. RB1 Bacteriophage RB18 1 1 674
4585: 105. RB5 Bacteriophage RB51 1 1 700
4586: 106. RHO Bacteriophage Rho11s 2 1 2187
4587: 107. S13 Bacteriophage S13 2 1 5386
4588: 108. SF6 Bacteriophage SF6 1 1 996
4589: 109. SP1 Bacteriophage SPO1 20 20 6864
4590: 110. SP2 Bacteriophage SPO2 1 1 3040
4591: 111. SP6 Bacteriophage SP6 5 4 2948
4592: 112. SP8 Bacteriophage SP82 5 5 1527
4593: 113. SPB Bacteriophage SP-beta 4 3 2224
4594: 114. SPC Bacteriophage S-phi-C 1 1 1377
4595: 115. SPP Bacteriophage SPP1 2 2 1558
4596: 116. SPR Bacteriophage SPR 3 1 2129
4597: 117. ST1 Bacteriophage ST-1 2 2 844
4598: 118. T12 Bacteriophage T12 1 1 1837
4599: 119. TH1 Bacteriophage TH1 1 1 220
4600: 120. TW1 Bacteriophage TW19 1 1 76
4601: 121. TW2 Bacteriophage TW28 1 1 260
4602:
4603: Total 880 593 682556
4604:
4605: SYNTHETIC
4606:
4607: Key Name Reports Entries Bases
4608: -------------------------------------------------------------------------------
4609: 1. ACC Cloning vector 1 1 1337
4610: 2. AD2 Artificial gene 1 1 128
4611: 3. ADB Artificial gene 8 8 573
4612: 4. ADH Artificial gene 1 1 106
4613: 5. ADL Artificial gene 3 3 273
4614: 6. ADV Artificial gene 1 1 106
4615: 7. ALM Avian myeloblastosis virus 1 1 337
4616: 8. ALR Rous sarcoma virus 4 4 413
4617: 9. AMH Artificial gene 1 1 234
4618: 10. APH Artificial gene 1 1 400
4619: 11. ARB Artificial gene 3 3 1180
4620: 12. ARC Cloning vector 2 2 760
4621: 13. ARE Artificial gene 1 1 255
4622: 14. ARG Artificial gene 1 1 249
4623: 15. ARH Artificial gene 5 5 440
4624: 16. ARI Artificial gene 1 1 465
4625: 17. ARL Artificial gene 2 2 424
4626: 18. ARM Artificial gene 1 1 457
4627: 19. ARN Cloning vector 1 1 333
4628: 20. ARP Cloning vector 6 6 1079
4629: 21. ARS Artificial gene 1 1 529
4630: 22. ART Artificial gene 1 1 60
4631: 23. ARY Artificial gene 1 1 264
4632: 24. ATH Artificial gene 1 1 417
4633: 25. BAM Artificial gene 1 1 85
4634: 26. BKV BK Virus 6 3 1560
4635: 27. BOV Bos taurus 18 18 6440
4636: 28. BSF Cloning vector 2 2 54
4637: 29. BSM Cloning vector 1 1 54
4638: 30. BSU Bacillus subtilis 10 9 3921
4639: 31. BTH Artificial gene 2 2 104
4640: 32. CAR Artificial gene 1 1 3616
4641: 33. CEL Caenorhabditis elegans 1 1 186
4642: 34. CHK Gallus sp. 4 4 701
4643: 35. CHS Artificial gene 1 1 478
4644: 36. CMVMUS Artificial gene 1 1 1376
4645: 37. COT Artificial gene 1 1 7876
4646: 38. CRO Artificial gene 2 2 198
4647: 39. CVC Cloning vector 1 1 46
4648: 40. CVE Cloning vector 1 1 60
4649: 41. CVJ Cloning vector 5 5 390
4650: 42. CVK Cloning vector 1 1 120
4651: 43. CYN Artificial gene 1 1 282
4652: 44. DRO Drosophila sp. 4 4 4200
4653: 45. E6V Artificial gene 2 2 206
4654: 46. ECO Escherichia coli 124 111 24710
4655: 47. EGF Artificial gene 1 1 299
4656: 48. ERY Artificial gene 1 1 217
4657: 49. EXP Cloning vector 2 2 123
4658: 50. EZZ Cloning vector 1 1 60
4659: 51. FCN Artificial gene 1 1 42
4660: 52. FCS Cloning vector 2 2 136
4661: 53. FLA Artificial gene 1 1 69
4662: 54. FLU Influenza virus 6 6 861
4663: 55. FSB Artificial gene 2 2 747
4664: 56. GFA Artificial gene 1 1 176
4665: 57. HAL Artificial gene 4 3 1633
4666: 58. HBV Hepatitis B virus 3 3 315
4667: 59. HCY Artificial gene 1 1 313
4668: 60. HET Hetropolymeric DNA 2 2 594
4669: 61. HIR Artificial gene 1 1 220
4670: 62. HIV Artificial gene 6 6 879
4671: 63. HL1 Artificial gene 4 4 238
4672: 64. HNR Artificial gene 1 1 90
4673: 65. HPB Artificial gene 1 1 556
4674: 66. HS1 Artificial gene 1 1 780
4675: 67. HS2 Artificial gene 2 2 129
4676: 68. HS5 Human cytomegalovirus 1 1 210
4677: 69. HSV Herpes Simplex Virus 6 6 323
4678: 70. HUM Artificial human gene 80 71 22856
4679: 71. HY3 Plasmid pHY300PLK 1 1 4870
4680: 72. IFH Cloning vector 1 1 63
4681: 73. IL1 Artificial gene 1 1 88
4682: 74. INS Artificial gene 1 1 232
4683: 75. ISN Insertion element 6 6 378
4684: 76. JRD Cloning vector 3 3 6852
4685: 77. KAN Cloning vector 3 3 210
4686: 78. KPN Klebsiella pneumoniae 2 2 354
4687: 79. KY1 Artificial gene 1 1 171
4688: 80. LAC Cloning vector 2 2 1173
4689: 81. LAM Bacteriophgage lambda 4 4 336
4690: 82. LET Artificial gene 1 1 212
4691: 83. LGT Cloning vector lambda gt11 1 1 210
4692: 84. LHM Artificial gene 1 1 232
4693: 85. LOR Cloning vector 1 1 5614
4694: 86. M13 Cloning vector M13 8 8 643
4695: 87. M13MP7 Cloning vector M13mp7 2 2 120
4696: 88. M13MP8 Cloning vector M13mp8 1 1 382
4697: 89. M13MP9 Cloning vector M13mp9 1 1 60
4698: 90. M13TG103 Cloning vector M13tg103 1 1 66
4699: 91. M13TG114 Cloning vector M13tg114 1 1 60
4700: 92. M13TG115 Cloning vector M13tg115 1 1 66
4701: 93. M13TG117 Cloning vector M13tg117 1 1 63
4702: 94. M13TG120 Cloning vector M13tg120 1 1 54
4703: 95. M13TG130 Cloning vector M13tg130 1 1 93
4704: 96. M13TG131 Cloning vector M13tg131 1 1 93
4705: 97. MBO Artificial gene 1 1 91
4706: 98. MBR Artificial gene 3 3 157
4707: 99. MCA Cauliflower mosaic virus 2 2 139
4708: 100. MCV Cucumber mosaic virus 5 5 284
4709: 101. MHI Mouse-human hybrid 4 4 1574
4710: 102. MLE Artificial gene 1 1 936
4711: 103. MLF Artificial gene 1 1 213
4712: 104. MLM Artificial gene 2 2 178
4713: 105. MML Cloning vector 12 4 24042
4714: 106. MNV Artificial gene 1 1 87
4715: 107. MP7 Artificial gene 2 1 69
4716: 108. MP8 Artificial gene 2 1 60
4717: 109. MP9 Artificial gene 2 1 60
4718: 110. MS2 Artificial gene 1 1 100
4719: 111. MSM Artificial gene 2 2 331
4720: 112. MUS Mus musculus 38 38 4298
4721: 113. NEU Artificial gene 2 2 171
4722: 114. NNL Plasmid pNNL 1 1 815
4723: 115. NPA Autographa californica nuclear polyhedrosis virus
4724: 3 3 922
4725: 116. OVC Artificial gene 1 1 738
4726: 117. P17X Plasmid pACYC177 7 6 4190
4727: 118. P18X Plasmid pACYC184 5 4 4593
4728: 119. P23 Artificial gene 1 1 119
4729: 120. PAC Cloning vector 1 1 83
4730: 121. PAH Artificial gene 2 2 107
4731: 122. PBD Cloning vector 1 1 79
4732: 123. PBG Cloning vector 6 3 12379
4733: 124. PBR Plasmid pBR322 43 23 7123
4734: 125. PBR313 Plasmid pBR313 1 1 200
4735: 126. PBR322SV Plasmid pBR322/SV40 hybrid 5 5 209
4736: 127. PBR325 Plasmid pBR325 3 3 319
4737: 128. PBR327 Plasmid pBR327 3 3 3334
4738: 129. PBR329 Plasmid pBR329 1 1 4150
4739: 130. PBR345 Plasmid pBR345 2 2 1024
4740: 131. PBRH4 Plasmid pBRH4 1 1 71
4741: 132. PCE Cloning vector 1 1 510
4742: 133. PCG86 Plasmid pCG86 2 2 654
4743: 134. PCZ Plasmid pCZ 2 2 208
4744: 135. PDPL Plasmid PDPL13 1 1 79
4745: 136. PEA Artificial gene 1 1 1004
4746: 137. PEM Cloning vector pEMBL8m 4 2 7878
4747: 138. PES Cloning vector 1 1 99
4748: 139. PF1 Bacteriophage f1 1 1 254
4749: 140. PFD Artificial gene 1 1 85
4750: 141. PFE Plasmid pFE 2 2 180
4751: 142. PFH Plasmid pFH 1 1 120
4752: 143. PFL Cloning vector 2 1 4588
4753: 144. PFR Plasmid pFR 4 4 341
4754: 145. PFX Artificial gene 2 1 3627
4755: 146. PHP Plasmid pHP45 1 1 155
4756: 147. PHS Plamsid pHS 3 3 2877
4757: 148. PHV100 Cloning vector 1 1 396
4758: 149. PHV33 Artificial gene 13 13 650
4759: 150. PIC Plasmid pIC 5 5 477
4760: 151. PIG Artificial pig gene 3 3 440
4761: 152. PIP1088 Plasmid pIP1088 2 2 142
4762: 153. PIVX Cloning vector pi-VX 1 1 902
4763: 154. PJSC73 Plasmid pJSC73 1 1 3564
4764: 155. PK18 Cloning vector 1 1 2661
4765: 156. PKN Plasmodium knowlesi 2 1 360
4766: 157. PKT Artificial gene 3 3 264
4767: 158. PKU Cloning vector 3 2 7825
4768: 159. PL2 Artificial gene 2 2 240
4769: 160. PL5 Artificial gene 2 2 310
4770: 161. PLB Cloning vector 1 1 852
4771: 162. PLF Cloning vector 2 1 3641
4772: 163. PLY Artificial gene 1 1 66
4773: 164. PMB9 Cloning vector 1 1 138
4774: 165. PMC1843 Plasmid pMC1843 1 1 62
4775: 166. PMK20 Artificial gene 2 1 4028
4776: 167. PMT Cloning vector 1 1 2854
4777: 168. PMU Artificial gene 4 4 576
4778: 169. POG Cloning vector 1 1 352
4779: 170. POL Artificial gene 2 2 129
4780: 171. POLY Cloning vector 6 3 6226
4781: 172. PORI17 Plasmid pOri17 2 2 490
4782: 173. PPI Cloning vector 1 1 4734
4783: 174. PPUC Cloning vector 1 1 75
4784: 175. PQB Artificial gene 1 1 64
4785: 176. PRK Cloning vector 2 2 839
4786: 177. PRT Artificial gene 1 1 711
4787: 178. PRW1707 Plasmid pRW1707 1 1 66
4788: 179. PRW1718 Plasmid pRW1718 1 1 72
4789: 180. PRW1724 Plasmid pRW1724 1 1 66
4790: 181. PRW1725 Plasmid pRW1725 1 1 66
4791: 182. PSE Artificial gene 2 2 139
4792: 183. PSI Cloning vector 1 1 81
4793: 184. PSKS104 Plasmid pSKS104 1 1 69
4794: 185. PSKS105 Plasmid pSKS105 1 1 60
4795: 186. PSKS106 Plasmid pSKS106 1 1 60
4796: 187. PSKS107 Plasmid pSKS107 1 1 46
4797: 188. PSMF Cloning vector 2 2 259
4798: 189. PSP Cloning vector 3 3 215
4799: 190. PSR Artificial gene 1 1 138
4800: 191. PSS Cloning vector 2 2 475
4801: 192. PT4 Bacteriophage T4 4 4 725
4802: 193. PT7 Bacteriophage T7 2 2 282
4803: 194. PTK Plasmid pTK 1 1 68
4804: 195. PTL Cloning vector 1 1 51
4805: 196. PTN Plasmid pTN 1 1 355
4806: 197. PTR Plasmid pTr 1 1 137
4807: 198. PTU Artificial gene 4 4 883
4808: 199. PTZ Plasmid pTZ12 1 1 2517
4809: 200. PUC Cloning vector 2 1 3914
4810: 201. PUEX Cloning vector 2 1 6728
4811: 202. PVH51 Plasmid pVH51 1 1 3847
4812: 203. PX1 Bacteriophage phi-X174 1 1 59
4813: 204. PYM Artificial gene 1 1 252
4814: 205. PYR Artificial gene 1 1 158
4815: 206. PZ189 Cloning vector 1 1 153
4816: 207. R38 Plasmid R388 1 1 1167
4817: 208. R67 Plasmid R67 1 1 353
4818: 209. R6K Cloning vector 2 2 176
4819: 210. RAD Artificial gene 1 1 955
4820: 211. RAT Rattus sp. 14 13 1864
4821: 212. RET Cloning vector 2 2 780
4822: 213. RMT Artificial gene 6 6 638
4823: 214. RNA Artificial gene 1 1 328
4824: 215. ROT Artificial gene 2 2 141
4825: 216. RRNA Artificial gene 1 1 136
4826: 217. RSC1 Plasmid Rsc13 3 1 7894
4827: 218. RSF1050 Plasmid RSF1050 1 1 104
4828: 219. RSP Artificial gene 1 1 100
4829: 220. RSV Rous Sarcoma Virus 3 3 450
4830: 221. RTS Artificial gene 1 1 280
4831: 222. S100 Artificial gene 1 1 283
4832: 223. SAA Bacteriophage sigma-11-AA248 1 1 83
4833: 224. SAU Staphylcoccus aureus 1 1 60
4834: 225. SFV Semliki forest virus 3 3 171
4835: 226. SHI Cloning vector 3 3 428
4836: 227. SHU Artificial gene 3 3 272
4837: 228. SIN Cloning vector 2 2 596
4838: 229. SLM Artificial gene 10 10 817
4839: 230. SOMINS Artificial gene 1 1 226
4840: 231. SOP Artificial gene 1 1 111
4841: 232. SP02 Bacteriophage SP02 1 1 487
4842: 233. SP6 Artificial gene 1 1 78
4843: 234. SPI Artificial gene 2 2 295
4844: 235. SRU Artificial gene 1 1 252
4845: 236. STA Artificial gene 4 4 619
4846: 237. STM Artificial gene 1 1 71
4847: 238. STY Salmonella sp. 1 1 135
4848: 239. SV4 Simian Virus 40 13 13 2415
4849: 240. SVA Artificial gene 1 1 213
4850: 241. SYN Plasmid pDSP1 161 146 49857
4851: 242. SYN Synthetic sequence 51 48 131042
4852: 243. T13 Artificial gene 1 1 223
4853: 244. T4L Artificial gene 1 1 518
4854: 245. TAC Artificial gene 1 1 842
4855: 246. THA Artificial gene 1 1 641
4856: 247. THY Plasmid pUC8 2 1 503
4857: 248. TI Plasmid Ti 4 3 6552
4858: 249. TN3 Artificial gene 1 1 110
4859: 250. TNP Artificial gene 1 1 192
4860: 251. TNS Cloning vector 2 2 144
4861: 252. TOB Artificial gene 1 1 788
4862: 253. TRN28 Cloning vector 2 2 284
4863: 254. TRN3 Transposon Tn3 10 10 1119
4864: 255. TRN5 Artificial gene 1 1 80
4865: 256. TRNB Artificial gene 1 1 84
4866: 257. TU4 Cloning vector 2 2 350
4867: 258. VAC Cloning vector 4 4 683
4868: 259. VCH Artificial gene 1 1 444
4869: 260. VEC Cloning vector 1 1 143
4870: 261. VTR Cloning vector 1 1 148
4871: 262. WHL Artificial gene 1 1 507
4872: 263. XEL Xenopus laevis 13 13 1227
4873: 264. YSC Saccharomyces cerevisiae 41 40 10154
4874: 265. YSE Artificial gene 2 1 1795
4875: 266. YST Artificial gene 1 1 82
4876: 267. ZMO Artificial gene 3 3 740
4877:
4878: Total 1129 1028 516186
4879:
4880: UNANNOTATED
4881:
4882: Key Name Reports Entries Bases
4883: -------------------------------------------------------------------------------
4884: 1. Unidentified 4909 3756 4792964
4885:
4886: Total 4909 3756 4792964
This archive runs on limited infrastructure. Preserving old code on modern bandwidth. Automated agents are requested to crawl responsibly.