Available Libraries

GenSAS provides the following datasets as globally available libraries for use with alignment tools or RepeatMasker.  

GenSAS-provided library      RNA sequences  Protein sequences  Release date
refseq_archaea               1,146          995,054 (nr)       5/11/2017
refseq_bacteria              20,654         65,493,815 (nr)    5/11/2017
refseq_fungi                 2,225,307      2,227,486          5/11/2017
refseq_invertebrate          3,007,080      2,831,130          5/11/2017
refseq_mitochondrion         111            112,258            5/11/2017
refseq_plant                 3,757,224      3,459,765          5/11/2017
refseq_plasmid               10             97,210             5/11/2017
refseq_plastid               58             151,973            5/11/2017
refseq_protozoa              950,262        978,184            5/11/2017
refseq_vertebrate_mammalian  4,817,774      4,150,311          5/11/2017
refseq_vertebrate_other      4,122,126      3,805,135          5/11/2017
refseq_viral                 NA             147 (nr)           5/11/2017
sprot                        NA             554,860            6/7/2017
trembl                       NA             87,219,332         6/7/2017
RepBase library              NA             NA                 4/28/2017