Available Libraries

GenSAS provides the following datasets as globally available libraries for use with alignment tools or RepeatMasker.  

GenSAS-provided library      RNA number  Protein number   Release date
refseq_archaea               1,175       1,237,647 (nr)   3/12/2018
refseq_bacteria              21,664      84,095,077 (nr)  3/11/2018
refseq_fungi                 2,249,373   2,250,018        3/12/2018
refseq_invertebrate          3,759,889   3,511,561        3/12/2018
refseq_mitochondrion         78          124,917          3/11/2018
refseq_plant                 4,767,311   4,343,610        3/11/2018
refseq_plasmid               7           734,837 (nr)     3/11/2018
refseq_plastid               58          237,574          3/12/2018
refseq_protozoa              952,904     990,178          3/12/2018
refseq_vertebrate_mammalian  5,429,739   4,657,935        3/12/2018
refseq_vertebrate_other      4,741,519   4,341,702        3/11/2018
refseq_viral                 NA          149 (nr)         3/11/2018
sprot                        NA          557,275          4/25/2018
trembl                       NA          114,759,640      4/25/2018
RepBase library              NA          NA               3/27/2018