Available Libraries

GenSAS provides the following datasets as globally available libraries for use with alignment tools or RepeatMasker.  

GenSAS-provided library      RNA number  Protein number   Release date
refseq_archaea               1,162       1,176,138 (nr)   9/14/2017
refseq_bacteria              21,190      74,849,217 (nr)  9/14/2017
refseq_fungi                 2,244,026   2,246,293        9/14/2017
refseq_invertebrate          3,338,132   3,133,716        9/14/2017
refseq_mitochondrion         78          119,014          9/14/2017
refseq_plant                 4,306,005   3,925,705        9/14/2017
refseq_plasmid               7           564,723 (nr)     9/14/2017
refseq_plastid               58          203,822          9/14/2017
refseq_protozoa              952,071     988,745          9/14/2017
refseq_vertebrate_mammalian  5,121,121   4,414,984        9/14/2017
refseq_vertebrate_other      4,372,891   4,011,342        9/14/2017
refseq_viral                 NA          148 (nr)         9/14/2017
sprot                        NA          556,006          10/25/2017
trembl                       NA          93,236,986       10/25/2017
RepBase library              NA          NA               4/28/2017