Available Libraries

GenSAS provides the following datasets as globally available libraries for use with alignment tools or RepeatMasker. A file with GenSAS tool and database references is also available.

GenSAS provided library      RNA number  Protein number  Release date
refseq_archaea               1,364       3,866,671       5/9/2024
refseq_bacteria              28,624      263,898,582     5/9/2024
refseq_fungi                 6,276,189   6,287,924       5/9/2024
refseq_invertebrate          11,551,703  10,324,549      5/9/2024
refseq_mitochondrion         NA          274,328         5/9/2024
refseq_plant                 10,130,093  9,441,444       5/9/2024
refseq_plasmid               7           2,538,603       5/9/2024
refseq_plastid               NA          1,313,283       5/9/2024
refseq_protozoa              1,083,347   1,176,922       5/9/2024
refseq_vertebrate_mammalian  13,759,728  11,507,222      5/9/2024
refseq_vertebrate_other      19,517,099  17,004,973      5/9/2024
refseq_viral                 NA          683,242         5/9/2024
sprot                        NA          571,609         5/29/2024
trembl                       NA          244,910,918     5/29/2024
RepBase library              NA          NA              12/24/2018
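
For readers who want to work with these figures in a script, the rows can be read into simple records. The following is a minimal Python sketch, not part of GenSAS: the names Library, parse_count, and parse_row are hypothetical, and it assumes each row is whitespace-separated with the library name first, counts written with thousands separators, NA for missing values, and dates in month/day/year order.

```python
from datetime import date, datetime
from typing import NamedTuple, Optional


class Library(NamedTuple):
    """One row of the table (hypothetical record type, not a GenSAS API)."""
    name: str
    rna: Optional[int]       # None when the table lists NA
    protein: Optional[int]   # None when the table lists NA
    released: date


def parse_count(field: str) -> Optional[int]:
    """Convert '1,364' to 1364 and 'NA' to None."""
    return None if field == "NA" else int(field.replace(",", ""))


def parse_row(row: str) -> Library:
    """Parse one table row into a Library record."""
    # Split from the right so a name containing spaces stays in one piece.
    name, rna, protein, released = row.rsplit(None, 3)
    return Library(
        name,
        parse_count(rna),
        parse_count(protein),
        datetime.strptime(released, "%m/%d/%Y").date(),
    )


# Three rows copied from the table above.
rows = [
    "refseq_plant 10,130,093 9,441,444 5/9/2024",
    "sprot NA 571,609 5/29/2024",
    "RepBase library NA NA 12/24/2018",
]
libraries = [parse_row(r) for r in rows]

# Libraries that list protein sequences but no RNA sequences.
protein_only = [
    lib.name for lib in libraries
    if lib.rna is None and lib.protein is not None
]
```

Splitting from the right is a deliberate choice here: the last three fields never contain spaces, while a library name such as "RepBase library" does.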