Available Libraries

GenSAS provides the following datasets as globally available libraries for use with alignment tools or RepeatMasker.  Click here for a file with GenSAS tool and database references.

GenSAS provided library RNA number Protein number Release date
refseq_archaea 1,354 2,936,792(nr) 5/4/2023
refseq_bacteria 28,459 210,034,561(nr) 5/4/2023
refseq_fungi 5,154,195 5,163,994 5/5/2023
refseq_invertebrate 9,708,846 8,686,644 5/4/2023
refseq_mitochondrion NA 240,117 5/4/2023
refseq_plant 8,805,618 8,192,225 5/4/2023
refseq_plasmid 7 2,080,794(nr) 5/4/2023
refseq_plastid 44 1,064,130 5/5/2023
refseq_protozoa 1,083,003 1,164,421 5/4/2023
refseq_vertebrate_mammalian 12,211,560 10,215,066 5/4/2023
refseq_vertebrate_other 15,510,388 13,672,291 5/4/2023
refseq_viral NA 655,246 5/4/2023
sprot NA 569,793 6/28/2023
trembl NA 248,272,897 6/28/2023
RepBase library NA NA 12/24/2018