Commit a0c600fc authored by alat-rights

add datasets

parent a5a166e7
+354 −0
>sp|Q6GZX4|001R_FRG3G Putative transcription factor 001R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-001R PE=4 SV=1
MAFSAEDVLKEYDRRRRMEALLLSLYYPNDRKLLDYKEWSPPRVQVECPKAPVEWNNPPS
EKGLIVGHFSGIKYKGEKAQASEVDVNKMCCWVSKFKDAMRRYQGIQTCKIPGKVLSDLD
AKIKAYNLTVEGVEGFVRYSRVTKQHVAAFLKELRHSKQYENVNLIHYILTDKRVDIQHL
EKDLVKDFKALVESAHRMRQGHMINVKYILYQLLKKHGHGPDGPDILTVKTGSKGVLYDD
SFRKIYTDLGWKFTPL
>sp|Q6GZX3|002L_FRG3G Uncharacterized protein 002L OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-002L PE=4 SV=1
MSIIGATRLQNDKSDTYSAGPCYAGGCSAFTPRGTCGKDWDLGEQTCASGFCTSQPLCAR
IKKTQVCGLRYSSKGKDPLVSAEWDSRGAPYVRCTYDADLIDTQAQVDQFVSMFGESPSL
AERYCMRGVKNTAGELVSRVSSDADPAGGWCRKWYSAHRGPDQDAALGSFCIKNPGAADC
KCINRASDPVYQKVKTLHAYPDQCWYVPCAADVGELKMGTQRDTPTNCPTQVCQIVFNML
DDGSVTMDDVKNTINCDFSKYVPPPPPPKPTPPTPPTPPTPPTPPTPPTPPTPRPVHNRK
VMFFVAGAVLVAILISTVRW
>sp|Q197F8|002R_IIV3 Uncharacterized protein 002R OS=Invertebrate iridescent virus 3 OX=345201 GN=IIV3-002R PE=4 SV=1
MASNTVSAQGGSNRPVRDFSNIQDVAQFLLFDPIWNEQPGSIVPWKMNREQALAERYPEL
QTSEPSEDYSGPVESLELLPLEIKLDIMQYLSWEQISWCKHPWLWTRWYKDNVVRVSAIT
FEDFQREYAFPEKIQEIHFTDTRAEEIKAILETTPNVTRLVIRRIDDMNYNTHGDLGLDD
LEFLTHLMVEDACGFTDFWAPSLTHLTIKNLDMHPRWFGPVMDGIKSMQSTLKYLYIFET
YGVNKPFVQWCTDNIETFYCTNSYRYENVPRPIYVWVLFQEDEWHGYRVEDNKFHRRYMY
STILHKRDTDWVENNPLKTPAQVEMYKFLLRISQLNRDGTGYESDSDPENEHFDDESFSS
GEEDSSDEDDPTWAPDSDDSDWETETEEEPSVAARILEKGKLTITNLMKSLGFKPKPKKI
QSIDRYFCSLDSNYNSEDEDFEYDSDSEDDDSDSEDDC
>sp|Q197F7|003L_IIV3 Uncharacterized protein 003L OS=Invertebrate iridescent virus 3 OX=345201 GN=IIV3-003L PE=4 SV=1
MYQAINPCPQSWYGSPQLEREIVCKMSGAPHYPNYYPVHPNALGGAWFDTSLNARSLTTT
PSLTTCTPPSLAACTPPTSLGMVDSPPHINPPRRIGTLCFDFGSAKSPQRCECVASDRPS
TTSNTAPDTYRLLITNSKTRKNNYGTCRLEPLTYGI
>sp|Q6GZX2|003R_FRG3G Uncharacterized protein 3R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-003R PE=3 SV=1
MARPLLGKTSSVRRRLESLSACSIFFFLRKFCQKMASLVFLNSPVYQMSNILLTERRQVD
RAMGGSDDDGVMVVALSPSDFKTVLGSALLAVERDMVHVVPKYLQTPGILHDMLVLLTPI
FGEALSVDMSGATDVMVQQIATAGFVDVDPLHSSVSWKDNVSCPVALLAVSNAVRTMMGQ
PCQVTLIIDVGTQNILRDLVNLPVEMSGDLQVMAYTKDPLGKVPAVGVSVFDSGSVQKGD
AHSVGAPDGLVSFHTHPVSSAVELNYHAGWPSNVDMSSLLTMKNLMHVVVAEEGLWTMAR
TLSMQRLTKVLTDAEKDVMRAAAFNLFLPLNELRVMGTKDSNNKSLKTYFEVFETFTIGA
LMKHSGVTPTAFVDRRWLDNTIYHMGFIPWGRDMRFVVEYDLDGTNPFLNTVPTLMSVKR
KAKIQEMFDNMVSRMVTS
>sp|Q6GZX1|004R_FRG3G Uncharacterized protein 004R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-004R PE=4 SV=1
MNAKYDTDQGVGRMLFLGTIGLAVVVGGLMAYGYYYDGKTPSSGTSFHTASPSFSSRYRY
>sp|Q197F5|005L_IIV3 Uncharacterized protein 005L OS=Invertebrate iridescent virus 3 OX=345201 GN=IIV3-005L PE=3 SV=1
MRYTVLIALQGALLLLLLIDDGQGQSPYPYPGMPCNSSRQCGLGTCVHSRCAHCSSDGTL
CSPEDPTMVWPCCPESSCQLVVGLPSLVNHYNCLPNQCTDSSQCPGGFGCMTRRSKCELC
KADGEACNSPYLDWRKDKECCSGYCHTEARGLEGVCIDPKKIFCTPKNPWQLAPYPPSYH
QPTTLRPPTSLYDSWLMSGFLVKSTTAPSTQEEEDDY
>sp|Q6GZX0|005R_FRG3G Uncharacterized protein 005R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-005R PE=4 SV=1
MQNPLPEVMSPEHDKRTTTPMSKEANKFIRELDKKPGDLAVVSDFVKRNTGKRLPIGKRS
NLYVRICDLSGTIYMGETFILESWEELYLPEPTKMEVLGTLESCCGIPPFPEWIVMVGED
QCVYAYGDEEILLFAYSVKQLVEEGIQETGISYKYPDDISDVDEEVLQQDEEIQKIRKKT
REFVDKDAQEFQDFLNSLDASLLS
>sp|Q91G88|006L_IIV6 Putative KilA-N domain-containing protein 006L OS=Invertebrate iridescent virus 6 OX=176652 GN=IIV6-006L PE=3 SV=1
MDSLNEVCYEQIKGTFYKGLFGDFPLIVDKKTGCFNATKLCVLGGKRFVDWNKTLRSKKL
IQYYETRCDIKTESLLYEIKGDNNDEITKQITGTYLPKEFILDIASWISVEFYDKCNNII
INYFVNEYKTMDKKTLQSKINEVEEKMQKLLNEKEEELQEKNDKIDELILFSKRMEEDRK
KDREMMIKQEKMLRELGIHLEDVSSQNNELIEKVDEQVEQNAVLNFKIDNIQNKLEIAVE
DRAPQPKQNLKRERFILLKRNDDYYPYYTIRAQDINARSALKRQKNLYNEVSVLLDLTCH
PNSKTLYVRVKDELKQKGVVFNLCKVSISNSKINEEELIKAMETINDEKRDV
>sp|Q6GZW9|006R_FRG3G Uncharacterized protein 006R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-006R PE=4 SV=1
MYKMYFLKDQKFSLSGTIRINDKTQSEYGSVWCPGLSITGLHHDAIDHNMFEEMETEIIE
YLGPWVQAEYRRIKG
>sp|Q6GZW8|007R_FRG3G Uncharacterized protein 007R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-007R PE=4 SV=1
MRSIKPLRCCNAHGRHVSQEYGRCTLLLFREKLFLQTGLVCNKQCNAPNNDGAESKHHGI
HHGSRGALALRGAGVHLLASAALGPRVLAGLVPTGRSVQGSVGQCGRVAQIGRARDVAAR
KQESYCEK
>sp|Q197F3|007R_IIV3 Uncharacterized protein 007R OS=Invertebrate iridescent virus 3 OX=345201 GN=IIV3-007R PE=4 SV=1
MEAKNITIDNTTYNFFKFYNINQPLTNLKYLNSERLCFSNAVMGKIVDDASTITITYHRV
YFGISGPKPRQVADLGEYYDVNELLNYDTYTKTQEFAQKYNSLVKPTIDAKNWSGNELVL
LVGNEWYCKTFGKAGSKNVFLYNMIPTIYRDEPQHQEQILKKFMFFNATKNVEQNPNFLD
NVPEEYYHLLLPKSWVEKNLSDKYRKIMETEHKPLVFSCEPAFSFGLCRNTQDKNESYQL
SLCLYEREKPRDAEIVWAAKYDELAAMVRDYLKKTPEFKKYRSFISCMKGLSWKNNEIGD
KDGPKLYPKVIFNRKKGEFVTIFTKDDDVEPETIEDPRTILDRRCVVQAALRLESVFVHN
KVAIQLRINDVLISEWKEASSKPQPLILRRHRFTKPSSSVAKSTSPSLRNSGSDESDLNQ
SDSDKEDERVVPVPKTKRIVKTVKLPN
>sp|Q197F2|008L_IIV3 Uncharacterized protein 008L OS=Invertebrate iridescent virus 3 OX=345201 GN=IIV3-008L PE=4 SV=1
MSFKVYDPIAELIATQFPTSNPDLQIINNDVLVVSPHKITLPMGPQNAGDVTNKAYVDQA
VMSAAVPVASSTTVGTIQMAGDLEGSSGTNPIIAANKITLNKLQKIGPKMVIGNPNSDWN
NTQEIELDSSFRIVDNRLNAGIVPISSTDPNKSNTVIPAPQQNGLFYLDSSGRVWVWAEH
YYKCITPSRYISKWMGVGDFQELTVGQSVMWDSGRPSIETVSTQGLEVEWISSTNFTLSS
LYLIPIVVKVTICIPLLGQPDQMAKFVLYSVSSAQQPRTGIVLTTDSSRSSAPIVSEYIT
VNWFEPKSYSVQLKEVNSDSGTTVTICSDKWLANPFLDCWITIEEVG
>sp|Q6GZW6|009L_FRG3G Putative helicase 009L OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-009L PE=4 SV=1
MDTSPYDFLKLYPWLSRGEADKGTLLDAFPGETFEQSLASDVAMRRAVQDDPAFGHQKLV
ETFLSEDTPYRELLLFHAPGTGKTCTVVSVAERAKEKGLTRGCIVLARGAALLRNFLHEL
VFNCGTGGRYIPEGYADMGDQERTRKMRKAVSSYYQFRTYETFAKSVATMSAEAIRARYD
RFVIVMDEVHHLRSVQAEGVNTYSAISRFLRTVRGCVKMLLTGTPMTNEPGELADVLNLI
LPQDKTIRPEDGIFSNSGDLLKPDELAERVRGRVSYLKAARPDAGLTFAGEVLGGTGMTH
LRLVRLEMSAFQSDAYASAWDQDAGDRNIFSNSRQCSLAVMPDRRWGSAAEARNPSQVRR
MAGQNLAEYSVKYDYLVRVASSSPKTFAYCEYVNGSGLSLLSDILLANGWRRATGRETTP
GKRFALLTASQKNIHKIVQRFNHEDNVDGAYISLLLGSRVVAEGLTFKEVRHTVILTPHW
NYTETAQAIARSWRAGSHDRLKARGEAVAVTVHRLVAVPRGRDTPRSIDSDMYAVSEVKD
KRIKAVERILMTSAADCSLLRSRNLYPSEFDGSRECEYGRCAYRCSNVSVEPGPLPALLG
ASAAEAVAQVRLDGGGDPAIMKVDMSTLWAEVTAGRRYVNRWGDGAVLRAEGGRLELSAP
YGSSEEGRWGDFYKTRNLCYAKMDQDHLRADDLRDSLPQEVEELLTVSPVETIGETASAM
PQEVATAILMACVQARADGKTLNVVRRDALLDFYKGFYAMGPSGWTVWLHARGANAKVYD
GRRWNPADEDTLEFLAARSAKFTDTRIGYYGLYNPNLKDFCIRDVTQGKRDKVDLRKLTV
GRRCVDWDQRTLVHIVARLMKIDGRRDFMPHATLREMRELAEQDPLHEPSDLTSKEACRR
FLFWTQKGDNKFRRQDICKAMEKWFIENDLMEDNFDCGHQHKRRGKFA
>sp|Q91G85|009R_IIV6 Uncharacterized protein 009R OS=Invertebrate iridescent virus 6 OX=176652 GN=IIV6-009R PE=3 SV=1
MIKLFCVLAAFISINSACQSSHQQREEFTVATYHSSSICTTYCYSNCVVASQHKGLNVES
YTCDKPDPYGRETVCKCTLIKCHDI
>sp|Q6GZW5|010R_FRG3G Uncharacterized protein 010R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-010R PE=4 SV=1
MKMDTDCRHWIVLASVPVLTVLAFKGEGALALAGLLVMAAVAMYRDRTEKKYSAARAPSP
IAGHKTAYVTDPSAFAAGTVPVYPAPSNMGSDRFEGWVGGVLTGVGSSHLDHRKFAERQL
VDRREKMVGYGWTKSFF
>sp|Q197E9|011L_IIV3 Uncharacterized protein 011L OS=Invertebrate iridescent virus 3 OX=345201 GN=IIV3-011L PE=4 SV=1
MMESPKYKKSTCSVTNLGGTCILPQKGATAPKAKDVSPELLVNKMDNLCQDWARTRNEYN
KVHIEQAPTDSYFGVVHSHTPKKKYTSRDSDSEPEATSTRRSATAQRAANLKSSPVDQWS
TTPPQPQPQPAAPTVKKTCASSPPAALSVKRTCTSPPPPPVLIDDDTGEDAFYDTNDPDI
FYDIENGVSELETEGPKRPVYYQRNIRYPIDGSVPQESEQWYDPIDDEFLASSGDVVSLE
PSPIAAFQPTPPKTVQFVPMPEEIIVPPPPPPKTVVDEGVQAMPYTVDQMIQTDFEESPL
LANVNLRTIPIEEVNPNFSPVLMQDMVRDSFVFGTVAQRVMASQRVKQFFKELIEQDVSL
AGRMCMDSGSPQLNLYNSLMGVKLLYRWRSSTTFYRAIVPEIDEPVQVMQDVLSSSEWAK
FDSQAGIPPKMVYIHYKLLNDLVKTLICPNFQLTHAALVCVDCRPEAVGSDGLQDGRQRR
CSNLVSEYHEMTLEDLFNTIKPADLNAKNIILSVLFQMLYAVATVQKQFGMGGLFANADS
VHVRRIQPGGFWHYTVNGLRYSVPNYGYLVILTNFTDVVNYRPDFATTRYFGRRQAKVVP
TRNWYKFVPFTTRYRPFVTVDPITQAKTTAYAPNPPTEGITINEFYKDSSDLRPSVPVDL
NDMITFPVPEFHLTICRLFSFFSKFYDSNFIGNDPFVRNLVDRYSQPFEFPDVYWPEDGV
SRVLACYTIEEIYPNWVDGDTDYVIESYNLD
>sp|Q6GZW4|011R_FRG3G Uncharacterized protein 011R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-011R PE=4 SV=1
MTSVKTIAMLAMLVIVAALIYMGYRTFTSMQSKLNELESRVNAPQLRPPVMSPIVPLNFI
ESEDLDKELD
>sp|Q6GZW3|012L_FRG3G Uncharacterized protein 012L OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-012L PE=4 SV=1
MCAKLVEMAFGPVNADSPPLTAEEKESAVEKLVGSKPFPALKKKYHDKVPAQDPKYCLFS
FVEVLPSCDIKAAGAEEMCSCCIKRRRGQVFGVACVRGTAHTLAKAKQKADKLVGDYDSV
HVVQTCHVGRPFPLVSSGMAQETVAPSAMEAAEAAMDAKSAEKRKERMRQKLEMRKREQE
IKARNRKLLEDPSCDPDAEEETDLERYATLRVKTTCLLENAKNASAQIKEYLASMRKSAE
AVVAMEAADPTLVENYPGLIRDSRAKMGVSKQDTEAFLKMSSFDCLTAASELETMGF
>sp|Q197E7|013L_IIV3 Uncharacterized protein IIV3-013L OS=Invertebrate iridescent virus 3 OX=345201 GN=IIV3-013L PE=4 SV=1
MYYRDQYGNVKYAPEGMGPHHAASSSHHSAQHHHMTKENFSMDDVHSWFEKYKMWFLYAL
ILALIFGVFMWWSKYNHDKKRSLNTASIFY
>sp|Q6GZW2|013R_FRG3G Uncharacterized protein 013R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-013R PE=4 SV=1
MANSVAFSSMTWYSPLASDNLYDICVDKVHNRVLCLCHSFGCCTNAVVIWILPSFDEFTP
QTLSCKGP
>sp|Q6GZW1|014R_FRG3G Uncharacterized protein 014R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-014R PE=4 SV=1
METLVQAYLDIQGKIAEFRREIKALRVEEKAITANLFEAMGEAGVESIRISEDRYLVAEE
KPKRTRSKQQFYQAAEGEGFTQEDVDRLMSLSRGAVTGSSSNVKIRKSAPARNEEDDDG
>sp|Q6GZW0|015R_FRG3G Uncharacterized protein 015R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-015R PE=4 SV=1
MEQVPIKEMRLSDLRPNNKSIDTDLGGTKLVVIGKPGSGKSTLIKALLDSKRHIIPCAVV
ISGSEEANGFYKGVVPDLFIYHQFSPSIIDRIHRRQVKAKAEMGSKKSWLLVVIDDCMDN
AKMFNDKEVRALFKNGRHWNVLVVIANQYVMDLTPDLRSSVDGVFLFRENNVTYRDKTYA
NFASVVPKKLYPTVMETVCQNYRCMFIDNTKATDNWHDSVFWYKAPYSKSAVAPFGARSY
WKYACSKTGEEMPAVFDNVKILGDLLLKELPEAGEALVTYGGKDGPSDNEDGPSDDEDGP
SDDEEGLSKDGVSEYYQSDLDD
>sp|Q6GZV8|017L_FRG3G Uncharacterized protein 017L OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-017L PE=4 SV=1
METMSDYSKEVSEALSALRGELSALSAAISNTVRAGSYSAPVAKDCKAGHCDSKAVLKSL
SRSARDLDSAVEAVSSNCEWASSGYGKQIARALRDDAVRVKREVESTRDAVDVVTPSCCV
QGLAEEAGKLSEMAAVYRCMATVFETADSHGVREMLAKVDGLKQTMSGFKRLLGKTAEID
GLSDSVIRLGRSIGEVLPATEGKAMRDLVKQCERLNGLVVDGSRKVEEQCSKLRDMASQS
YVVADLASQYDVLGGKAQEALSASDALEQAAAVALRAKAAADAVAKSLDSLDVKKLDRLL
EQASAVSGLLAKKNDLDAVVTSLAGLEALVAKKDELYKICAAVNSVDKSKLELLNVKPDR
LKSLTEQTVVVSQMTTALATFNEDKLDSVLGKYMQMHRFLGMATQLKLMSDSLAEFQPAK
MAQMAAAASQLKDFLTDQTVSRLEKVSAAVDATDVTKYASAFSDGGMVSDMTKAYETVKA
FAAVVNSLDSKKLKLVAECAKK
>sp|Q6GZV7|018L_FRG3G Uncharacterized protein 018L OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-018L PE=3 SV=1
MQNSKTDMCAALWAVTGLVLNVAVRFALEPFKESMGQGWHTAARVAVNGAIVLALADRLS
DSPVTMTLFVMALSASPE
>sp|Q6GZV6|019R_FRG3G Putative serine/threonine-protein kinase 019R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-019R PE=3 SV=1
MATNYCDEFERNPTRNPRTGRTIKRGGPVFRALERECSDGAARVFPAAAVRGAAAARAAS
PRVAAASPCPEFARDPTRNPRTGRPIKRGGPVFRALERECADYGGASPRRVSPARAFPNR
RVSPARRQSPAEAAEASPCPEFARDPTRNPRTGRTIKRGGPTYRALEAECADYGRLSPIR
SPWSDWSSTGLSPFRSHMRKSPARRSPARRSPARRSLARYTEHLTSDSETEVDYDARNVI
RSQVGPGGVCERFAADPTRNPVTGSPLSRNDPLYTDLMEICKGYPDTPLTKSLTGEGTDD
DTCEAFCRDPTRNPVTGQKMRRNGIEYQMFAEECDCSGISRPSGVSRTSGTSGSSGSSAS
SRPPNSFEAPGASSRPPNSFEASGAARVPGTPSVSRGEPRWMSSISTRHNYDESNPMSVA
FRLRHVKDIRKFLRTVRPGRSGFCATDKGGWLGSAAVSDNVIGQGSWGSVHMVKFRDFPE
EFVVKEAVLMSVSEKHRYKPTVVWDEWAAGSVPDEVVVNNMVTEIAATGMTPFVPLTAGA
GACDSCNPQLLEKAAKVTKCYLQAMEAADFSLDRVLPTMSPDQAASALAQILLGLQSLQT
TLGIMHNDIKAHNILVKRVPPGGYWKVTDSFNGQVFYIPNEGYLCMLADYGVVRLVKPAV
GMDTLYGTRNARFVPRDVGRWGKGAGTEYVVTPIRSKISVVVRGGRFVGVEPNKAVRYWK
NTDTSKVGDVITTNNVFYMGYDIEPDMQVQLDDTNSFPVWESRGDVADCVRTFVGGKRAS
QPGFHRLFYKKTGSAWEKAAETVAKQNPLFSGFTLDGSGLKYIRAATACAYIFPGMAVPR
PGEREIESFTM
>sp|Q6GZV5|020R_FRG3G Uncharacterized protein 020R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-020R PE=4 SV=1
MLQNYAIVLGMAVAVAIWYFFKIEEEAPPGPNPPKPDPPKPDPPKMHMPKKKPHWMDPHL
TGSQTVQYSRNRSMGDPIRGDLPIIPRDDGWFSTAANPAHTLHAGALSMIAPASTGGGLT
VNKLISAYADKGNAMSGRHNSPSYYGSS
>sp|Q6GZV4|021L_FRG3G Uncharacterized protein 021L OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-021L PE=4 SV=1
METIVLVPRQDQETFSDSRPVLDGDLMLEFLENKIRHPVRRRQPRVVPVTSSDPEVVDDE
DDEDQSDDSDEERQRLYFQYMVLKRMYPTEVIPEMTTYSNVAIMREKYKLLTRRLSLDKH
INEWKKYIIVGMCIMELVMTKLNFDASGFARYQIKSLGAYDQLLAEMADKYYEATPQSSV
EMRLMTTMGMNMAVFMLGKLLGGQMDFLGLLENAFGSSS
>sp|Q197D8|022L_IIV3 Transmembrane protein 022L OS=Invertebrate iridescent virus 3 OX=345201 GN=IIV3-022L PE=4 SV=1
MSFVHKLPTFYTAGVGAIIGGLSLRFNGAKFLSDWYINKYNDSVPAWSLQTCHWAGIALY
CVGWVTLASVIYLKHRDNSILKGSILSCIVISAVWSILEYNQDMFVSNPKLPLISCAMLV
SSLAALVALKYHIKDIFTILGAAIIIILAEYVVLPYQRQYNIVDGIGLPLLLLGFFILYQ
VFSVPNPSTPTGVMVPKPEDEWDIEMAPLNHRDRQVPESELENVK
>sp|Q6GZV2|023R_FRG3G Uncharacterized protein 023R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-023R PE=4 SV=1
MRVSQTSWIVSRMLEYPRGGFFYSTDMACMMEGLAEELAGGHKDEVLIVSGRNGDDEVFK
EFPNVRAADGLKGPNSIDPETKLVLIIDVSPTAISNALAATLQEFLIPVWVFCNHTRTLT
ASVTRRLGYKLWPKGTYTPYICEKAGVSEVVTYNQPESEKFVAFMSAARQIMDKRKSKKT
MQELAFLPHLAFAEIAMEGDQEMTPTLTAKKVSDIKDEQVNELASAMFRTGKLSHLDMLS
VPDCVYSCGEALKREVAKAKANRERFVVALRNAQYKKYTAGLLEAGTPVKTFTEVIKNWG
AYDTIFLPMGVDWTYTGGSNLIRMMMTPGSHKTVTFVPESDDVHEFCHNKPTVNTMGVES
AATGLAAELNRRWRRDNPVDAS
>sp|Q197D7|023R_IIV3 Uncharacterized protein 023R OS=Invertebrate iridescent virus 3 OX=345201 GN=IIV3-023R PE=4 SV=1
MGSYMLFDSLIKLVENRNPLNHEQKLWLIDVINNTLNLEGKEKLYSLLIVHNKQQTKIYD
PKEPFYDIEKIPVQLQLVWYEFTKMHLKSQNEDRRRKMSLYAGRSP
>sp|Q6GZV1|024R_FRG3G Uncharacterized protein 024R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-024R PE=3 SV=1
MWQYLPILLMTMISQLEWTVAAVKRYPAGGFITGDKLSRVFEALPWRVAVVSDEPEKYEG
FPILTEEDPAVFEDADCILFAVSDPKCVTGAMKSVFMASSKTAWVVYDGTETRATVRSWM
RRLWRAETYVPLLTHRGFVTDVCVYSQPDSERYVSVMTATAHFYSNRLEVLEEMAFVPHL
AYAKLAMGRYTVLDGCMSVKGSADVAPLNRSMWFLTAAAIPHGEIDTDSLFSDPGAVYSC
GSALREALGSLPEGSTSVVAVRNSSYRKYVRGILGPNFRVETFTNVVKTWGVYDYVLLPM
GISDSYKQGRDLMEKLEMPGGHRVVTFAPENYTVNEVHLNRPLKYAIKRMDLITPMVLRH
VSLNK
>sp|Q197D5|025R_IIV3 Uncharacterized protein 025R OS=Invertebrate iridescent virus 3 OX=345201 GN=IIV3-025R PE=3 SV=1
MNYSVIWAITILILGLVLTLAWARQNPTHPINPLVLNYHTKPSPKRHRMVLVVESFASVD
ALVELVENILSQTIRVASITVVSQRPDHLRQVPLLHQTCTFSRASGLSALFKETSGTLVV
FISKEGFHHFQSPTLLETIDQRGVTAEQTLPGIVLRNTDMPGIDLTTVYRQQRLGLGN
>sp|Q91G70|026R_IIV6 Uncharacterized protein 026R OS=Invertebrate iridescent virus 6 OX=176652 GN=IIV6-026R PE=4 SV=1
MAISFFSDTSYIIKSILLISLFSIIPLEDEVTKLKSSSLRETSELNKEEGITTCLYTFN
>sp|Q6GZU9|027R_FRG3G Uncharacterized protein 027R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-027R PE=4 SV=1
MANFLQDVNCETVSEYDGPDASIPEGVWEGYVGHDHAALWRTWSYIYECCKKGTLVQFRG
GKLVTFSMFDNPRFSNGAGIDAQKVLDLEDRARELQGYGPVNRRTDVMPVDRWTLNGPLL
RYDKMVLEDVGGTGSNRTMVRAQLEALQDERDVPDCDFILNVRDYPLLRRDGTRPYPQVY
GKGRRLPEPWARGGPHVPVVSMCSGPTYADIAVPTYECIAHAYTSSGRTLPAGGRFVKTP
SADSLPAWRDRKALAVFRGSSTGAGTSTEDNQRLRALQISMSRPDLADVGITKWNLRPRK
TERYDGYRIIEPWQFGRKSPYPAAAKPMTPEQIAGYKYVLCLWGHAPAFRLARDLSLGSV
VLLPSRPPGQEGLDMWHSSVLKPWTHYIPVRGDLSDLEKRIEWCRDNDAECEKIAAAGME
ASLNLLGWEGQLDRWMDVLRSVRLECCPGGYDMPPSPSLVSDSMCVRQMVSFPRYEDIPQ
PSSPMPVLPRCSGTLRGWGLAASLGWDLGDAAEVLNVKRSTAVLSKTVFNNLIYRTPHLR
YTFGVAASDPESTAAVILSEKLKGAVTMRSWLEDSRAWARGRNVASVLCQVSQALLEAQA
AAGTVFGDLSLDTILVVPNPLPEYIYHDGTGGSFGLKLMPGDKWAVVTYGDYTRARIRVL
KGDGRKGHLAVVGPQPVYTKLSERKWHDICCLVSCILRTARTSKRPAARALAAAVARAAG
VKRPDMDAEALEATPYEAREEPLTRFGPAEFINGLVREFKLEEGGWAWTEKNKNIEKVLR
PWERGLPLYPVRLWLSGDRKEAMRACVSSVLKAAPPRPATAAGAHHTFQTYLRTVGADLD
SFPEWAAAAAHLKRLWKSPGSLPAGSASLRAPSVPPPCHGPAWALPFGTRTPGEFPSWFD
PSCLGDWTEAMGQGAPLDLENGPAKAGSDPVAVHSAWETASQLSFEEDGWTESEPRPVRR
EAHVRAKERH
>sp|Q6GZU8|028R_FRG3G Uncharacterized protein 028R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-028R PE=4 SV=1
MDPNVLKNLSLMLSRRAGVSGGEPPRMIEWPEYGQRSEPCGSQTVWYVDRPVGAPFIKAF
ASEVEERGGGILIHAGKVTFDSAKKLAAMKEVQVFDVKYFSFDLMAVVPEHSLWKRPGDK
GYPEKTAQSFPKIMASDPVCRYHGFRPRDLVHVKPHDVYIVC
>sp|Q197D2|028R_IIV3 Uncharacterized protein 028R OS=Invertebrate iridescent virus 3 OX=345201 GN=IIV3-028R PE=4 SV=1
MDQYITLVELYIYDCNLFKSKNLKSFYKVHRVPEGDIVPKRRGGQLAGVTKSWVETNLVH
FPLWLSEWDETRWGVLNHYPLESWLEKNVSSKVPVNPVMWNFDSECLVYFFHNGRRTPFL
TPKGVVKLQVFYNLMSGKEVEWFYEISNGFLKPHLHQLSNVRELVRLKHAPVVVGAGGPR
LVTEGVYSLRDDDFVVDCSQIAAVKRAIERGESHQSLRKYQCPLFVALTDKFQDTVKLVE
KKFEVQLNELKAETTIQVLREQLRQEKKLKEQVLSLTQSFIPTIGGRGEEFGKPDETPSS
ASVGDDNFPSSTNHTFEARRRPSSLSSGGALKPSKIL
>sp|Q6GZU7|029L_FRG3G Uncharacterized protein 029L OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-029L PE=4 SV=1
MRRMRSGFKHCAIPIDICRWEYILSPLILQDLQGPQQGGSVAVDVTVRCSVRFVHLPHYG
GFNHGTVQRRVDPDDCRILRQLHIVLSLRLCLIDRDRL
>sp|Q91G67|029R_IIV6 Uncharacterized protein 029R OS=Invertebrate iridescent virus 6 OX=176652 GN=IIV6-029R PE=4 SV=1
MVERLGIAVEDRSPKLRKQAIRERFVLFKKNTERVEKYEYYAIRGQSIYINGRLSKLQSE
RYPKMIILLDIFCQPNPRNLFLRFKERIDGKSEWENNFTYAGNNIGCTKEMESDMIRIFN
ELDDEKRDV
>sp|Q197D0|030L_IIV3 uncharacterized protein 030L OS=Invertebrate iridescent virus 3 OX=345201 GN=IIV3-030L PE=4 SV=1
MHPTLKSNAGEWSQPIVNLFYSNFSGNCKALLQYIDNAGITDHIPIKFINVDNPTMRSVV
SAKISHVPALVVLQDDQMSLYVAESVWEWFDNYRTPPPLADGATVDSQASENGEKEAQPT
PPKEGLLTVLELAKQMRKEREQQT
>sp|Q6GZU6|030R_FRG3G Uncharacterized protein 030R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-030L PE=4 SV=1
MSLYLLLGLKILRYLKMVIVLRCHSAFLLSVKFLREKRRLKMYLGIMLGF
>sp|Q6GZU5|031R_FRG3G Uncharacterized protein 031R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-031R PE=4 SV=1
MDTPCKLFCIELKEGYVPGTVSHNHMMPYFLAGSGWPVEITFHAATVELKTQEDFPPAIG
IGIHNMTGVPVVETPHSGRMHFVFIFHSKSGRFSATYKCIPVPVVVRDYKTVASVSLTTL
SLEDIVGVKLFGTACDRSS
>sp|Q6GZU4|032R_FRG3G Uncharacterized protein 032R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-032R PE=4 SV=1
MVTVTELRATAKNLGIRGYSTMRKAELEEAIRDHGRVSEARVASPRRSPARSPRKSPAGR
KSPSKSPAGRKSPSKSPAGRKSPSKSPAGRKSPSKSPAGRKSPSKSPAGRKSPSKSPVRK
SPSKSPVRKSPRKSPAAKLQAGDRPASMNICKNLPKQRLVDIATEMGIDLNRESDGKPKT
KDQLCADIMGGAGRKSPRKSPSRSPVRKSPSRSPVRKSPVRSPRKSPVRVPSPVRSPVKE
KTPVRSPARSEDAGSDLAPRPRRGKAVRLDYDEDDDYSYGASTDNLFSGNKEIPFPTRKR
RTRKPEKVFVDVRSPHTLTDSEDEDDMVEVPELEDKEITMPGVLSPYSDEIVERGYVSQG
GADYINYIYRTEYALESDESFARGARPKTNKRDSDRAVREAAAAAAIARALDRRSQSGND
EPAVRRRSAPTDSSRESRRDREPQRDIAEPQRDIAEPQRDIAEPQRDIAEPRKVRFREAG
SADVRVFERDEPKEYGRVPVRPPLFMPAGEPLQPLKFRPKTPKIDDTIHRAQMVLPSKPS
QKETDNYYKQFAGEAVRPSEPVQWDKDDQVLYHKVPAWDDSSYAAAVSAWPMSVDPKQAE
SVFAEFEQLSAQDSDLIKVRKSIMKALGY
>sp|Q197C8|032R_IIV3 Uncharacterized protein 032R OS=Invertebrate iridescent virus 3 OX=345201 GN=IIV3-032R PE=4 SV=1
MKLMLEIVKNISEPVGKLAIWFNETYQVDVSETINKWNELTGMNITVQENAVSADDTTAE
ETEYSVVVNENPTRTAARTRKESKTAAKPRKMQIPKTKDVCQHIFKSGSRAGEQCTTKPK
NNALFCSAHRVRNSVTSNATEASEKTVAKTNGTAAPQKRGVKSKSPTVIPSDFDDSDSSS
SATRGLRKAPTLSPRKPPPTTTTASSAQEEEDEQQAHFSGSSSPPPKNNGNGAVYSDSSS
DEDDDDAHHTTVIPLLKKGARKPLDENVQFTSDSSDEED
>sp|Q91G65|032R_IIV6 Uncharacterized protein 032R OS=Invertebrate iridescent virus 6 OX=176652 GN=IIV6-032R PE=4 SV=1
MGVYKFCYNKKKEVGQVAVLQKERLIFYIVTKEKSYLKPTLANFSNAIDSLYNECLLRKC
CKLAIPKIGCCLDRLYWKTVKNIIIDKLCKKGIEVVVYYI
>sp|Q6GZU3|033R_FRG3G Transmembrane protein 033R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-033R PE=4 SV=1
MSGIQLDKETILKYSSAALVALSAVVAVMMVSNNSESWKPILVGAVVAASGAAAYQSWWP
KQS
>sp|Q6GZU2|034R_FRG3G Uncharacterized protein 034R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-034R PE=4 SV=1
MSAGHLRKRRYVKVGDIHDMGPILGGVHDVSSPPPNVHYQQQDDHNDPGCMIHYPGEGWF
SSMSTVEKLMLGAVIVAAVVVGVRMFMSSGNSSATSSFSTAPYFMG
>sp|Q91G63|034R_IIV6 Uncharacterized protein 034R OS=Invertebrate iridescent virus 6 OX=176652 GN=IIV6-034R PE=4 SV=1
MKQNLLILLSLLLVVVAIMWWLYEKKKEVPLPPPTPPTPPTPTGVPFLPMYAGLSSPVQY
NPADYLYGWEKYPHGPAWSFGDRVPYAEAKNALGGHFGGGLYSPRDPILESKLGGVYIGN
DLYTVGGVGGDGHW
>sp|Q6GZU1|035L_FRG3G Uncharacterized protein 035L OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-035L PE=4 SV=1
MIWVWPATGRGPGWWGIRRDPWGPEDSSCPCPRLPLSPTGPVGHSGPMGQCHPPVPSYRR
GRRDQKDPPLRRQTSPPLPPHPWDRPLPWVPWIPLDLCRHGDPRHPWDPGAQSGYPRVRE
VRGVPADRPLRPCPRQGPRTAATRKESSCRIPS
>sp|Q6GZU0|036L_FRG3G Uncharacterized protein 036L OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-036L PE=4 SV=1
MTLPDVSGSLGPLSPGTNGTLWAVGPRVVRYQIPALAYLTPGALWTLRTRGTSLTSGPIG
TRDSIRTLHAVHYDVWTLGPLGPLGPTSPRGPSARPCRLQTDSLHSTDARCYRCKMLQMQ
DATDARCKKDMSPFSFPGILEPSHLVGSLKSPRVDPGVPCRPLALWGHPYQCLRLVPLYQ
RCLHPHCFPAAPGRPWDPWCRPDRLDP
>sp|Q197C3|037L_IIV3 Uncharacterized protein 037L OS=Invertebrate iridescent virus 3 OX=345201 GN=IIV3-037L PE=4 SV=1
MNAATSGIQLNAQTLSQQPAMNTPLIHRSFRDDYTGLVSAGDGLYKRKLKVPSTTRCNKF
KWCSIGWSIGALIIFLVYKLEKPHVQPTSNGNLSLIEPEKLVSESQLIQKILNATTPQTT
TPEIPSSTEPQELVTEILNTTTPQTTTPEIPSSTEPQELVTEIPSSTEPQEEIFSIFKSP
KPEEPGGINSIPQYEQESNNVEDEPPPNKPEEEEDHDNQPLEERHTVPILGDVIIRNKTI
IIDGGNETIIIKP
>sp|Q6GZT9|037R_FRG3G uncharacterized protein 037R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-037R PE=4 SV=1
MQVFLDLDETLIHSIPVSRLGWTKSKPYPVKPFTVQDAGTPLSVMMGSSKAVNDGRKRLA
TRLSLFKRTVLTDHIMCWRPTLRTFLNGLFASGYKINVWTAASKPYALEVVKALNLKSYG
MGLLVTAQDYPKGSVKRLKYLTGLDAVKIPLSNTAIVDDREEVKRAQPTRAVHIKPFTAS
SANTACSESDELKRVTASLAIIAGRSRRR
>sp|Q6GZT7|039R_FRG3G Uncharacterized protein 039R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-039R PE=4 SV=1
MTSYCDTLKALAAESDSTGSERATIRMYMAMFSDASLRPAVSDTVASILGTDSLDHEDAE
MMLKFKLLFFSGSANASATSHYPKADDPQRFARSVSRGPSRVRRPARNSASRPVRR
>sp|Q6GZT6|040R_FRG3G Uncharacterized protein 040R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-040R PE=3 SV=1
MIRALCTIVLIAAGVAVALYLSLVYGYYMSVGVQDASWLTALTGNRPDAKVPFFDKAVGE
APEDKVAYTERPYPVSSTQSPTTTQSPTTTTLKPTTMAVLASIGATPTPVVCHNVRGDMQ
GIACNVVMKKTVAAALKVQPEAKKDNVNAQYRYGMWTPLRRSRSPFGVWNIPKKLAIAAP
DV
>sp|Q197C0|040R_IIV3 Uncharacterized protein 040R OS=Invertebrate iridescent virus 3 OX=345201 GN=IIV3-040R PE=4 SV=1
MVTMAIKNFHIQDDRLKNGRGNKTMSESDYNTSDSGGWVLVRKKRDRSTRPPDVVDRWSN
STSTFPMGLDQIKIKRNGCVNTY
>sp|Q91G57|041L_IIV6 Uncharacterized protein 041L OS=Invertebrate iridescent virus 6 OX=176652 GN=IIV6-041L PE=4 SV=1
MNFIRENETKYVLSTYQSMTPKNLMEYLLKYNYDNDCVYIFNNLPKDLQKEVDDLAKEVV
KANDEQIKAQDEQIKANDQKLKQLDVMIEFMKQYNKQLDNDIYLLEHQLENKRELNRQLG
IF
>sp|Q6GZT5|041R_FRG3G Uncharacterized protein 041R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-041R PE=4 SV=1
MRVVVNAKALEVPVGMSFTEWTRTLSPGSSPRFLAWNPVRPRTFKDVTDPFWNGKVFDLL
GVVNGKDDLLFPASEIQEWLEYAPNVDLAELERIFVATHRHRGMMGFAAAVQDSLVHVDP
DSVDVTRVKDGLHKELDEHASKAAATDVRLKRLRSVKPVDGFSDPVLIRTVFSVTVPEFG
DRTAYEIVDSAVPTGSCPYISAGPFVKTIPGFKPAPEWPAQTAHAEGAVFFKADAEFPDT
KPLKDMYRKYSGAAVVPGDVTYPAVITFDVPQGSRHVPPEDFAARVAESLSLDLRGRPLV
EMGRVVSVRLDGMRFRPYVLTDLLVSDPDASHVMQTDELNRAHKIKGTVYAQVCGTGQTV
SFQEKTDEDSGEAYISLRVRARDRKGVEELMEAAGRVMAIYSRRESEIVSFYALYDKTVA
KEAAPPRPPRKSKAPEPTGDKADRKLLRTLAPDIFLPTYSRKCLHMPVILRGAELEDARK
KGLNLMDFPLFGESERLTYACKHPQHPYPGLRANLLPNKAKYPFVPCCYSKDQAVRPNSK
WTAYTTGNAEARRQGRIREGVMQAEPLPEGALIFLRRVLGQETGSKFFALRTTGVPETPV
NAVHVAVFQRSLTAEEQAEERAAMALDPSAMGACAQELYVEPDVDWDRWRREMGDPNVPF
NLLKYFRALETRYDCDIYIMDNKGIIHTKAVRGRLRYRSRRPTVILHLREESCVPVMTPP
SDWTRGPVRNGILTFSPIDPITVKLHDLYQDSRPVYVDGVRVPPLRSDWLPCSGQVVDRA
GKARVFVVTPTGKMSRGSFTLVTWPMPPLAAPILRTDTGFPRGRSDSPLSFLGSRFVPSG
YRRSVETGAIREITGILDGACEACLLTHDPVLVPDPSWSDGGPPVYEDPVPSRALEGFTG
AEKKARMLVEYAKKAISIREGSCTQESVRSFAANGGFVVSPGALDGMKVFNPRFEAPGPF
AEADWAVKVPDVKTARRLVYALRVASVNGTCPVQEYASASLVPNFYKTSTDFVQSPAYTI
NVWRNDLDQSAVKKTRRAVVDWERGLAVPWPLPETELGFSYSLRFAGISRTFMAMNHPTW
ESAAFAALTWAKSGYCPGVTSNQIPEGEKVPTYACVKGMKPAKVLESGDGTLKLDKSSYG
DVRVSGVMIYRASEGKPMQYVSLLM
>sp|Q6GZT4|042L_FRG3G Uncharacterized protein 042L OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-042L PE=4 SV=1
MFAPPSSLFVPATAPAPSTSGFTIPANLRRDAYVCPFATAEKERKEREQQQPASKGLNHD
LAAQEPLHPSLVSRFPSNYRGSFLR
>sp|Q91G56|042R_IIV6 Uncharacterized protein 042R OS=Invertebrate iridescent virus 6 OX=176652 GN=IIV6-042R PE=4 SV=1
MATLQQAQQQNNQLTQQNNQLTQQNNQLTQRVNELTRFLEDANRKIQIKENVIKSSEAEN
RKNLAEINRLHSENHRLIQQSTRTICQKCSMRSN
>sp|Q91G55|043L_IIV6 Uncharacterized protein 043L OS=Invertebrate iridescent virus 6 OX=176652 GN=IIV6-043L PE=4 SV=1
MDLINNKLNIEIQKFCLDLEKKYNINYNNLIDLWFNKESTERLIKCEVNLENKIKFNQKY
NSDTIKIMNILFLICSDGVFGKIENNDVKPLTDEDEKICVKFGYKIMIGCLNDIPI
>sp|Q6GZT3|043R_FRG3G Uncharacterized protein 043R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-043R PE=4 SV=1
MEEVDGCAGPNSEAGALTAGALTAGAFAVTAGAGVAGAGVAGVGWCSWCSWCSWCWCSWC
SWCWCSWCWCSWCWCSWCWCSWCWCSWCWCSWCWCSWCLSKGWEDRGGLEGCKSCKGWCL
CSHCWCWCSWCWCSWCSWCLSKGWEDRGGLEGCKSCKGWCLCSHCRCWSIN
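The file above holds 61 `>` header lines, which lines up with the first dimension asserted in the test further down (`data.X.shape == (61, 3, 5)`), assuming the loader emits one row per FASTA record. A minimal standard-library sketch for checking that count independently of any loader is shown here; `iter_fasta` and `SAMPLE` are hypothetical names introduced for illustration and are not part of this commit.

```python
import io
from typing import Iterable, Iterator, Tuple


def iter_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines.

    A record starts at a line beginning with '>' and its sequence is the
    concatenation of every following line up to the next header.
    """
    header = None
    chunks = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)  # flush the final record


# Two short records copied from the file above, used only as sample input.
SAMPLE = """\
>sp|Q6GZU3|033R_FRG3G Transmembrane protein 033R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-033R PE=4 SV=1
MSGIQLDKETILKYSSAALVALSAVVAVMMVSNNSESWKPILVGAVVAASGAAAYQSWWP
KQS
>sp|Q6GZW7|DEMO Hypothetical one-line record for illustration
MKQS
"""

records = list(iter_fasta(io.StringIO(SAMPLE)))
print(len(records))
```

Passing an open file handle instead of `io.StringIO(SAMPLE)` counts the records in a real file; the second record in `SAMPLE` is made up to show a single-line sequence.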
+354 −0

File added.

Preview size limit exceeded, changes collapsed.

+1 −1
@@ -35,6 +35,6 @@ def test_loading():

  loader = FASTALoader(
      featurizer=featurizer, legacy=False, auto_add_annotations=True)
-  data = loader.create_dataset(input_files="../uniprot_truncated.fasta")
+  data = loader.create_dataset(input_files="./data/uniprot_truncated.fasta")

  assert data.X.shape == (61, 3, 5)
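The change above swaps `../uniprot_truncated.fasta` for `./data/uniprot_truncated.fasta`. Both forms are resolved against the current working directory, so the test passes only when run from the expected folder. One possible alternative, sketched here under the assumption that the `data` directory sits next to the test module, is to build the path from the test file's own location. `fixture_path` is a hypothetical helper, not something this commit or the loader provides.

```python
from pathlib import Path


def fixture_path(test_file: str, *parts: str) -> str:
    """Return an absolute path built relative to the directory of test_file.

    test_file is normally the calling module's __file__; parts are the
    path components below that directory.
    """
    return str(Path(test_file).resolve().parent.joinpath(*parts))


# Inside the test module this would be used roughly as:
#   fasta = fixture_path(__file__, "data", "uniprot_truncated.fasta")
#   data = loader.create_dataset(input_files=fasta)
example = fixture_path("/tmp/tests/test_loading.py", "data", "uniprot_truncated.fasta")
print(example)
```

With this approach the test no longer depends on where the test runner is invoked from, at the cost of tying the fixture location to the test file's directory.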