Gene list
Applied filters:
COG category: RNA processing and modification
Gene type: CDS
Number of genes found: 149
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Dictyostelium discoideum AX4, AX4 >DDB_G0272997 DDB_G0272997, MGIEGLNLFMGKQFGCVEKVKEITTDHVYIDLNNYLHKSVKRGDNSKLNT EVNVFRILKSFIDGILRKVRVKHSVFFGIDGPGPRSKMILQRERRLKNGN IDKLKYFINRNKEQQQQQQQQQQQSPQLHDYDYLNENDLKSFDSSFSTLE FTPGTTFMGKLKDFLIFYTKNKLYFAKKIFISASDRIGEGEFKIFEQILN SNYPINDSFTIVSSDSDILLFSLLSKYKNVYIYNKDSEEIIKIDKIRDKI YQQCYKKKQKKQKQQKKSGGEVEGEGEGIATEEKEEGVQIEEEEEEEEEE EEDISKRRQAIVDFVMLTFLMGTDHLPKVSSYNIGSAWSEYCKIKKPLYN EETGLINLDLLFRLIGKSSQPRLFYRNRFVNNNNNNNNNNNNNNNNNNNN NNNNNNNSDVNNNQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ SPQILKSLFNNAGNYYCLSKGISQPTQNFKCEKDKGNGSNSKIKTRYNII IENKIVDSSNDIDYLIRSFLSINHPFWESNINIIKEKYIYKLYNLFTSVK NGCNDNDADDNSEEEDDGCNENDEDEDDNQDEDEDFEENEIENENEIENE NEIENEIENEDGDVKMNEKETITTTTTTAAAYSKKDILLNHYLYGLIWHI NYFGGKCNDFNFSFPIKSVISSEVFKNFPIRFLNNNNNNNNSNNDNNFDI CEIERELLKIQNYSNILKNPLPPVPLLFGILLIDYSNKHLFPTIYHPIYK ETPHFNIMDRKRMEKEFRKSDAIDTLTTLFNTKTDKSKLTPYQQSQLQFS PTLFFNILPNRSIEFYEELFNNNNSNNNNNNNSGNISPLKYKLINTFRNN HFFNSELKLQSSDDIKDCLNNLYNNNLYNNSSYNNKNSIFNYNNNNNNNN NNNNNNNNNNNNNNNNNNNNNKHYQNKSNRFNNFKPELEMKAQQVQYQTT QELEDQDQALIQQHLKQYQQQQQQQKSTQTEQPREQPQPKLPKQPKQPKQ QQQPKQQQQQQQQQQQQQQQQQQQQQQQQQQKQPKLPREQRAKQAKPPQQ PQHPKQPKQPNEQQQQPKQQQQQQQQQQQQQQQQQQEIENNFQNIKPIYQ KSQHPQLKKQKQTH >DDB_G0279513 DDB_G0279513, MMTNINELPIPVLIKLFGFVNKINSSSYWFVNYTLVCKLWTTQILPTAWN EMTIATSQPAEPLIEYSEDGILSQFPSTTSEPYKFNKLVMRVGKVSEYID KIKSLTTIDLRNNSSTNYVIDRLAEALKSNKTLTYLNLYNNRLMQKGGTS IANAMKKNQSITHLDLGLNLLGANGGNAIADALKVNNTLVHLDLSSNQLG LRGAGPVVEALKINKSIKYLILNSNQLRDECSLPLADILRSNIGFIELAL NDNEIGSKGGIALAKMLKSSKVLTKLEFGKNELGDDGGLAMADVLKNNKN IKVVRLNWNKLGVKAIKALSESFKTNSTIIQLDLSFNNFGDEGLVCLSES FKQNKSILSLDLSRVASGLVGHKALADSLRVNNTIQTLDLTNCKITNEGG VELAKSLVDNKSISTLILNNNTFSKDTVSELAKTLESNSTITSLSLVHNQ LTIDGVEDLFKSLSTSTNKSLQTLDLTNNLLGSDGGNIIAQHLTKSNLSE LILTNNQLSSQGASSILNVLPQSNLQTLDISNNSIEPDVATSLCSAISNS QILKLNISTNKLDDTVIPPLIQAIQTNQSLISIQISANQFSKESNNKLLY SIQQNKSIYYYDLVEEI >DDB_G0284217 DDB_G0284217, similar to H. sapiens CNOT7 and CNOT8 and S. cerevisiae POP2 which are components of the CCR4-NOT transcription complex MVTLHTDEIKDVWGYNLDEEMEKIRNLVDDYNYIAMDTEFPGIVTRPVGN FRSTSDYHYQTLRLNVDQLKIIQLGLTFSDSEGNLAKPTCTWQFNFKFSL SEDMYAQDSIDLLSRSGIEFKKNEANGIDILDFGEQLMSSGIVLNDNIKW ISFHSGYDFGYLLKSLTCTVLPLDEADFFGSARTYFPCIYDIKYIMKSCK NLKGGLSELADDLDIKRIGPQHQAGSDSLLTSTTFFKMRKMFFENQLDDS KYLNILYGLSSFGPDGTPTNIHTGAGNNPPPQLSNSGSTNYGYSPLSQQN NPNNITNYNSTNPASQNQPTYYNNYPTTPTRYHNSSGPYSPTGNNLSMSQ PNTPSKNNSNDYYHKKS >DDB_G0285269 DDB_G0285269, MYHNFNYIDDENGHIKMLPSPVKPNGGKLRTPTSKATILTADIGHEIREV WAHNLEYEMSLIRELVDIYPCVAIDTEFPGFVNKPIESMRMYPDYNYQTL RSNVDLLKIIQFGITFSDSTGCLPVPTCTWQFNFKFSLKDDMYSPYAIEL LKSCGIDFQRIEDYGIDVNDFSELFISSGIVLNDKIQWICFHGGYDFGYL LKVLSCSELPKSESDFFDLLRIYFPCIYDVKYLMKSCKNLKGGLSGLAED LNVVRVGPQHQAGSDSLLTNSTFFKLREEFFENEIDDHKYKGILYGYNVS QNFHHNGHL >DDB_G0285705 DDB_G0285705, contains a RING-finger that is a specialised type of Zn-finger domain which in some cases has been shown to be involved in ubiquitin E3 ligase activity contains 14 putative transmembrane domains MQQQQNQEEEDFCRVCRNGSTPDNPLSYPCKCSGSIKYIHQNCLLEWIQH SKSSSCELCGHPFRFTPIYSPNAPEFIPSHELFYEALIRFKWYIKKISRI LYIVFCWLFIVPTVTCWIFNFFFGQKWLVPLGRVVMENSGFGSSHHTAVT LFYDFFIGTTLFFWIILASIASYMIIDFIHHKHAEIEIQDEFEFDSDDTY LQNLQQPQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQL QQLQQQPPLQQQNIIQQADNNNNTTNEENVYNNQDLGFQITDDMPIRLRQ RLRQHQQQQQKHLDQIRLQELQEQLAETISRDDQVGQAMIQDQITTLQLQ IQRNQQPQQQPQQPQQQQQQQQQQQPVAAGGEPIQLQPQQNQRVFFRIPG VLEMLRLQDVPEEHENDIENDPNQIDDLEHFIGLSGPLFNILTHCFILIV YNAIFLSTFLYFPYLIGHTFIEKLPFHVKDILKLVDGSISQCVAGVAVGY TILSFTALIILSILIKEKIAFKISTLTHSFIKIGVITIFELGILPIVVGC FIDFCSLRIFGGSIEARLSFALSQKMTFLFSRWIFGIFFMVNFTNLCSIF HQIFRKGVIWFLKDPSDPDFDPFKDMIKLSFKRHLFKVFVSLCAYAIIGL LFVFLPALFLSTIPGFLPINLQVNDPITKGSADILFIVAASFFPKFDTKI TIKNIVKFWITKSSKILSLESYLLPKEQQQNVQTANQQQQQQQQQQQQQE EEVYNEEEIEEDDDEKEPNQQQQQQQQKQGQVQKQQDVIFKPNNFKKRIS LFIFLGWLTLFLAICAYISIPVLVGRLILGPFSNDNDIYCILVGLFCGWV LSKIAFLLFSPSSSINIIQWVSIGLKVLLIGFIMVIVMPLLTGFLFDFIF MVPIMAPYDESFFIHFGDIFQNWCLGALLLKFWYRWINATNQNPDNNRNN VIEDLDQPRDRWIDRFKQFKRNGISNIDLKWTFSKIIFPICHYLFTLLTV PIFFSKFLVPLFGGSLILESISFRYGFAVYCFILLFEKILHKIKQWSSRF PNMIRDDKYLVGKQLHNIDQQQPLKLDDSGNGSNQTIF >DDB_G0289199 DDB_G0289199, MKRSNNDSNGPNKKSKQDDDPLSSILGDIDQSQHGKLQSTSIKSTFSLDS VMNRIIELSKQHGGIDSIEVEGRIGLFSNTSNGNTFKPGMVRDDWNDLYH HLKLKSEPVATTETDYIYSDSIRVAYDEHSKKCLRKDKKTDKTSFDQSTN LIYDFRISTSIEEKFPPPLSLPPGYIIRREKQRYTFTEDQWKIDLTKVIV RPDFNATVEQELYEVEIELFPEAIQACQEKTSLTELLNDFLNAIKGLTNI VKNGGETSFPEISLDKVGNVSEFYRLRDLVFKYIPSAPQRKNDTFPGSMP VNFGKKYFIHVQNNEYFVSDKTDGIRYMLLIDHTGCYLVDRKFDFYQIQG FDILVTLFGEGTLLDGEMVRNLQTKRANFLIFDVLSVKNELHHQKLLKDR LTEIGNVVSTLRSNLKVDTPFDILGKSFQLKSKIVNLFKNIKEYPNGERV YSDGKRCHNTDGIIFTPNIAYSNYTVHTLFKWKYCDKWTIDFKVRDRGQK GWYLSCVANDNIEVDCREVNFSNDDLQKLRREFQRARDTSTVVAECSFQP KWGTWKFHQVRHDKKKGNYISIVMDTMESIAENLSSDELKYRIPLLPHDD NWEEEMSRIRSQLINNIKPSKSSTSTYQPFPQ >DDB_G0289461 DDB_G0289461, putative ortholog of H. sapiens CNOT6 and S. cerevisiae CCR4 component of the CCR4-NOT transcription complex MEETDKNNITTTENSIKNEELPSSSTPPPPPPLPPQSTIVTKSKVKEEPL KIILPENFKFGEAEKSFSIIPGKPITYVPFIFTFKNQNSKLKYSKAIISS KWMIDGENIEKLLHYSNNSSAPTISFTPKKEHSGKELIFEIKISPLIFEQ HENKSNNIFSKLFNKSSSSSSTSSNNSNNNDNEIPIIIEYKHKILFEKSR ELLKINEPLNNNNNNINNQYRIIQYNILADCYVSDSWYTHSASYSLRWNS YRSYLLIEQILQYKADIVGTQEVDRLYWQLFKEMNVRGGYDYYPSYANDS NESPQTTMGGFNNSYREGCFIFFKKDRFNLLQGLEIDYTKLNRPDQKLLK KELVEILIQDPIYKSCITHFLEHSSHHVHHALVLLQDKQTKQKMIVVSKH MYWGSQGYNYHIQCVQIHLFTMILSNFIQVNKLENNIPIVVCGDFNSSPD DSCYNFMTKGLMMNDDHHLTLAGKYPPAFNSSQFDNHPEIKSIKHDFNFL SSYSLRPDGEPKFTIVSRAFTGNIDQIFVSKDRFKVNNVLEIGEKQDYKM LPSLTLASDHILLMTDLELLPSS >DDB_G0289921 DDB_G0289921, MGIPSFYRWLIENFPKVLENNLNGEIKFNNLYIDMNGVVHNAIKLDHSPT SSSSSSSTTTTPPTTPTTTYSKDKEVVLMLKSELSDEKLKERIFYRLDQM VNNVNPSSLLYIGVDGVPPRAKAIEQRKRRFKSSKETVDVIIKQLKSKSK PITRDSIIEQFSLIFDSNSISPATEFIEKVDDWIKDYCKQLSLKRQNLSI ILSDSTVPGEGEHKIMDYIRQNHPILKKDGMSHCFYGMDADLIFLGLESH LSNFYILRDPISLISCSTCKANDDHSNYECRSAIALKKQFQKKSTSVIVR GIPNQCSENDIKQLFSYYGNIKIEKIEKSNTKNKTLNAYIEFENEDIVNE VSTRGSTFTINNERVSIHVEYLDDIFKPKNEKGEIIKDEDEELAEQENPS TEKANQQLLTAEEIAKKQKEEPIPENAVFIVGLDSAVSKFDIITFFEKFG KISSFQLSPSPRFNKQQFVMIKYETQESARLACKSTDIEFFGTIITIKRA QLPKTESNTTINSNGSSKTPPLTEEEKQAKEKVKEKKRIIKDSKIQQVLA KFDPNTPKDVAIFYLGLSSWNVNRAFENYLLFNKAGLEHTILNNTTDNDN NNSNGSGSGKKKKILFDYVNIDSFRTYLEYYFFEKIEKSKREKINFNRVI NDFTVLCFYLGNDFLPHLPSVGIQSGSIELIMCWYREWIQNCLNGDDDIK YIVNEQSTHLNFENCLPLLESLADWESSLYPENLEKQVKKDFIKVNKNST SNESSSIVMVPPEFKYDDLLYYRVKFDLIGKPDDQVKKLVDDMCYQYTLG LHWVLRYYVSGCQAWDWYYPYHYAPLAKDLLNYQKRLSKIQDREMVNNQF NFKLSSPLPPLIHLASVLPRNSVAFLPDSMKHIVNENSPFTASYRDDYKY DFNGENVAWKAIVLLDFMDIEKFKEYLLPIVNNELTDNEKKRNLIGNDIK FKNGEIIQLSPLCNNTTTNNNNNNYEEKEIPCINITKNLSIKDLSYPTRN YFTRSSHPVIDSNYISIPTQQASTVDHNQLVPLTKEQADFLQWRKSKTGF NQSLEILQSIDFSKCGCVNIDKKQISQFSGSGAATATITCWLGRVLNNEI SSDSTIFIQSIDDSQLIINIGFKQAVKLTSIKFVSSSNRVPDRDSVPKVI KIYTNNDQPNIDFSVIESLTPKCTIEFSSPSELESYSSSTPFSFASGTTT TNTNFKSVNNLTIFIESNFSKNQDKVSIIEKIILS >DDB_G0291836 DDB_G0291836, MSDINEDEYNDEEMKAVLKEDSSDSSDDYENNNEDLSNSSDDDDDDDDDG DDSSDDDDDDNMESKTDYENSSEGLEVGLLEIKLREDPYSFEKNLNYINA LSKFTKQSNYQTLREAREKFQSIHPLSQDIWLAWFSDEQKYMKTDNDKQY ILSLYEKALNDFISVKINVSYCKFIIKINTNSGGLINNVKEIRKQFERSL EQCGDDIIESPLLWSEYRMFEQMLLSQIKDDKEKQTQIKIIRDLYHRQLS NPMIGLHSIYNDYQQWEHSQSIDNNNNNNNNQEKEKEEKEKEEKEIKLKF EKSLKQFKEREPFEIALKEKKYLDQRKWKEYIEFEKQQQHNDKPMRVATL FERQLKSFSNHFSIWSFYLTYLEKFTNFKDLHLKVFSRSLRSIYYSGEHW SKYLLLLEERVHNDNDNDKRVKIEQEFQRSLVSGLKSEYDYQLVYNTYID YNWRSIIKKLNTDSNANSNGNGNNNNNNNNNNNNNNNNNNNNNNNNNNNN SISENDKQLMKSLFETMNNQMSTIDVNNYTTVDRYMYIAQFEWRQFNDLS RYREIVDYVLSIDPSQYWIWCQYISFEMEQKQFQSVRELFKKASSHIRFD DPSSRIWQDWFTFERGYGDINQYRAVSDRYSIIQNKYNKEQERYLQQQQQ QQQKQQQQNKRKEKDDGKNKDEKRISKKQKNENHKEKGDQDEFKKPLPPT SKKEKEKEKDLPKKLIILNLSFDTAEPDLHKIFDKYGQIKSLKLVLDKNG KSKGICFILYKSHESANKALEMDQQIIKNRTICVQYSKDQQINDHVESNQ TLTTTATTTTTTNEIDFENHIGLTVFINNLSPSVNKEKLEQFLRHNGVTG IKDIRVVLKARPFAYIDLIDKENLKKALSLDKKYFLSKLINVNLSKPPSS ISPANNNNDNNNNNNNNTITSNGNDTEFIKEIPSRKPTLLVPRGIKNKK >DDB_G0269682 atxn2, similar to the human ataxin-2 (ATXN2) defects in the gene cause spinocerebellar ataxia 2 (SCA2) MSQSKDKKKFVGGGGGGGGNNSGGGGYGSPKHNNNNNNRNSSNNKSPHQS HHNQQHHQQQQQQQQQQQQQQQQPFDSLTAMKERTVFMSMSLVGQNVSVT LKNGDVYEGILHTTSTSTGSSGGGWGVALKMARKKDTNNRVITTLPLPLV IIEAKDFLQITATGVVLDHYRDSFMNRDQQSFITDTELSGFDGNLKEREL TPWTPDPSVGESLDDFAANSEAKKPANWDQFETNEKLFGVRTTYEEEIYT TRLDRDSEFYKINQSVAEKKAQEIENEKSGNIHLLEERGFVEGADYDEEE RYSSVVRKGLLPTSTTSTTTSPPTQNPTPSSSVYIPPSKRNNNNNTPSTP SVTSPPIVDKKHQQTHQDKKQTQQQQQQQQQQQQQQQQQQQQQQQQQQQQ QTQPTTTATATASTSTSTTTTTNESPSSSSTSSTPSTPSTPKNITTTTAT TSTNNTPTATNTNVNSPLGDRESPTISKLRLHQSTIDQDVMGSPRENLSP RSVAYTRYRQILSEPTNKSMNKSGSNISTTPVNGSGNVGPNGTPLLSSVH SDQAPKSPVPTIVSNHGLVKALSLELATPTVPEKFVNDFNNFKLKINNVD RGSETQGLKSFSSNLVIKSKSRPGSPLIGSGSPRPTPTQLSLSGSSTSTN TSTTSPPTTNTTTTTTTATNSTTPSTTEDDKSTTTPITTTILTENKSDDK EKEKEKEKEKVDEKEKEKEKEKSDEKDKDQSSTLVEKKDESSSSSNTTTT TTNTTNNNNNNTTTVTKLSKLKLNPNAKEFVPVVVNKPQPSFKSTTESNT DSVTPINEIYYDSMRKRQLQPESPDQVSLYWVDPFYPRYEEDPYAAAYQM RAHHHMVGHQPPPQLQFNPQFYSQQGHPQLQPHHHMVPPQLQQVPPGVNV HTMKPPGSLQPGGGGVVQPQGIVQPQGIVQPQGGVVQPSAGGAPKTMYQQ QQQQQQQTGQPGGPMGVQRGGHLPPQQQPQQQQQQPPPQFIQGIPPGANL VISNGPPNQPFVFQGAHPPYAVPHPQYPMPPQGIQGGNKRFYQPPPQGYP QVQPMIIPQQGQVVSQNSPQQDSPSNRLNQQVPPYSYMTHPPRGYHPNEN QYH >DDB_G0285829 bxdc5, ortholog of S. cerevisiae RPF1 and H. sapiens BXDC5 a nucleolar protein involved in the assembly of the large ribosomal subunit MVKPKKEVDKDDLTKEELKLRRSPTDIKCKAKRVLLVQKLQAAKKTAREK ARKQRKKEREILGDAAPAKEVPRTIESMRRADETIVDTENDKEFEEEINK DEFESYFDGRVPKIVVTTNQRSTKEAVEFAQVFTKLLPNCEFFHRRKYHL KEIVQFCNNRDYTDIIVVNETKGIIDELTISHLPNGPTAVFRLTNLVMPE DIPGGGEMTSHKAELIVNNFTTRLGHSIGRMFASMFAQDPNFKGRRVCTL HNQRDFIFFRQHRYIFESKEDANVQELGPRFTLKLKSLQKGSFNTSTGEY IHLHQHNMDVDRKKFVL >DDB_G0279311 cdc5l, contains two Myb DNA-binding domains ortholog of human CDC5L and yeast CEF1 may play a role in transcriptional activation and in mRNA splicing MRNVKGGVWKNTEDEILKVAIMKYGLNQWARISSLLTRKSPAQCKARWHE WLDPSIKKTEWSKEEEEKLLHLAKIFPSQWKTIAPLVGRTASQCLERYNR LLDEVQRQQDNENGGGSGGGGTTTTTTTTTGENDPRRLRMGDIDPTPETK PAKPDPIDMDEDEKETLSEAKARLSNTQGKKEKRKFREKQLEEARRLAFL QKKRELKAAGINYNPKKKGKEKSWDISKEIPFYLKPKAGFYDVPDEELRD EPNKDASFIGKRVDQIENPNYLQRQEKLNKLEDIKKSKKEIFNLPQLISE TSKSNDVEHSIKRTKLQLPEPQLTDDDIQEISDYEKLNGSGSGGGSGGVG VGEFPLPAPRTASISSTAANNNTNNIRTPMKQDTIMSEAQNLLALSNAQT PLKGGAGPNVSQTPLPKSVNNSTPFRTPNPLANQTPTQHNKKQSLNDSNE FAIEDKFKRQQGKNQLLSNLKNLPSPTIEYKLELPSELPTIEDDTTLELD NSEIHIREQQQLKHKEQFKLRNRSTVLKRNLPRSRNLFPINKNNNNNNNN NINQDELRILKEINRIISHDNKTFPNDSITPSSTFDDDDDDDNHHHHHDD IDNNSINDNDEKYENYDYFTNTELEFADKLIRDEIEQIKQELKQPLPSSN EILEEIDQIRSQFIYLPKENQFIEKSNANQTQLIENLQFEYDKTLNKIKN SSMKSVNLEKKLNIYNGGYQNRSNTIIKNIDDMFDQLEQSEIEYQCFVAL KNNESIQMEKRLKSIENQVYDQCEIESRLQQKYAQLLNEKNLLKKKLSIF >DDB_G0285507 clp1, ortholog of the conserved CLP1 protein S. cerevisiae CLP1 is involved in both the endonucleolytic cleavage and polyadenylation steps of mRNA 3'-end maturation Mammalian CLP1 is a subunit of cleavage complex IIA which is required for cleavage but not for polyadenylation of pre-mRNA MSNDNSVNINNFSSMNGGGGGSDIQFPLKPSQQQQQQQQNSINQSTIRTL EITQELRYEIDFDQNGWMKLIEGTAECFGTELSLNKVYKLSGTKGAVFTW TGCKIEITNNCQPYIGEKTPMPQYAGVYQELDAFRVSILDEPKKSGPRVI IVGPTDSGKSSLSKILLAYSARSGYQPLFVDLDPGQGSITIPGTISAAHI QNPLDIEEGLAGGIPLAHFYGHTSLDVNPDLFKALCKNLASFIDKQLDSS NISRISGFIANTCGWIDGLGYKILLQNIDVFKANLIIVMDNEKLYSDISS HYSQKDNSIKIIKLPKSGGVFIRPPVFRKKTRMNRIKEYFNGINDNLSPH YIVLDFKDVSIYRTGGGPAAPASALPIGTSSQIDPLQITEVYPSLDMCHS IFAISYAKQASNIFHSNVAGFLYVSDIDMETKKITVISPAPGPLPSRFLL LGTLKWMEN >DDB_G0281585 cpsf1, ortholog of the human CPSF1 and yeast YTH1 the 160 kDa subunit of the cleavage and polyadenylation specificity factor (CPSF) complex required for 3' processing of mRNAs human CPSF1 involved in the RNA recognition step of the polyadenylation reaction MSHHQVFQKQVLAPTGVEQCIKANLINDDSINLVLAKTNVLQIYKIRYEK IEKYENVSDSQPQQQQEQEQQQQDITQKKKIELKPSLELIIEKKLFGNIE SMASVRYPNSERDSLILTFRDAKISVLDYDSDLLDFEIRSLHYFEKDEFK GGRNHFKHPPLLKVDTQQRCAVMLLYDRNLAVLPFKKTSSILDDDDDDDD NNDDDDDENNEHDENENENENENIIKKEGDQQTKEKESVDDEFDLLFEKD SSPPPPSTAATAETTTTIKKESNNNQDKEKKNIEIENVKDFCFLHGYYEP TILFLHEPIQTWTSRIAVKKFTCQMTAISLNLLTKAGSFIWNVSNFPYNC EMLVSVPEPLGGALVITANIMFYVNQTSRYGLAVNEYASIDTSTIIGSQP FDFPIDDTLNLVFTLDRSNFVFLESDKFIGSLKGGELLIFHLISDGRSVQ RIHVSKAGGSVLTSCICVLSNNLIFLGSRLGDSLLLQYTEKSITDDQLEH ENFSNPYKKQKTSEVFDLFDENSETNNNNNSNNNNNKENQEKSSSSSIAS KLLEEIEDEEDQLFKEKKNQLKSYQLGICDQIINIGPIGDIVVGQSIDPT YDETIQPNQPEYVPKTLELVTCSGYGKNGSISVLQNNIKPELVMAFELPG ILNVWTVYKEEIEEEHIEKEIKKNTSKKRSRDENNNNEQEDNEQEDNEDN EEEEEEEKMQKDKNWHDYLYLSLKDGTTLIFETGRDLKEVGKFNFKSLDI GNLFGRKRIVVIYQGGIKLINGFDRVIQEIQINEPIKSSYICDPFILLQF HNGTIQIFKGIDEENQLIQFSINSISNNLNQSIFSSSLFFDRNKSFLNIN NKNQKLKLQQQQQQQQPSNEKKKKKDKSRGFLDSDSDSGESSEDEEMKDI KQENENENENENENENENENENENENEIEIKDQDNIYLNIYTTNGSYEIY RLTSQECIFKVSDIKFEYDILGINTNVSQNQILEQVLTPKSSLSKKQLQQ HLQKQKENGINSKNNYNQIQNSEILDIVEISLHNFNNSDPYLFMFNKIGD LIIYKSFKREKNGELRFKKYNHSFILRDSVTEFYQKQQEKELLNGMDDDD DMDDEKKKKKEEEEEENLNRQKRIFEFSSISGKRGLFIGGKKPIWAFCEK GYLRLHSMDSSDNSNSNNSNNNNNNNSNTVETFTSFNNISCQDGFIYFSK EKDVIKICTLSTLMNFENDIAIRRIPTKNSCHKIAYHSEAKCYVVIVSFP QVTQELQEDSKKPILTDDKFQIKLIDPTIDWNWKFIDSFSLQDRETVLAM KIVSLKFTEPDGITRARPFLVIGTAFTFGEDTQCKGRVLVFEIVSHKTQF ESEELGEKRLNLLYEKEQKGPVTALSSVNGLLLMTIGPKLTVNQFYTGSL VTLSFYDAQIYICSICTIKNYIVIGDMYKSVYFLQWKDNKTLNLLSKDYQ ALNIFSTEFIVNQKTLSILVSDLDKNILLFSFEPQDPSSRSGQINQEING NNKNDNRLPKKEQLVIFGTLDGGLNVLRPLDEKIYLLFYHIQSKLYYLPQ TAGLNPKQYRSFKSFSQNFHFSPSTFHQLPKFILDGDLISKFLSLSQSEK RLISNSINSTSDEIIESLKDVFESWNLF >DDB_G0278799 crop, MDAIRAQLDEFLGKDRNLLPKDRIKVENDFNDPDICKFFLCGLCPHELFT NANIRDLGPCSKLHDENCVKQYQNNKDKDKYDYEREWVRVIEGLISDNDK KIKRNKERLLQNPNGDANHHGGPIQQQSISQLDDEEGGLLPDKEQNSKIT ELDLKIQELLKKAEELGEEGQITEAQALMTEADELKNQKVELEKIEQEKN ENKRMSVCEICGALLFVGDKEKRSISHLEGKKHIGFQKIREVMEEYYKSG RRANLGRTDFYNAPPPPRDSYRDDRRSSSSSYHDIDGRRDHRYGGGSRDY GGSDRRGGGNYNNGRGSSRDNYNNINNSRDYRNDHGKDYDRKRERDYYND DDRRKRDRNY >DDB_G0286645 cstf3, component of the CSTF complex which in mammalian is required for polyadenylation and 3'-end cleavage of pre-mRNAs MEDENKEVMDTSQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQPSQDTIT STTTTTTDNSGPTTTVAVSPPTTTATAAVSPTPENQNVTTPTPISPSLSS VNVSPTTAVTITASPTTSTTTTASPTTVAAPTTVGSTPTTTTASPLGNIL SLNMPAVGKRLNVQIETLENRINNDMYDTEAWTLLLNEVQSQPISIARDI YKRFLSVFPTAGRYWKLYVEEEMKEKNYDIVEKIFFENLRSVKNVEFWKS YIAYIKQIKGDKVENREEIIKAFEFALESIGMDISSTSIWTDYIQFLKDE KASTQFEEGQKMTAIRKLYQRAIENPMHDLDNIYKEYEVYENSINKTLAK ALLSDHQGKYQHARNVYRDRKSLLEGILRNMLAKPPRSSDKEEHQVRLWR KLITYERSNPQKFDAVTLRNRVIATYNQCLLCLYHYPDIWYEAATYLADC GDSSGCIAMFDRSLIALPKNLFIHFAYADYLESQKKQPQAKEIYEKILQA NPEPLVWIQYMKFSRRTERIEGPRKIFKRAKSTPDCTYHVYIALGLIEYY INQDTRMARDIFEIGLKKFPSEIAFVNFYIEFLTNLNEENNTRVLFEKLL TWPSLEKSESIWRKFLDFEYRQNQDVSSILKLEKRYQVTVNSNTDKSGVL QALNRYKFLNLWSCHPTEIEIITKNILDDHSDQNKDDHHSHSHAHHHPPH SRHNANKDSTEKDQGAIDGKEGAVAAKLHKKGKGKEIKPVPQIESKPTFS TIIPTSNWKVKKPDITQMVPFRGEIGKFTQSSVVSIPQQQQQQQQPTPIS SQPISQQQQQQQQQPTPLSSQPISSQPSLQTNTQQQGNQPPNRSGLPDFI FYFLQNLPSNQSFMGPYIDPEQLIGIIRDTPLPIQFLNQQQLQLQQQQLQ QQQLQLQQQQQLQLQQIQQHQQQQQQANRTSPTLSNETLIIPNKPQQPQQ PQPQQTQTHKRKQPDDESNNEQQQQPPPQQPPQQQQEQQQQPPTTTTATT SVVSPITTLNSTPISAPTTVSPITTTTIPSTSSPTTTSTTVTAKSQQSAS DIYRKRQANKLSKKS >DDB_G0290485 dlrA, MQNHQNQMNVDHQFYFISGQDQSNNLETQIKKEDIDISDQASYHQPGLDK NNNNNNNNNSNSNSNSNSNSNSNSNSNNNNNNNNNNNNNNSNNNNNNNSN NNSNNNNINNSNNNNSNNNNHPINHMQQHHIHQHLQNYHFQSNNNSNNNN NNNNNNNNNNNNSNNNNSNNYNNHNNNQGNGNQQEPTSSPQISHHNNNNF NKNDNSITNLTNSDSIEESTSSSSSNLMAGSGSSASNQSPRVTSSTTPPN NNNNNNNQSAPNNITYETPEILLKPGERPPFIGSLADIIISNILGKAHKD EPNSQNFTNICSVCSKWRKISVGRLVNYTYQLPPDRSITNLFRNLSNNVY PNLINLQLKVSTPTLFDVSSFVRMLLTKNTTITTLELSQNGIGNKAAHCI GECLLANKTITHLNLSFNSIGNEGAEEISKAILVNTTLINLDLSQNCIGL KGSKALGQALQSTTILQTINLSKNRFGAKGIDFIVESIGKNSSLTEVDFS KNDLNEKSSKYVGEAIRKHPCLASVNLCDTKLSPESMKYISEGIQASQTI AYLDMSRNEFNYKGLKPLAAALSMCQSITYLDLTGDSIGDKGAVQLGDAL AQNHSIINLSLAFNNIGASGATSLGNALKTNRSLEILDLSINPEIGHLGA IHIAEGLAMNKKISKLSMCTNGLGPIGAKRLGEALRQNSTITDLQLRGNE IGDEGCRALSDSLKQNQSITELNLSGNGITNDGAKALCEALWYNQSLASI QLNHNNINTQGVQFMKELLLRSYLVNLDSYFYPPTSTTIVSVLYVTRGRN RGSNNRNNNLNTNLRIIV >DDB_G0288797 fip1l1, ortholog of yeast FIP1 and mmamalian FIP1L component of the cleavage and polyadenylation specificity factor (CPSF) complex required for 3' processing of mRNAs directly interacts with poly(A) polymerase MSEVEEQTLIKDASMNEPTDKTTTEGDNGGENENENENIAEGGENQEDNN NNNEGEEEEEEEEEEEEEEEEDEESDDDDVVVLLDQESVEASSSKPGATF RTTPNKFSYRNPSSITPGSGGKYMLTKQTPTGGGGGGSGFNSAKSNQKTI FEFDIESFEEKPWLKPGADISDYFNYNFTEETWKAYCERQNTMRMELNNL GKIKGYESNKPNIGGNTTGGTGGNGIGDKPNITNGNLGVGVGGGIGGGSS GSGGSIVGDLPPELQGDKLQQGIQRPQFKRQPSRSDINDESNLDQQQQTD GRFNRGGQQQIPPQQPQPQQQQQQPQPQQYQTNYQRDRIYNQDYRGSGRT YSTTNQYEDDSNNNNNGSGSGGGSDRRRNESSSSSSRGDSDRGERERERD RDDRDRSERERSDRDRSERERSDRSERERSDRDRSERERSDRSERSDRDK TSSSSSSSNNNSSSSTRGSDRYDRERGGDRDRTSESSSRGSDRDRDDYKS RSSSNTGGSSSSLRSDRDSRTPSNTSSNYSSSNPSSSSSSSDYKRKNYNT DSNDRSKKRK >DDB_G0302418 imp4, ortholog of the conserved eukaryotic IMP4 component of the 60-80S U3 small nucleolar ribonucleoprotein (U3 snoRNP) required for pre-18S rRNA processing forms a heterotrimeric complex containing IMP3 and MPP10 MLRRNARLRQEYLYRKNLEGADKEDYEKKRRIKKALDEGKPIPTELVDFE FKHRDEMKLDKLDGNRPKSIDDEYARAGIQDPKVLVTTSREPSSRLIQFT KELRMLFPNSQKMNRGAHVVKELVDACRANDVTDLVIAHEHRGEPVGLVI SHLPYGPTAYFEIKNCVMIHDIDEATPPSLAFPHLIFHNFTTPLGERTEN ILKYLFPVPKDDSRRVVTFSNDNDFISFRHHIYEKDGYKNVILKEIGPRF ELKLYKIQLGTLDQDEADLEWVYKPYMNSTKNRLFL >DDB_G0293900 mybA, MQPKRMSHNLTVGEKAGSPFNVILNAANNTNSDNSSNNSDNENDDNQNNN NNNNNNNNNNNEEEEEEDDDDDDDSQQNRHNISIPFTINKNSINNNNNLI NNINNNINNMNNNMNNNNNNNNMNNNINNNGNSNISNNNTPKVEKKKTKG KWTSEEDQILIKAVNLHNQKNWKKIAEHFPDRTDVQCHHRYQKVLHPNLV KGAWTKDEDDKVIELVKTYGPKKWSDIALHLKGRMGKQCRERWHNHLNPN IKKEAWSDEEDQIIRDQHAIHGNKWAEIAKFLPGRTDNAIKNHWNSSMKR VSNNNVHLKSHAIEHSLSSQDNQDSPKSIITSSSPIPTTTTTTTTTSTTL ITPPPPPLLPPPPSINKKEKKIKQPKKRNASEIEQTLSQPHINQHESSPI VFENISNGNNKIDIPAAQYLMTNGISCINNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNINNNNNINNNNNNNNNINNNNNNNNNNNNNNNNNNHSNV SANTNNNNNTINNNVSLQPPSQLNSNIAQLPSTPKNLAHVNIANKLNSPG ELMANIVTPIKFFQTATVSMNSSSKNYNNENTNNNNNNNNNHHHHHHNNN NNNNKRPRLDFSSATPTKNNESFSADLCSQFPDILFSPIQNKNNKESFLD NSGLSPLRSPLHTNFFETPMKNYEYLDFNSPKIPNTISPLKNFNSPFNKI NNHSNYNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNHNPNHNSH NHNNHNNNHNHNHNEFAQPQPPQGNYQQSPYKNNSTTASSTLSTPSYSNN SSISSSSCSSSSSNSASKATKVIQLSSSGIDKNSLLTINAKLNNNHHKNY LDSSSSSSSSSSSSSSSSSSSSSSSSSSSSAASSSSTPNNQSDLATVPFT PDDNVFNNNNNNNPPTPGKTKFKSRFSPNSKPYSYPQEYDNYGQSSSTPQ NINNTYNSICLGTNNNNNSNSSASNSFENNEENNNENDNNGSSSGGDKVP QMDSSFMALKLLKDNPNKSLFSKARKILGLGNISSSSLSPSSFVQQISNS ASASSTPTSSSSTPLSSPTTTTSSVAAAIINKISSTPKYFSNQNQNQNNN NNNNNNNISNNSNLSAFSTPGGNDHVPNFESNSFLAQSPFQDILNSQKLD QLHQLTQTNQLSKSKVLLRNHKNSNQTESNSRSTIESINSNGSNNGNNSG SSNSGSNSDKNNGKPISFRLMESSKPIEGL >DDB_G0268368 mybAA, MELTTEAPFINNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTPNNTSYSNY PQSGYVYQNYPLNNNNGGNNNNTIYYNQPQQYDPNYQTSVTSNSYPQYSY FASPNIISPIPSPCLGSTPSPIPSPTIYQYSNNSNNYCAINTPPLTSVPS PILNCNNKKRPEFNNNNNNHNHNNNNNNNNNNNNYNYNNSNNNQQQQKQQ QQQQQQQQQPQQPQQQSQQQQQQQQQQQQQQQQQFKQTNINTTPKNLSPV LQSVNSSASSTPQIQSYFQQPQYQQQYQQQYQQQYQQYQQPQQLSSANTT PQTNERPNYSSIIQPNQLFSQMVPSFINGAINNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNTTTNNNNNNNNNNNYNNITYFQPYTPFSIVDNSSMI VPDKQPQQQQPQQQPQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQNYN DDSNKNNNNNNNNNNNNNNNNNNNNNNNNENINSSNNNNMYNICPAAAYQ NIQFIKEQSNSSLSSSQPIPPFNLYNEQPQQQPQQPQPTQSQPILSSSST SVFDINHHHHHQQQQQQQQQQQQQQQQQQQQQQQQQQQQQPQPNLSSSSY ADNNNSFQSSSGNVWESQSSPIQSSVQISSPPQSNQSSIAPAPAVNLSAS ASSVATTTKQSNVKKQKQQQQQQQQQQQTKRQELSDSEDDTDNGDDIDED DEDDDEDEDDMEDEDTSSSSSSSSSSSSLSKKSPAVKKSGLKKSGRSKSS SNESKAKGHWTKEEDEKLRSLVDLHGTKRWKYIASLLCLRNGRQCRERWS NQLDPSIKRDAWTLEEDRIILDAHSKYGNKWAEISKLLPGRTNCAIKNHW NSTMKRKLSKKQYDFSSLPPISSSIVSDNSSSLSTPTDSISSSPSTSPIT LSSNVVVNDFDSQQQQQQQQTYQQPPPQSQDSGNNQFNFNNNNNNNNNNN NNNNNSVESIKLYTNVNISYI >DDB_G0289319 mybQ, MLYCGSTGGHYMIMTTNNNSNNNNNNNNNNNNNNNNNNNNNINQNHQHQH QHHHHQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQNYGESTTSTSMIPP SITTSLTPLTPTLSSQPQNIQQQQQQQHHHQQQHHHHHQQTQQQQQQILS PMMGSKRKLEEDMGVGQPTSNLTNEYLMSNNNSLKPILVSSPLLTPLSAS PGLTSMQMAFANASLSAPSTPLSMSPSLGPCSAPMSPSKKSKNSRSSSKS KYQNSEERWQSTTSKDNGKPSSPGIVKGPWKDEEDAKLVELVNKCGPKEW SSIAAKIPGRIGKQCRERWFNHLSPEVRKTNWTPEEDKIIIDAHASLGNK WTAISKMLDGRPANAIKNHWNSTLLKKIGGDSKSLNKEKDDDDDDDEDAE DGSSPVLSPISLYQSSSSTTTTTTTTTTNSSEKSNIPPFALSGSTTTSTN NLNNSTNSTNSINNNNNNNNNNNNSSNTNTAITTNEPLVAPKIIRAQTTP NSSPSLSSKKTHDKQKVPQSPKNSKQQQTQQQTQQQTQQQLQQQQQQQQQ QQQQQQQQQQQQHIQQQQMPVISQQPHIQQLPIDQQIQNAQFQQFHLQQP SMPSSTSVNIIPNQSSMEQHHYQQQQQAQQQQHQQQQHQQQQQHQQQQQQ QQYLMQQHQQYQQQYQLMQQHYQQQLSQQHHQQQHHPQQAQQHHPQQVQH HQQHINTTHNQHQQQQQQQQQQQQQQTNNSQVNSNNTDTTFSNSHPIEPH EVPYYYDYWGSSTSGSQTLIPSDNTTNTYTIENNNDFLLFDDNQHRIQPL QQHQHIKQEHAQMSHLPYHPTQPNNLNTTTTTTNNNNNNNNNNNNNNNNN NIPTPNMSASTTGTINHHIHQTHHHLTTPLSQSTPSVSTDQNNYINYDIS SLFSLPNEI >DDB_G0288259 papA, RNA polymerase that specifically incorporates ATP at the 3' end of mRNA MNKNGGPPVANITTSSTTITSTTTTQAKSQLPSSLSVNNLHTTQGSTDQP TILGVTEPISTAPPSSIDFKLSTELENTLISFNLFESPEESRKREEILGK LNQIVREWAKQVSLKKGYPEQTASEVVAKIFTFGSYRLGVHGPGSDIDTL CVGPKHIMRSDFFDDLSDILKVHPEITEFTTVKDAFVPVITMVFSGIPID LIYAKLALTAIPEELNDLIDESFLKNIDEKSILSLNGCRVTDQILKLVPN IPNFRMALRCIKLWAIRRGIYSNILGFLGGVSWALLTARICQLYPNSAPS TIIHRFFKVYEIWKWPAPILLCHIQEGGILGPKVWNPKRDKAHLMPIITP AYPSMNSTYNVSKSTLQLMKSEFVRGAEITRKIETGECTWKNLLEKCDFF TRYSFYIEIDCYSMNEEDSRKWEGWIESKLRFLISNLESTPKMKFAVPYP KGFTNNLHKANNPDQICTSFFMGLSFNFSNTPGADKSVDLTKAVTEFTGI IKDWLRTQPNPDTMDIKVQYIKKKQLPAFVKDEGPEEPVKTTKKRSSTGE PSATRKKLKSENSDNKLNSPKSPITTNINSTPTTSTPTTTANTTTNTTTA TTTTTTTTVPITSTPTSNISSPTMNSTELTTPTSTSTTTSNDSITTPPTT TTINSVQPPSAQPTENGSSTSNSPTSTSINNTALPPNPTTNSESTIETTI TLPTTLESQTSTLKDSNEISTNGTAVATEPTITSPSVNINESSTSTSTTT TTTVTEQQIQTAPTTATPINKTIVNTMEVNELSFISSSSETSQSKPPPKK PTISIIRGN >DDB_G0283543 prp40, ortholog of prp40 which in yeast is a U1 snRNP protein involved in splicing MSSDWVEAIADGKKFYYHKVTRVSVWEIPEDLKSPAPSSNDSNSNNQPVI IGDWKEYKTDKGQKYYYNTISGVRQWDAPPEFQQKLASTTTSTSTSSPQL SSSGSTTITTPIQPITTSATPQQPQQVNSNNNSNNNNNNKDLKESKDSNI NTNNLDTQQQQQQQQQQQQQQQQQQNKEDPIQTFKNLLTDNSISSICTFE KALKSIANDERYQVLKTMSERKQVFLDYQVDRKKVEQEEKRKKEKKAKED FIQLLRDSKEVTPLMSWRRASLYFESEPRWEAIESERERESLLHDHIQEL EQQEKNQLMSIKKEQMKILRQKLELDPSITVFTQWRKVRDQFENDDVFQV LDKFDFLTVFENFIRDLEKKLDDQKRLEKEKLKKDSRKDRDNFRELLNEK FKNGELHALTKWKIFKLNNENHQSFINLSQKSIGSTPLELFSDFKDELEI KYENDYKKLKEILKETNFKYSPESTTLESLKSEFSKHSNYNLIQEFNFLP YLEYLKYKEESREKNLAKKKKKRISQFKILLTETKVINKSSQWSDIQPII ESKKEYIDLGDDQERLRIFKDYIEFLVQNALDEEEDGNEEGELVLSPKKP SNDQSSSKKRRSYIDNLDDEDRYGTGSGSGSGSGGGSGGSSSGGGSGRDS RDSRDRDRGSDRGDRRDDRDRGRSSHKKEKR >DDB_G0274229 prpf8, central component of the U4U6-U5 snRNP complex contains the PRO8NT PROCN PRO C-terminal and Mov34MPNPAD-1 domains found in pre-mRNA splicing factors of the PRO8 family MDDTNSNINQSNESQHLEEKAKKWIQLNNKKYSEKRKFGAVEIRKEDMPP EHLRKIIKDHGDMSNRRFRDDKRVYLGALKYMPHAILKLLENIPMPWEQV KYVKVLYHLSGAITFVNEIPFVIEPIYIAQWATMWVTMRREKRDRTHFRR MKFPLFDDEEPPLDYSDNILDNEVEDPIQMELDENDDSEVIDWLYDSKPL VNTKFVNGSSYRKWRLNLPIMSTLFRLASPLLSDLTDSNYFYLFDDNSFF TSKALNMAIPGGPKFEPLFRDVDDDDEDWNEFNDINKVIIRNKIRTEYKI AFPYLYNSRPRKVKTPTYHTPNNCYIKNDSPDLPGFYFGAALNPIPSYKT SGNKNEQSEYGTEDDEFQLPEEIETILSKTEIEHDNLANGIQLYWAPRPF SLRSGTTRRAEDIPLVKSWYKEHCPSEHPVKVRVSYQKLLKCHVLNKLHH RKPKAQTKRNLFKSLKATKFFQSTEIDWVEAGLQVCRQGYNMLNLLIHRK NLNYLHLDYNFYLKPIKTLTTKERKKSRFGNAFHLCREILRLTKLVVDVH VKFRLGDADAFQLADAIQYLFSHLGLLTGMYKYKYRLMRQIRMCKDLKHL IYYRFNTGAVGKGPGCGFWAPMWRVWLFFLRGIVPLLERWLGNLLARQFE GRQTKGMAKTVTKQRVESHFDYELRAAVMHDILDMMPEGIKANKSRIILQ HLSEAWRCWKSNIPWKVPGLPIPIENMILRYVKSKADWWTNIAHYNRERI KRGATIDKTASKKNLGRLTRLWLKAEQERQHNYLKDGPYVSAEEAVAIYT TTVHWLEKRRFSAIPFPQTSYKHDIKILTLALERLKEAYSVKSRLNQSQR EELSLVEQAYDNPHDALARIKRHLLTQRTFKEVGIEFMDMYTHLVPIYDV DPFEKITDAYLDQYLWYEADKRQLFPNWVKPSDNEPPPVLIHKWCQGINN LDQVWETSQGECVVLLETQFSKVYEKMDLTLMNRLLRLIVDQNIADYMSG KNNVVINYKDMNHTNSYGLIRGLQFASFIFQYYGLVLDLLVLGLERASAL AGPPNLPNSFLTFPSVQTETAHPIRLYSRYVDRIHVLYKFTADEARKLIQ KYMSEHPDPNNENVVGYNNKKCWPRDCRMRLMKHDVNLGRAVFWQIKNRL PRSLTTIDWEDSFVSVYSKDNPNLLMNMAGFDIRILPKCRTPLDQLAPKD AVWSLQNVNTKERTAQAFLRVDTESQERFENRIRMILMASGSTTFTKIVN KWNTALIGLMTYYREAVVTTREMLDILVRCENKIQTRVKIGLNSKMPNRF PPVVFYTPKELGGLGMLSMGHVLIPQSDLKYSKQTDTGITHFTSGMSHDE DQLIPNLYRYIQPWEQEIKDSQRVWAEYAIKYEEAKSQNKNLTLEDLEDS WDRGIPRINTLFQKSRHTLAYDKGWRVRTDWKQYQVLKNNPFWWTNQRHD GKLWNLNNYRTDIIQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEE SMKYKKLTHAQRSGLNQIPNRRFTLWWSPTINRKNVYVGFQVQLDLTGIF MHGKIPTLKISLIQIFRAHLWQKIHESLVMDLCQVFDQELDNLEISVVNK EAIHPRKSYKMNSSCADILLRATHKWQVSRPSLLNDNRDTYDNTTTQYWL DVQLKWGDFDSHDIERYSRAKFLDYTTDSMSLYPSPTGCLIGLDLAYNIY SSFGNWFLGVKPLVQKAMAKILKSNPALYVLRERIRKGLQLYSSEPTEPY LSSQNFGELFSNKIMWFVDDSNVYRVTIHKTFEGNLTTKPINGAIFIFNP RTGQLFLKIIHTDVWLGQKRLGQLAKWKTAEEVAALIRSLPVEEQPKQII ATRKGMMDPLEVHLLDFPNIVIQGSELQLPFQACLKVEKFGDLILKATEP KMVLFNIYDDWLSTIHSYTAFLRLILILRALHVNLERTKIILKPNKNVIT QPHHIWPTLTEQEWLTVEGSLKDLILADFGKRNNVNVASLTQSEIRDIIL GMEISAPSQQREDQIAEIEKQKTEASHLTAVTVRSTNIHGEEIITTATSP HEQKVFSSKTDWRVRAISATNLHLRTNQIYVNSDNAKETGGFTYVFPKNI LKKFITIADLRTQIMGYCYGISPPDNPSVKEIRCIVMPPQWGTPVHVTVP NQLPEHEYLKDLEPLGWIHTQPTELPQLSPQDVITHSKIMSDNKSWDGEK TVIISVSVAWPCTLTAYHLTPSGFEWGKNNKDSLNYQGYQPQFYEKVQML LSDRFLGFYMVPDRGSWNYNFMGVKHSTNMTYGLKLDYPKNFYDESHRPA HFQNWTQMAPSANDDEENQPENENLFE >DDB_G0282803 rcl1, MLKFQGSTHFRQRIICSTLSGKAIRITNIRDEDEKPGLRDYEASFLRLVD KITNGSKIEINSTGTQITYIPGIIIGGKGITHECGTVRGIGYFVEALICL GPFAKAPLDITLNGITNNDIDLTIDTIRTTTLPIIRKFGIEEGLIIKIIK RGAPPNGGGSVNFKCPIVPHLKAIQLIDEGKIRRIRGIAYATRISPQFSN RVLDKAKGLLLEYTPDVYISSDHYKGNESGLSPGYGLTLVAETTTGCCLS AECMSNTGIATTEQQLQKQKSTSETPEDLGERTAFALLEEIFNGGCIDSH NQSLALLFMVLCPEDISKVRLGKITPYTIEFIRQLRDFFGVTFKIEPDQN SKTVLFTCLGIGYKNMARSTF >DDB_G0273355 rexo2-1, similar to H. sapiens REXO2 and S. cerevisiae REX2 a mitochondrial 3'-5' RNA exonuclease there is a second copy of this gene MSTTPTYNHPVINERSKRMVWVDLEMTGLDISKDVILEMAIVITDAELNV IEKGPNLVIHRSDEVLKNMNDWCIEHHGKSGLTEDVRNSKISLEEAEKIM LEFVRKHTDKGICPLAGNTVHEDRRFLLKEMPTFAEHLHYRIIDVSTIKE LSRRWYPYIPSPKKVCGHRALQDIEESIEELKSYRVTVFK >DDB_G0273741 rexo2-2, similar to H. sapiens REXO2 and S. cerevisiae REX2 a mitochondrial 3'-5' RNA exonuclease there is a second copy of this gene MSTTPTYNHPVINERSKRMVWVDLEMTGLDISKDVILEMAIVITDAELNV IEKGPNLVIHRSDEVLKNMNDWCIEHHGKSGLTEDVRNSKISLEEAEKIM LEFVRKHTDKGICPLAGNTVHEDRRFLLKEMPTFAEHLHYRIIDVSTIKE LSRRWYPYIPSPKKVCGHRALQDIEESIEELKSYRVTVFK >DDB_G0276159 rtc1, ortholog of RTC1 which catalyzes the conversion of 3'-phosphate to a 2'3'-cyclic phosphodiester at the end of RNA MGKNKNYNKNQFKKSKTNNDTTVAQQQQTIEEKPDFKIDGSILEGGGQIL RNSVALASLFNKAISIEKIRYNRDQPGLKNQHKAGIDLMSRLFKAHLTGC SVGSCKLYYQPTQKTIQDDGVIEADTKTAGSICLMIQVSLPCLIFAPHST KMVLGGGTNCDFAPAADYIQNVFLPIATTMGFKCEMSIDKRGFYPKGGGA VTLTTQPLTQPLSPITIVNKGEVNRIVIKSYFTSPRISPLVAERMNNTAK KLIKKDFKKVDVETELIDVSKFSFGDGTFIEIRAYTDQGCIFGATGNGAI GVPAEKVAEDAANSLLKDLQDGGCMDEYLQDQLIIFMALAKGKSQIKTGP ISLHTQTSIHITSLMTGAIFTITPLTNNTQSGEETNLITCEGISYFPSDL NNNNNNSNSNTTTTTTTTTISTTTIDNQNSEEK >DDB_G0293554 sf1, ortholog of the conserved splicing factor 1 binds to the intron branch point sequence (BPS) of the pre-mRNA necessary for the ATP-dependent first step of spliceosome assembly MSPNDVEQQQLPTQNNNYNDNINSPSLNDDDEDSFFREIKEISRGRPKTR DEIQISDRTRLSRWDTPLTNDGVSPFSSIFKTLPPGLTDEQIAALILRLR VDEITKKITIGPIEFTERDRERSPSPPPTYDNNGKRSNTREQRIKEKLQK ERHQLVVTAQQINPTYKPPSDYQPPNEKKTRKIYIPIKNHPEYNFIGLII GPRGNTQKRMEKESGAKIAIRGKGSSRDGKPTKLQFQENDELHVLLTADT VDQLDKAEVLVREFLIPVEEGKNEHKRQQLRELAEMNGTLRERPAYMGNR SWTPVDIKCVQCGETSHPSSDCPLRSNESNQQYIESEYQKFIDEMSKSLG FDISISPNQNDNSLQNINLNNNNNNSNGNNNGNNNGRNFNNDTVGDMDES PPHHTQSHFQQNSPQFDQQQHQQQWNNNNNNNNNNNHNNNNNNNNNNNNN NNNNNNNNNNNNNNNNFSNYNNNNQSFYNNNNSPYGPPNGGSPYGPPRNT Y >DDB_G0293876 sf3a2, MSEYGKAGSGGLQSSQYDNIDRRERQKQLVLEHVDVSKDPYIISNHIGSF ECRLCLTVHNNVGNYLAHTQGKKHQTHLARRAAKEQRENPSVSKNNYIQT TRVIHKKTIKIGRPGYKIIKQRDSKTGQLSLLFQIDYPEIESGLQPRHRI MSAFEQRVEQPNKDYQYLLFAAEPYETIAFKIPNKEIDRTTGPDGKFFTH WDRNKTFTLQLYFKE >DDB_G0270020 sf3a3, subunit of the splicing factor SF3A required for spliceosome assembly contains PRP9 domain characteristic of splicing factor 3A subunit 3 expressed in pstO cells MSSSLLEKTRNLHENFERYELLIENEMKTEPKTTKERVLQSHRVNHYLNS SIECSKSLINIYTDSDHSRKDELTSISGFGTDLYSSFYEKLREIKDYHRK FPNLKEERNNEPLIFTPSISFTGNEMNGKFLDLNENYEKYINLSFNRNKS INLDYLTYLTSYYKFQYNDINRMKSPQYKDYLESVYKYLIHFIERTQPLF ELQSSITKSENEFIEKWNNNEFDPIENNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNSNVNSNNNQDDKLFCKACKKLFTSENVFNGHLKG KKHIQNEEKMNTNKDNKDGDSNNNNNNNNEKWYNKLKSRKDNTMFEYKIN RLSEYLSDQIESTKENVLKKQSRSYTEVVDGVMVGGVGDDEEEDEEVNID DLEVDVEVSKLKIANYPVDWSGKPIPYWVYRYLELGVEYKCEICGNQSYW GRKAYEKHFQETRHSYGMSSIGVPNTTHFHEITKIKDALELWSKIKNQTN QQQFKSDRDEEYEDETGNVMSKKNYDLLVKQGIINPNQKKRSHY >DDB_G0275957 sf3b1, MSDQQDQTMSEWDDTTLNKAKVVEATPRRNRWDETPVSKPSTGVEETPKR RSRWDETPININSGGLSGGVTPNYNAMSNGGVTPIFNNMMDGGVTPVYNS NNNNNSNSNGGSNNNKNILMQTPDPYQAQLQKEIDERNRPWTDEELDNIL PSEGYEILQPPANYQPVIASKKLTASTPIGAAGTSGGFFIQEEQSRGQDF GIIDAPDGITIKPEDKVYFEKILQEGGDNDEHLSPEEQKERRIMKLLLRI KNGTPPMRKQALRQLTDKAREFGPAPLFNQILPLFTSTSLEDQERHLLVK VIDRILYKLDDLVRPYVRKILSVIEPFLIDQNYYARVEAREIISNLSKAA GLASMTSTMRPDIDSPEEDIRNTTARAFAVVASALGIPSLMPFLKAVCKS KKSWQARHTGIKIVQQIAILMGCAILPHLKNLVVIVEHGLTDEQPKVRTI TALAISALAEAATPYGIESFDSVLKPLWYGIRQYREKGLAAFLKAIGYII PLMESSYASYYTKEVMTILVREFKTNEDEMKKIVLKVVKQCVATEGVESS YVREEIIPEFFKQFWVRRMALDKRNYKLLVETTLEIANKVGGGEIIERIV DDLKDESEAYRRMVMEAIEKIVSTLGASDISPTLEERLIDGILYAFQEQT TDETSIMLQGFGTVVLALNTRIQPYLQQIAGTIKWRLNNKSAKVRQQAAD LISRIAVVMMNCGEEQLLSHLGQILYEYLGEEYPEVLGSILGALKAIVNV IGMTKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRIADRGSDFVSDR EGMRICFELLDMLKAHKKGIRRAAVNTFGYIAKAIGPQEVLATLLNNLKV QDRQNRVCTTVAIAIVAETCAPYTVLPGLINEYRIPELNVQNGVLKSLSF LFEYIGEMGKDYIYAVTTLLEDALMDRDAVHRQTACSAIKHISLGVMGLG CEDSLTHLLNYVWPNVFETSPHVINAFLEAVEGLRFALGPNTILQYTLQG LFHPSRKVRNIYWKLYNMLYISSQDALTPCYPRTLDENDNKYQRYELDFV I >DDB_G0284555 sf3b2, MDTTLTETIQNNNNINKSINNKKKLKHQKKKEQKKKQKQQKKEENIFQQT NNNNIKEEIKLENNDENNDENNNNNDDDKITAPIDGFSIDENDPSFELFS KLVKHFDNPYEDPKEIERKEQEEQEEKERKEREEEEERKKKEDNEDDDDD NNDDDDNNNDNEDEDSKKLSNKERKRQRKLHLPILKQLVDRPDVVELHDV NSPNPGYLIAMKSYRNTIPVPAHWCQKKKYLQGKRGFVKPPFELPSFIAA TGITKIREAILEKEKEMKSKQKQRERVQPKIRKMGIDYEVLRDAFFVHQT KPNLSIQGDLYYEGKEFEVNLKNKKPGVLSDELKRALGMIEGYPPPWLIY MQTYGPPPSYPNLKIPGVNSPIPEGAQYGFHPGGWGRPVLNEFGKPLYEN VNNNNNNINNNGDQQQQQQQSHPTREYWGELLPESEDFQEEEEQQEQQGT EEDELQQHQLEDDESIGDGISSVPSGLETPDIVNIKKSRYDQQQQQQQQQ PRELYQVIEQQNKNSSSGGLMESAHRYNIPSVIKQQQQQQQQQNSSRVDV IKSQRSAPVEITLNPSEVENGQEIDEELLKKKYEQATQALQKQRPKEDIS DIIEEQNKKRKNQLQKEEKQKKFKF >DDB_G0276137 xrn1, MGVPRFFRWVSERYPQILQKILDSNPPEYDNLYLDMNGIIHACSQEFANS LIEFSEEELIRQVCNYVDRLFHTIRPTKLFYMAIDGVAPRSKINQQRQRR FLSVHRDEKLKQKLISEGKPVPEVIFNRTAITPGTQFMYNLSESIQFYIK KKISEDLSWREVRVIFSGPENPGEGEHKIIDYIRKNKASPDWDANQSHCL YGLDADLILLGLITHEPNFSILREEISFKPTKRQLDFQLLHISLLREYLD LELRNDDLEFGYDLERIIDDFILIMIFFGNDFLPHLPFLEISKGGLNSIF ELYKSSLPSLGGYLTEGATIDLERLQHFFKFLQKFEKKQQQGIMGSTEED LDDKKVEVAELVEDSVLEHDGLDEEAKKEFERLAMERLKSHFPVSDSEDG EEDPDEQLVNELYNVENSYYRQYFNEFPNTLDEIKAFKEKVVLSYVEGLV WVLNYYHNGCISWVWFYPFYYAPLAIDFNNIPDLHIDFQPGEPITPFQQL LSVLPPQSVDLIPKSYQTLMLDLFSPIIDFYPVEFEIDTKDPHYFDGIAE LGFIDHQRLLDATASIKKSLAQSGQKVFTDEEEARNSIKNAVIIYHDADV DQFVKSPNSNVFKDIEHSSATTEDIVLPTFDNPLPHFRYCADQVLTGVQC PSGFPTFKSLQFTWKYQNSVIDVWGMMSRKDSLIVVPPHQSKQYDCNNID EFSKLKSLIGKKCYINWPYHQEGKILYFSCSNRKLFSKGTTDNSTPQKLA FLDHVKKTKLSLLRKGINIYYDNEKDDGKQQSNGSSTGYTASKLGYEDTN TILVHINKLVGVQTMPNGSTKKRYSDEEDVYPIELMVDYDLIATDSRFEE IDELPFEKRFPIGKKVLVTKKQYFGTIGTVINHYDNQLQLEIKVPSVKMD MNFGHQVAKKEVEYFPIQHVAKLVGTTVSSISQLTAGLFIFKPMVDIGLN LKFTGRQQQVLGYCRGYDVNRGGNIFHQWEFSQEAINLITEYFNKFPLLH QILALFSKPDSNIGVGGAKMSKNVDITPLFATKEEKAEFLQGVEEFIEKS GIRKKRVVPCGTDSLGKEGIEKIEKYYYDQTHQTEYTIQEIHCSTTDIVE PPSYESVISMERHHLIKGLQPKQDLNNSQNGVKSPTLSSQNYSFQSFANT GKFHLGDRVVSILDKGNLPFGTFGTVASIQDQKVDVVFDTECFAGNSLDG YCSEKRGICISKLRLYNLSCPPPPPKSTINKFYDQSIDPAEYWEKVQTQQ NNNHNHGQKKIYSNGGQKLNQQIDQQTPITNAVENKELNWQQLQLINNIS NPQQHNANNNNNNNNYNNNNNNHHHGQNHNQNKVNQHPLAINNPNSVNYP MKRKPTYVKQNFEQQEYADLSKNYPNLEYNFYYDQNDQRKQPQQLQQPKP QQQPQPQPQPQPKQPKQPKQPKQSKQPPQQPPQQPQEPIDPEKLRQQTRT NKRLNLIYQNIEQNSFPGSQSSEHNNGDGSSEEQVQTNPNALLLLNNMFA STSISSDQTNDPDGLPQGPPPHMMGHYPPGPPPMMGYPPHYHPGHSYPPP PPHMMGNYPPGPPPPHMMGYPPHYHPGHPYPPHHPGQQEEHHHHQQQQQE QQQHPTQQEQPNQHPKKKQPKQPKLPKQPNQNQTQPTQDGQQQQPPKQPK QPNPNQTPKQPAQPKQTKQPAQPKQPAQPKQAAQTKQPAQPKQPAQPKQA AQTKQPKQPKQSKQPAQQTSPTTPNPTTETNLNPTIETTSTPPTPTTAQ >DDB_G0269922 xrn2, MGIPAFFRWLIDKYGGLIQETTEPREADGGRSVVDFTTPNPNGEYDNLYL DMNGIIHPCAHPEKGPKPKSIEDMIQSIYEYLDLLFAIIRPRKLIYMAVD GVAPRAKMNQQRTRRFRAALDSRLDKDKEAALWRERIYDGLATQQEYEQY MEEKKNKFKFDSNCITPGTLFMDRVAESLRTYVAEKLTTDPAWKDVKIII SDASVPGEGEHKIMDYVRHQRAQPDYDPNLKHIIYGLDADLIMLGLATHE VNFDILREFIQPIARGVCHKCHKKGHLAIECKEEVDDSVKDFLVKNYQIL HLHLLKEYLELETKVSTPFGFDIDRIVDDFIFLCFFVGNDFLPHLPNLEI KDGAIDRVIKCYKELLPSFDDYLVSNGEVNFPRLSQIFVALTKGEEESFQ RKIQKDAQILKKRQNLSNIVSRPNSLNLNDKTSLSHKEAADKFLAEILKP VEETTQDEEKEEDSRPTKKSKNSSASSTTTTTVKSNKKAAELIKNKLIGG GGGKEKDKEETEQDDEQSASSKKSKKRGLKVIELDDDQQQQLNIVAAENE KKLLNKKSKQQKAAQIQEEADEKEHQENDKPSLFVNSYDDSSLPTEKEIF FDNSRNIKYSEEGWRDRYYQSCFQVENEDDIKQICKSYVEGLVWVLKYYF KGCSSWGWYYPYHYSPYITDISKFFDQFEYPQYEMGEPFKPFNQLMSVLP PASCQFVPKPYQKLMGISVDGEPVEESPIIEFYPRAFRIDRGPTEPLYKG VCLLPFINSIKLLKTISKTEPLLTEEEVDRNTLGHDLMFCHKDSNINSAF KEGSVTQLPENSSKIYGTIEDVSALAKKKMPPLMDAVAFKYNNSSVPNGY SFNYSTLKGSIIPKKSITLFKPRTQNAAINRMVNNSLGGSNQYKDNQQFN GGFGNVGSGGYNNKQIGYNRFNNQNYNNNRYNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNYNNYNNYNNNNNYKNNNYNNNGNYNGNNSNNYNNNNNYN NSNYNNYNNSYNNGNNYNNNNNNGNGYNSNYNNNYNNNYNNGNNNGNNYN NNYNNNYNNGNNNGFNNNNNNNYNNNNYGGYDNNNGFNNNNNNNNNNNNN NSYNYDFNNLNDPSLIDINNYGGGGNNTDLGTLVASNNNNNNINRNQQQQ QPQQQQQQQQKNKFNNNSNTKNQPPAKVPMKYNPFAKSKK # Dictyostelium fasciculatum, SH3 >DFA_12499 g1038, MNAIRAQLDELFGKDRNLAPSERAKRQPHFSDDEICKFYICGLCPNELFV TDPCTKLHDEDCLKLYQDCKDKEQYGYERAWAREMDQIILDNDKKVKKNK ERLILDAAKLEAEEGLNPSDAAKEFAKIMAEIEERIQALLKKSEELGEEG QITEAQELMAQAEKLKEEKLELEKAEELKDHNKKMSVCDICGALLFVGDK EKRSQSHLEGKKHVGFARLRSHMEEHYQSKKSNYQENRGNFNNHHQNNNN NNNNNYNNNNNNNNGHNNNNNYNNNNNNRDNYQRRNYNNNNNNNYNNNGG GAYRNNNDRGDRDTRSGGGYSRNDHRGGDYNNRNNNNNNNNGNYHKRERE YKDRDQPSDRERDNYSRR >DFA_11037 g10619, MSSSPPVTGNFINNMVIDTTHAAPFNNSGGLPNASSTSTSTSTNNTSYLY HKQLHMPTGIEFCIKGNVTSKDTVNLIVAKTTVIQIYSIKYEKNEQTEHQ QQQKQHNSEESDMSFLDSSDDDDDNQKDNSNNNNNNNNNNNNMTDQQKPW LELNYEKNLFGVIESLNCVRFPNEDRDCIILTFKDAKLSILSFNQNTQQL DIHSMHYYERNEFKSGRETFRSPPILKMDYQQRCAVMLLYDRHLVVLPFR HTMSSILDEEDEYEEEHMTKDSNNNNNNNNVNGKSTESSSSSFTSKGNLK DSYIISLKSIGVENVKDLTFLHGYYEPTLLILHEPSQTWTARIAVKKLTS CLTAVTLNLSQKQQSIIWSFDHMPYNCEKLLSVPEPLGGSLVITPNIMFY VSQSSRYALATNSYALIDTTSPTGEPFAIPIDNSSTNLVFTMDCAVYSFL EKDRLLVSLKNGDLLIFHLISDGRSVQRINITKAGKSVLSSNICVLSSTL LFLGSRLGDSLLLEYTEKVIDVDTNNVTENLSNPYKKQKTSEVLFGEDDN PSTSIAEEEDDEADAIFNRSRYQTKSYQLTIRDHITNIGPISDMITGISY ESNKGENEENEVDHTSSRNGKSSGNAVNLASIADRDYELVTCSGHGKNGA ITILQKNVRPDMIISFELSGVKQAWTLYYDPEQSSSKNKKRHLEIGQEQE NEEEEEDINWHSYLVLSQEKNTLIFHTAQELKEVATLSNPTVLAANLFNN KRLVFIHPTSIKVMNGHSSVTQEIKQSPIKYAYVCDPYILLHHQDGNISL YKGNPKDLTISKLDIPFNSSKTTSIMTCSLFIDLNPISNWWTFKKSNNQA GDIYLLILDSNYTLNIYLASDPTTPIFTTNHFNKELDILLNYNNNQQQQQ QQNIQTNNLNFKEISIHYLSDSIWSVPYLVAINEYDQIYLYRGFKNEKNE ISFKKIVNDVYQELPLEQPMPTTTTSNNNNVKEKKKSSSSSSSSSTATSS STPSSTTIINFNVEMNRRRIIPFSNIGNKRGIFVSGVSTPIWIFSEKNFP RIHPMKQQQQTTSSSSSSSSSSKRPITTFTTFHNINCKHGFIYFDHTGML CICRLPDGTNYENEWPIRKLAIRMTCHKISYHPVQKCYVLVLSYPQAPQS DEDEQEEQERELLKKPLVLEEKYQLKLIDPANNWNIIDSFSLAEKETVLC SKIIYLRHADESDIIPKLKPFVIVGTAYTHGEDTVCKGRILIFEIVSHRN QFNTATSKSTTTTTTTTTATTNTEENNNQEQQQDKEEQDEEGKPKKDTES ADEKPKVEEEEEEEEKEQDESPDQKIETEELQPQLQKRLNLLYEKDQKGP VTSIAGLNGLLIMSIGPKMIVNNFSSGSLIGLAFYDTQIFIVSLNTVKNY ILVGDMFKSISFFKLKQKKNIILLGKDYEEVSTYSSDFIVDEKKLSMVLS DANRNIRMFSFDPSDPESRAGQMLLAKSSFHIGELNNKFVRIPMKNTNYD NNSSSSSIIVNDKHLLFYGTLGGGINLLMPINKRFHEILHALETKLMHRG QTAGLNPRGFRYGHHVNNTLGHLHNQYVVDGDLLTKFQSLSPDDAKQLAT SIGSTTPIILDLLNQLHQSYNWVFKIFFFVNITIIVYYQFRLSDAMDLLT QLQDKLDHLFLVFGTCIGVLQRDAPPSSFSEVMNNQPPQLTQEQLTNADN WNSQTKHMALQVIETTKLIESIIESLPGFQRTENEQYQRLKALNHESKLL QQQLQDKENENVIMLQQVKEAIRMLSDETNSRSSTNNNKDHDKMDL >DFA_11745 g11321, MGVPRFFRWASERYPQIIQNLVDSNPPEYDNLYLDMNGIIHACSQEMTTK LIRFSEEELIRLVCNYIDKLFHIIRPTKLLYMAIDGVAPRSKLNQQRQRR FLSVFREEKEKKELIKEGKELPEVIFSRNAITPGTEFMSNLSECLQFFIK KKISEDLSWREIEIIFSGPENPGEGEHKIIDYIRKYKASPDWDPNQSHCL YGLDADLILLALVTHEPHFSILREEIAFRPQANKQLDFQLLHISLLREYM ELELKCELDFGYSLERIIDDFVLVMIFFGNDFLPHLPFCEISTGGLNSVL ELYKNSLNELGGYLTDEAEIDLDRLAVFLAKIAVFERKQNLTVGEAEENE VVEDMLLENDVQDDEEKKEIERRAFERLKNHFGEITFEDDDVEDINDYWI NGYYRSKFPDFPEENRSAIVDYKRHLVLKYVEGLSWVLNYYHNGCISWRW YYPYYYAPLAVDMRDLSSLEIEFESNGPVTPFQQLMSVLPPQSAHLLPAP YQELMTSAASPIIDFYPTEFEVDTSDSHYFDGIAVIGFPDLGRLLEATAA EDSWDLTDKERSRNALRNAVIIYHDAEQVVSEPSPNLRLFPSLEHSSAKT EDFILPFYEGLKPFRLCEGVLLGSHSPSGFPTFAVDVDFTWEYKNAVVNI WGMKSRKESIIVHPPIPQLNKDKTAKQMTLASIKHWIGRRCYVNWPYNTE ALIVGFSDSNQKISMLENVSSTNGTVRLTTQDYLAVQKVSYLDQLKKIPM DYLTKGIDVSEVTSNTILVHVRKIAGIDVEFGGRTVKRYTEKEYQLPIQL MVEYDKVRADSRYLETEEIPFATRFPIGKQVLYTNPDHFGAIGKVLGHAD ESTETLDLELKVGQSGPDLHFGHRVAKEETDEYFPIQHVCKATGLTNQQL SLLTGGLFIDKPMTDIGLNMKFTGRNQQLMGYCRGTTLGDRNGGTYKKWE FSNDAIQLIMDYLKTFPIIQKILVLVSQPSDDKSSSSGGGIRAIDISSLI SEKTDRVALVKSIEEYFDKTGIRRKRYVPCDSLSLSKKSIKRIEDHYGRL AAAAILIPMRTRTQSDKVIEPASYESVISYEKQHVIQKAHEQQLQKERKY GGGSGTSSPGTSSPSKPKQHERQFRLGDRVITMLDKGNVPFGLYGTVVSI QDQKVDVVLDRECFSGNNLEGFCSEKRGLFISKWRLYNLSTPDANYRRAT KSEKEGGYRLGSFPEYDSNEYWNNLKMTNDQDHGVVKIKVNHQANTNMVQ KVQQQTQNLNWQQQQEVYRQNQRKTYHTRQDQFVKTNWETEEDHATYRKT FPNVQTQQLNWQQLEQQQKSAAKQQQKGEKAGKGGKKGPKEQPQPPQQTS GQTPELLQKIFDSSKIVPSTNGNNTAAGESSSSVQQPSQPQQPQQPPQPL MGLIYNSLESQKDGPADHQATPPPPPYGYPPGQMHHPMAPPPFGFHPMAP PPYGFPPMQPGQMHPMPYHPHQQHQQQQQQQGQQPRQPRQPNPRYQKKDK PQNNNNNNNNNDTNNNNNHQQPKSPHHQKQPKSQSPQQTNTNTNTNNTNV HTLKDVKKSSPHPKKDKVWIAKPKTSPTNSPPSTEASSPVAPSQPASTNN NASTSESNN >DFA_11830 g11393, MGIPAFFRWLVDKYGNVITPTKEPRDSDGSRLKCDFSELNVNGEFDNLYL DMNGIIHPCAHPEKGPKPRNTQDMMDSIVEYLDLLFAIIRPRKLIYMAID GVAPRAKMNQQRARRFRAALDSRITKEQAARDLIERLNNGSLSQEDYDAI QKDGAEKYHFDSNCITPGTEFMALVALTLRQYVAEKISTDPAWKDVKVII SDASVPGEGEHKIMEYIRHQRSQPDYNPNLKHVMYGLDADLIMLALSTHE VNFDILREFIAPPKGNRFGTPAPTPSKDVDEDEVKDFLVKDYQLLNLAIL REYLDTELKCNPPFEYNVERIIDDFIFICFFVGNDFLPHLPSLQINEGAI DRLMRIYKELLPTFEGYLTDNGEIDLDRLRKVFVRLSREEEDILLRRKKK EERFAQGRNNRFQQTARTDITGTASSQQLNTQVSERHKQAAASILSDIFQ PVAADTEENNPKKKVKTDHLSNKDAAQQLRENLAKATSKGTKDQLDATNR EAAALLQQKMSKYNEKKRVVKVIDIQQTPEAAETKDDNTKKRKKKETEDK SAAEDQDDSLEKKLFIHKMKDVNIGSEGWRTRYYEHHFETEQPQNELVRA ICQSYVDGLAWVLRYYFHGCCSWGWYYPYHYAPFILDLAENGSMIVEPQF ELGAPFRPFQQLMSVLPKASGQFVPKPYRTMMGISIDGEDRNDVILHFYP NEFLIDVAPGQPTWKGVCQLPFIDENELLSALKPLDNSLTEDEAFRNSHG TDLIISHNSTSIGSTTDLPRNLEKPDQILGTISPVIPGVEKRLPPLLSAK VFTYKNPELVNESTCGILPGAILQKVNLEQYRTRGIQNSSANRMINHELG SNQYKNRNQQNGGNFNSNFNNNNFNNNNNTSNQNYNNNGNNRNYNNNQNN YNNNNQNYNNQNNNQNYNNQNYNNNNQNYNNNNWNQNQNNMGGNNWNQNQ NYNNNNYNNNQNYNNNMGGNNWNNQNNNMNNWNNNNNNMGYNNNNMGGYI NNNNNYNNNNQQQQQGIVNMSVEQKQTVLNNMMLMMKQYNEGANNMTPQE YNQMSTMMNMLLQQQQQDLQLVQNQNQFNVGRQNNNNDNHHNKKFNNNNN NQGGYNNQGGNKNYNSNQGGNKNYNNNNNNNNYNNNNNQQQQGGNKMKYN PFAKSKK >DFA_01263 g1264, MDRQDISSFIGGTDLAAAPIPTVIFNLPKEHELRFEVEHGETALIKLIEG NAEYFGTELLLNREYKLTGCKGAVFTWNSCKLEVSQSTKAYIANETPMMQ YARVHKIMDDIRISCLSNRESGPKVIIVGPTDVGKSSISKILLGYSTRLG YAPTFVDLDPGQGSITIPGAVCASLVDKPVDIEDGLTNSLPFVQYYGHTS LDANPTLFKALVSSLATSIEKRMETNEQARVSGVIINTCGWIDGLGYEIL IDSIDIFKANLIVVMDNDKLYSELNKKYTGAIKVIKLPKSGGVYLRSPIF RKKTRMSKIREYFYGISGDLCPHFTILDFKDIVVLKTGGGPAAPSSALPI GAQSVIDPLQLQEVTPSTDMIHSILAVSYTKSKQSILKSNIAGFLYVTEV NLETKKMTVLAPCPGLIPSKFLLMGTLKWLE >DFA_01671 g1656, MSQKEKGGKKPYASFNNNNNNNNNNNNNGGKLNNSGGYNNKQSSPSNNNN KSSPSNTNNINNNNNNNNNNEDLEVKKMKDRSLYMALCLVGYQVSVTLKN GVTYEGLLSSATTTSGAGWGIVIKMARKKEVPPPAIITTPPTPLMVIESK DFLCLTATGVVFDNLSTYQQGSGRGSASFQSDTDISGHDGVVRERELTPW MSDGSEHESLEASALNKQNASWDQFSTNEKLFGVTSSYDEDLYTTSLDRQ SDSYRNRLRDAERIAQEIEGKTSSNLHMQEERGQVKGSDYDEEERYSSVV RQPDAKAGAKQQAGGNLSSSAGVYVPPNKRNQQVQQQTPQQTQSAPVTPL TKSSSSTSLKDENNTKPAVATAGAAAAVGTSTTSTTTSSTPTKDEKLTQQ QQIFKESESNSANSLKTSGNGINSDNSPVTKLRFPVRERTNSIDHNDLVS SPRDGQSPRTLANYTKVRAAIVSEKMRNSEPRSPLCSPLVSDPVGLSALS LHSSKPTFTENTIKEFNEFKLIKSEVDRKAQMEQLKSFSRDYNISRSRPS SPLIGPNSPRLANITALSLSPTNLDNKDDSKTEDVKKEATAATTTTATTT TATTTTSKLKLNPNAKAFTPGSLSANAPVFTPKGLTLKPAAIPASAPHDF SGMPGGDIGRSTSNNTDSTTPINELYYESMKKRQQNPENPESVSSYWSDV PSYRQQYGGGEDEQYGSPAGGYSMRPPIIPMGVVPIPPYYPSPPPMVAPP QIKSMKMPYNGQPRSYPQHQGVPVASNQPLGPPPYAVFQPQFPPPFAVPA MYNPPPPQHGVPKRYYPHQNSYQMQPHMMIPPQNNSQSPSPSHQSPQIPS PTSPTHSRIITSQPAFIPAAYQNYGVPPRYQNDPNQGYPPN >DFA_01687 g1674, MTNIDNNGITTTNGSSSNGTTTTSNTKIDKKKQKLQKKKEQKKRQKQKKY QDKQQQQNGSHSNSSNNVEIDGDEPPAPVDNMMIEEDPDFVIDESDPTFE LYNKLLKHFDNPTHQDDDQDVQFTDQEENQDTDQQENDEVEEEEEEEEEE EEEGKGKSKKLSNRERKKQQRLNLPILKQLVDRPDIVELHDTNSPNPAFL INLKSCRNSVPVPIHWSQKRKYLQGKRGFVKPPFELPEFIAATGITKIRD ALLERESQKKTKTKQRERLQPKMRTMNIDYQILRDAFFVHQTKPKLTGQG DLYYEGKEFEVSIKKNKPGQLSTDLKNALGMLEGYPPPWLIHMQNNGLPP SYPSLKIPGVNAPIPEGAQYGFHPGGWGNPPMDPSMFAQQHHDKTVRGSL INQEERERWGQLVPEEEYEDDEEEGDEDDEQEGDDHEDGMPPPPPPLSAQ QVQDGTMSIPSGGSETPDVVDIRKQQYNNNNNGGGNMLPQFQQQKQLYQV VEQSSRQLGQGIMESNYKYNLPTNIKMNTPSSSSSSTASASGARKVDLIK GHKSAPVEVTFAPNEVEDVELDEELLKKKYEQAVSGDKSSQRKEDYGDDD HKKRKAKSQDDKQKKFKF >DFA_01698 g1684, MGEEDKQSYGNNWGDDDEDDFFRQITEIQSNYDRGRPRTREEISADNRTR KNKWDVDKNPAVSLPGIPKTIPPGLTDDQLSSLLIRVRIDEITKKLVTGP IEYDTKEDRSRSPSPVYDNTGKRTNTREQRTRDKLAKERHNLVTNAQQIN PNFKPPSDYQPIHKKKTMKIYIPVKDHPEYNFIGLIIGPRGNTQKKMEKE SGAKIAIRGKGSLQDGKVSKPQYAENDDELHVLLTADTQDQLEKAAVLVR PYLVPVEEGKNEHKRQQLRELAEMNGTLRERPAFIGGKGWSAVDIKCVHC GEISHPSSDCPLKTNPNANMHLIEAEYLKLLSEIKDIIGLDDNYQYKNQN INNNNNINNNNNNMNGYNNNNNNNFNNYGHFEPQQQQQQQQQHYNNGNLN MNNNAYQSPPYGDDQQQQQQQWNNNNNNDMMNHQQQQQLPHHHQQQQQQW GQKPPQQQSPYGPPGSSPYGPPPTNSSPYGPQSGWQ >DFA_12550 g1756, MIASSRTGNLLNLFSKPLKGVSVSSHTSSISHHRHSTRQFSSLSLSSSSS TLSYCGNRSNIRLLPSQTVSFYNHHHRYMTSNNNNNQPIVHNPPKRDMSR DNRLIWVDLEMTGLDITKDRIMEIACIVTDDNLQVIEAGPDLCVYIDDAA LDGMGKWCKEHHGDSGLTQRCRESKISIQEAEKIMVEFLAKHVHKGMCPL AGNTVHEDKKFLLKEMPLFAEYLHYRIVDVSTIKELARRWYPNVMEKAPV KRYLHRSLADIEDSIEEMKFYQKHIFIPKDE >DFA_02161 g2110, MTSNGITSPSALSNGGKATTTTTATAAAVASASATNIALNGSTGKMPIHN ISSPPISSTSSTAPLQSNGSSNQQQYYGVTEPISLASPTSIDTKQSTELE TTLRGFNLFEPPEESRLREEVLGKLDAIVKQWAIKVSVLKGFTEQMASEV IAKIFTYGSYRLGVHASGSDIDTLCVTPKHIMRADFFGTLADVLCVHPEI TEFTPVKDAFVPVIKMIFCGIPIDLIFARLSIPAIPEDLNDLIDENYLKN VDDKCIVSLNGCRVTDQILRLVPNVTTFRMALRCIKLWAQRRGVYSNVLG LLGGVSWALLTARICQLYPNAAPSTIINRFFKIYDGWKWPSPILLCQIQD GGQLAAKVWNQKRDKSHLMPILTPAYPSMNSTYNVSRSTLSLLKSEFSRG AEITKKIDSGERKWTDLVEKGDFFTRYRFYLQIDVSAPEEDTHRKWEGWI ESKLRILISNLEQTPNMKFAIPYAKSFANKASAVNGGICTCFFMGLQFNF STAIGADKNVDLTGAVTSFTNLIKDWPGKLPTIEMKIHYIKKKNLPVFVK DEGPEHPPKQKTAAKKRNVSGNLVNNNNNQNNTAAGESTSTTTTTTPTGT ATPPPSSTDAKKKVKTDHPPSQTTTPTAASSTSSPPSPLATDHVAVVALP TTTETAAVSNQPNISPQPILPISVSDNNNNISIDGDNVNMNEIKDSNIDE QLQSPPSEVPTIVAASTTTPTNTTPSTKKADNSSPTEVSELDFISSSSAP KPDVPKGPQPKKAAISLIRG >DFA_02244 g2194, MGDKTTTTSEWDETPSTKTTAAVAATPRRNRWDETPQKLATSTIEQTPKR RSRWDETPVTISGGMGGSATPQIMSGGIGATPRFDVSSTPNVLMHAGMMT PDVHQLRAEKELDERNKPWTDEDLNAALPSEGYEILMPPSNYQPIMTPAR KLMATPAAGVGGGFFMQEENRSQDYGVSETMTQGGLPIKPEDKQYFDKLL KVSDEDEEMLSPEELKERKIMKLLLRIKNGTPPMRKAALRQLTDKAKEFG PAALFNQILPLFTSQSLEDQERHLLVKVIDRILYKLDDLVRPFVRKILSV IEPYLIDQNYYARVEAREIISNLSKAAGLASMTATMRPDIDSPEEDIRNT TARAFAVVASALGIPALLPFLTAVCRSKKSWQARHTGIKIVQQIAILMGC AILPHLKGLVEIVEHGLTDEQPKVRTITALAIAALAEAATPYGIESFDSV LKPLWYGIQHYREKGLAAFFKAIGYIIPLMDASYASYYTKEVMGILIREF KTNEDEMKKIVLKVVKQCVGTEGVEAQYIRDEVLPEFFKCFWIRRMALDR RNHKQLVDTTVELANKVGGAEIISRIVDDLKDESEAYRKMVMEAIEKIIS TLGASDINPRLEEQLIDGILYAFQEQTTDETAIMLQGFGTIVLALGVRVK PYLTQIAGTIKWRLNNKAAKVRQQAADLISRIAVVVQMCEEEQLLGHLGQ ILYEYLGEEYPEVLGSILGALKAIVNVIGMTKMTPPIKDLLPRLTPILKN RHEKVQENCIDLVGRIADRGADFVLEREWMRICFELLDLLKAHKKGIRRA AVNTFGYIAKAIGPQDVLTTLLNNLKVQDRQNRVCTTIAIAIVAETSAPY TVLPGLMNEYRIPELNVQNGVLKSLSFLFEYIGEMGKDYIYAVTPLLEDA LMDRDPVHRQTACSAVKHMSLGVQGLGCEDALVHLLNLVWPNILETSPHV INAFLEAVEGLRIALGPAVILQYTLQGLFHPARRVRDIYWKVFNMLYVSS QDSMIPAYPKTIDDGLNTYQRYELEYIL >DFA_00019 g22, MSEYGKAGSGGMQSSQYDNIDRRERLKKLAMETIDISKDPYVISNHLGSY ECRLCLTQHNNIGNYLAHTQGKKHQTNLARRAARDQKDNPNNHFNKSSSA MSHRPRIIPKKTIKIGRPGYKIIKQRDPDTGQLSLLFQIDYPEIEQGLQP RHRFMSSFEQHVDHVNKDYQYILFAAEPYETIAFKIPNKDIDRTTGPDGK FFTHWDKNKLSFTLQLYFKESSNKDQQQQTQQQQPPPTPTTRINRNDMYR CRLVWNSNNKLIIPFIIFQEKQKEKT >DFA_02618 g2559, MIRRNARLRQEYLYRKNLEGKEKDVYEKKRKIKQALNEGKPIPSELIEFE FNSRKEMSIEGEEDYKRLNIDDEYARAGVLDPKVFVTTSRDPSARLTQFA KELKMLFPNSQKMNRGAHVIKELVDACRANDVTDLVIAHEHSGEPNGLVV CHLPYGPTAYFEIVNCIMIHDIQDAPPASLAYPHLIFNNFTTPLGSRTEN ILKYLFPVPAQDSKRVLTFSNNNDYISFRHHIYEKDGHKNVLLKEVGPRF ELKLYKIQLGTIDQPEADVEWVYRPYMNSTKNRTFL >DFA_02670 g2613, MSSSLIERTRNLHESIERYELLIVGEQANEPKTVKESIIQSHRVNHYLES SIECAKELGKIYKDEDQTRKNEISGITGTGNTVYSNFYENLREIKEYHRK YPNLPVENLNTTLYYTPQISFTGNESYGRFLDLNEIYNQYVNVPKVNRID YVKYLTTFTSFSYDDINRLGIQKYKLYIESLYEYLISFLKKTQPLFDLQK TLSDMDKEFEEKWSNQEFTSKNDGVVVAVENNSENNNNGNGIGGDDDDSN GKEPKDNDKNGEEETTKEEEKKKNDTKTTTIVSLDCKACKKSFTSQGVFN SHLKGKRHIMLQEILDKNSETSKSSSGMLPFKPIVQKEFYISKFGDMLSD QIEDSKENTLKKQSRTLKEIEEDLYADETVLDDDEMDEEPLKLRIANYPV DWSGKPIPYWVYKLNELGIEYKCEICGNQSYWGRKAYEKHFTESRHAYGM SCIGVPNTVHFNHITKIKDAIELYKKIKDQNATAAFNADREEEYEDENGD VMNKKTYEMMVKQGLIKKRKH >DFA_02697 g2639, MRPNIKGGVWRNVEDEILKVSVMKYGLNQWARIASLLTRKSPAQCKARWF EWLDPSIKKSEWTKEEEEKLLHLAKIFPSQWKTIGPLVGRTAAQSLEHYN RLLDAVQQEGGGIGGGEGDGGNNEDVRRLRSGEIEPLPETKPAKPDPIDM DEDEKETLSEAKARLSNTHGKKEKRKFREKQLEEARRLAFLQKKRELKAA GIILKEKQKKKDTKRFDYSQEIAFHKKPLPGFYDTTEESQVDPNKDRQFI NARMDKMDASKTSEDTERANKLAHIKKKKREAMALPDLIKQVNERNDVDM TVKRGKMVLPTPQLTDDDLEEIAEFEKHNNKIAASGSSSATSALVGGFKV PQTPANSVGGSTSSTITARTPLREDNLKAEAKALLAMTTAQTPLKGGANP AFNPADLSSVTPSMTTQRTPNPIRTPNTLKQELLAQSTPASSGFASTPLS TSNKQQQRAERQSLLGQLNNLPKPVNEYEVSLPDDEPTIEEMDEDQDGVV LDESERGIRQDQEFRHKQQTKMKNRSTVLKKSLPRAHSTINNKKDNKDNN NKDRSSSSNNRFVVTEKITDDLENAELLLIQEMNDIILNVNRSFPMIIGD NQQQPQQQQQQQQQQQQQQQQQQQSIIEEVDENYETFTNKEMDQAIILLK KEMESMKQENYNQQQENVDGDKLLKEEFVNNWEKVNEKYVFVSNEIGYME REKVTDDQYVKMLNEEYTHIINSMKTMSKKTAVIEKKMTSDHQPFVQRLA NSAKSISQLHDDLVQASIELQCFRDLASTEKQSLENRHKHLENLVYDQCE RENNLQTRYSKLILKKNQLLSSN >DFA_03304 g3216, MPKYYCEYCDKYLTHDSPSVRRSHIIGKVHQQAVRLYYQQFEADYHKSIT EQRIKEMIKPGTVPLPQGPPMFPQAPPHGMMMGPSPYGMGGMGMVGPPGM GRGGMPMPPHNMMMAQGPPPPYGGQPDFNQPPPPFPFIPKQSFNPFPPQ >DFA_03692 g3576, MGIPAFYRWLVDKYPKSIQYFQQQQQVEEQQEQDEVSSSSTTTTPLKIDP RCNNVKFNNLYIDMNGVIHNSTHAKGEPSVESIMASKPNTDDTIKENIFK RLNDIIVTTNPSDLVYIALDGVPPRAKATEQRRRRFRSAKDIREILSKGP KQPNPRHNNNNNHSSSSPSTSSPSTSSPPITTDESSDPSTPSTPAAATTP AVAQPNYLELLDKVFDSNSISPATEFMNKVNHWIGEYISTVLAKSHPHLA VVLSDTTVPGEGEHKIMDFIRGNHKNWSESTSHIFYGMDADLIFLGLSVH LQRFFVLRDFQSLSYGCSICKSEYHMSYECKSALAMKKLHASNKEGDIDW QSNGINTTKISVRNIPSQASEADIREIFSYYGEIVDIKFEKAPTKKPSLT AFIQFGSVDAVKDIACRGAYFYVNNIKLTIAQYYKDEKVEVESPLIEEQA ENAVVEEDDGVYYNNSIFCTRLELDVTEKDVVDFFKPTGKVESVQFLTTP RRKAHPFKFCIINFGNLDDVKKSLTMDGKYLKGNKVTLKKSRPPAQKEPV VPEEPKPKVVISEEEKAEKEAEKEKRRIEKEEKANQFITIAGHGMNLDSS YFYLGLAEWDFERANELFIKFGRALSKQEDEYLTVERKFEFVNLDNLRQY IKYYLTFGLENESQLDINNCINDITVMCMLMGNDFLPHLPAVSIKDGSIE LILGWYKDWIHNSIKSTGKVNYLTVDTNIIYSNFSELLNVLGNWESVIYP DKLERMGKKDLARLTHILKVSKEREDKEKEEAEANGRMASEFEEERNDIL TSQLANLQEGELNYYKLKLQASTSEVENVKQWVCQSYVRGLSWVLKYYTV GCPDWKWSYNFHYAPLAADLAAYCKEMVVGKGFKDTSHIEFKLGAPLQPL AHLLSVLPVYSGKFLPAPLADLMKSKQLSTFYRERYRIDLNGEEVSWKGV VLVNFIDVSALESLANPIAEQLSKNDPHIAQLNKLGNDIYYLQGDAQSSP DSYKIFLDQQKKEEEESSHTNQSINNSSTASSGLNEKDFKWATRYLITLS PQDSDLYNSLD >DFA_04168 g4009, MVQDPKTFRSGIIIEDWAALARRLARKYGEGRETTETDYLYTQHRVTVDV NGQVIKSEAKIDKTAFDQSTSLIYDVRVSTSIERQLGQLPLVPPPESSSQ RTKFRHTFIDVDNPQWKIDMTLVRTNTGEETYEVELEIFNTYIIDALEKN TLHNLISKFINETKNIITMIQPGRLSFPDIQMTKLEDPRFVDVLKQKVLG YIPDCNPNRTREFPGAMPINFGKKHFPTIQRDMYYVSEKTDGIRYMILIY KGVMYMIDRKFDFFKIDGNDELCKVLHDDTLLDGEMIRHLESKEPMYFIF DILARENTKFGDKLFQERMQHIGKVVGDYRQSVGSGELGKTPFILIAKSF FEKKHISKIFSSIKTNKNGERIFSDQKRNHQTDGLILTPNNAYKAYADQS LFKWKYLDLWTIDFKVAQNSDRKWFLHCAGPNNTDIPCKELMLSTEDFAL LSTDYKRSRDQGCFIAEFSFEFSKGIWKYHLVRPDKKRANYITVFVDTME SICEGITKEELEYRFLCAGGHDNWDVEVEKMRHHIASTLTNKLKQQQQQQ HQQQKIQQPQHHSQHHGQHQQHQQHQQQDDIFGGGSSSNHNYDPHGQR >DFA_04210 g4045, MGVGGLADYISTYYPSVVRFQQQQQHGVGVGGGRPRYDHLAAGSSYMSVG QLRNKLGGRHSNNTRGAETTHLFMDMNSIIHTIFRRNPNTDTSKIYKQIN MRIKQTVDEHFPVKTLFLTTDGPGPRAKIPLQRKRRSKSKEDGISSSLIT PGTMFMSGLKDSLANYFKHSRSVSSAIISASDRYGEGEFKIFEYINSKTW TDQDSVIVFSDDSDVILCSMLSSAPNIIVKGTSSTKCYHIADLKQQLIAS APLINPKQLIEDFVFLNLFRGSDYYPRMDGFNFVRSWTAYLEEKSKKGLY NPKTRSINKELLQKIFNIGEGAVGGGGDSVNSLRQLSWNTSLKNYIAMTW SHLGFKFGKNNIISNNNKNNKEGTTSTTTTTTEPSIVEEIESKLLRPTFS KEGNGQYYMTLDGIKHGPFKVDAKYENVDGWADLSPIVSRAILTEPDNIF LNYYQPQLSAEKYQILKSKRTSFAMEIEDQENASPPDVGQYMQCVIWLME LLKGKCTNFHHRYLPKYSPSINHFSSLSKLNNKELDRFALPLTPLECNIA LTHQKTIHNVHPIFHSIINHSNHFSLIDYIAEQAWNDEESVNNLLKQINE TDTSLLNEKEKRLMTFSPTVIYTKSGNQIYYQEEKLANETNKIPHIYSER KPYIEDIIPPVEEEELPDPLNQQSSAFEPSTAEPFKTFNSFKESQDKLSN LIKSNANHRKTTMLNNFNKNNNNQNNQNINNDTFNINININGKSLIGSIL ESKSTNIKPIPTSFSSSSPIINNYQLVFNLLKKIK >DFA_12844 g5464, MCKKELHHGSSSNPTVFNKEDIPCTSTSNTSPIHSPTLSTTTTSNNNNNN VISPTSPDQRLSLSFITNNNNQNNNQNNNNQNNNNQKNNNNTQNNNNNQK NNNNNQNNICRNVSTNCINFKVIDQLSTPPCIPSKNNSTTTTTTTTTDAT TIINDNTDIISSSSSTTTSTLTPTFQSYRQVDQNFSINQLPTLVIKNILD YVHGGKFPGYHAVVAKSVCKLWWDLTLSTIKSFILVEMWMECRDTEIANW CNQVSRFLDRAYGHSSEEMKFKRRCCVDFIQLKMGVTSPMIEPVLLRIKD YFGVVDLDLSFNRIDDNGAEQIATLLMGQLKGLNLARNRINDRGACAIAK ELANNTTLVKLNLSGNFFGRPMIGRLFRIIQDSNDTLRDLDISGALVAEA MECLEPTPKFSNGPSLTKLTRLNIASTASGRHIDFLFNCRQIANLLVHLN ISDNNIQPSACKIIADALIRKECRLEILIIDDNSIGDIGLYELCKMIPHN RSLRILSLGNNNISFSGVSHLCHALSNPGVKLQELDLSNNSLKSISIPYL SFLFGAHSRNRHLHTIKLRYCQLEDEGADMMSRSLFNNTYIKSIDINGNN IQSKGCDSLAKLIQQNRVIETLCLGNNSIDNVGAAILANAIKQNTTLKML DLENNNINYFGAFPLLEALKVNTTLRDLNLYIVNIPTRNLRFFLNGTIG >DFA_06069 g5858, MSDQWAEATSADGKKFYYHKVTRVSVWEKPDELKTPQELAAGSSSSSSSS SSSSNGAVSSSSGAASSVPVSLPPNWKEYVAENGKKYYHNAITNETKWDL PTADNLHNNNNNNNNNNHNNHHHHHQHHNNNNNNNNSNGDEAPSTTTPTP TPTTYEPNSKEASIKMFKELLQSHDVASSWSFERAQRVIINDERYQVLKT MSERKSAYQEYMVDRKKYEYEEKKKQDKKNREALIKLLKESGEVTSSMTW RRASLYFDGDPKWMAVESEREREDLFRMVVIDLEKKEKEDKDLAKRDLMK QIKAKFEVNLTITSRTQWRKVKEEYENDALISTCDKYEVLQVYESYIREL EKKEDEAQRSEKEAAKKEARIHRDSYREFLNEKYNEGEIHAYTRWKEFYK KYQSHPIVVQLAGQVVGSTPLELFTDFIEELESRYEKDFKRLKTMTQDVN FLFSPQQTTLDDFKQSISTHDKFNSISALNIVPFFEYLKEREEKKQKDSI KKRQKAILNFKALLEDTRTISKHSKWEEIKPTICKIPHYTDLDDEEEKAK IFQEYLDFLSQEESDEEGIIKGDDDINARKEFSSKKRYSREGSDNITDDR KRKESKH >DFA_06329 g6090, MFSKKYKDVFEQEEGDSKSNGNKKQDNYDDYEFDEDELMKDSSDDDDDDD DDSSDDDDDHHQDKDGVHIQSTENVEHLEIKLRENQYNFDLHIEYIDALK KAKLLDRLREARFAAQRLFPLPLSVWVSWLSDEQQLSSPLQEQEKIDLFE KAINDYLSINVWVQYCKFIENQVISNLGGDIKSGEDERLKRVRDMYERAV IACSDHMVDSFKLWNTYRTFEQQVLAMIPTEATEDIKTKQLARIRSIYQR QLSCPQMNLEQTYQDYEQWEQSQVNSSSSSSSSAAATNIQTRYQLALKVI EDRKDYEKAVVDAKTTGEGGSTLEKWQEYIGFEKKDQSKKLNRIAILYER ALQENYFVFDLWKQYLGFLEHDFKAPSATIFSVLERASRNVYWSGDIWSI YMSRLEKYSDKDDMILKVDQVFERALVAGLSGPTEYQHIFSTRFDILWRH QKKEGGAGAPLLDEEKVNMFEQHFQKEYEVLVSLGMDVSESLMFRAKFEA YQLDNSTLADQTFQLLYAGAPHLYHLVDEYIRFKITKQKDIDGAREIYKK AVKTIAETSRIWQDWLNFERVYGTLQTSDHATHVYQDTVQRYQAKQQKEF EKQKLQQAQQRKALEDKKRKKEEKQEGGKALGAGDGRGSLKKKKIEKTTI YISGLPFSAHSNDLVKMINERVGDLKEVHLVSDKNGKSKGIAFAEFNTSD AAQKCIDTLHGDITFNEKHPINVTYSKKEFKQQSEQEKNTQLHFEQEQKR LAQIEINFENSEGKTVFINNLSSNVTKEKLQSFIESNGATVSDVRVIVKA RPFAYVDLPTPEQVQNALKLNNKYFLGNYMRVALSKPPPGSAPREHKPNP NKTEIVANPFDSKPGDEEMTTSTTSTTTAVPTRKPVLFIPRGLKKAPATS NK >DFA_07065 g6797, MDSADEDDYYDDEDDDDDYVTSLNSQQYNNNSNNQQNQQGGDTYNYYDDQ TDLVIDEDEEDLKNLQKVLEDEDSNDDDDSDNVVVEKSKSNNNQDFKITI KKKPTTTTTTTTTSTTSTTSTTNNNNNIDKSIVLHSNNNNNGVLNNIPSL GISFLENMEVLETSGMDKEEALKTLQLNQEYQKQLRLYLRNIDASIILNQ QLLSKARASLSISVNPKTENVGNSKRAGVAPYFQDSEGAVPNDNPDTQFI KATYNNMPTYFKSKRWTKNELSTLSKGVREKNMQILLFRLSQRTHSKDEY DREKKKIENLTLSDLEENLDGLEWESIVHEYLPGRTPMECELRWRNAEHP LINKQPFTKEEDKKLLELSKKYGSHSWSDVAQELGSNRPALHCCQRHQRS LNTKFMKREWTKEEDEILLREYTKYRTFGDKSWQQIAEALEGRTGQQCLH RWQKTLDPAIRKGRWTAEEDELLTKAVESYGKGNWILIKNHVPGRTDMQC RERWCNVVDPALIKDPWTEEEDKILKDLTAKYGVGKWAIIAKELGRRTDN QCWRRWKQINSKTPFLKEYREVLSKKKEIIVSNFVGREKERPAFDISDFI SEEKLKEMSSSEALQQLSRQQTPKSKGLIPSKASKKRKSKRKQDSDDQED NGSTLEEQQQEEEDEEEEMTEEQVRGLPTIQPPTMETLIERLMDQSKQLE QQLSQKNQNNNSRMDTSDDQVIEEMIDNNNNNNNSTPPTNTLLPFTPTLT HTTIPSPPSSTPKKPPRSSVARPRKSQKTTHVNNTTTATTECENDQTSTT STTSTTSTTSTTTTPRTPKARTPKPPKAPKPPKPPKEAKPKASRSRKSTS STEIPQLPNLVQVLPIATREQLDENININNYIQQNEQ >DFA_07485 g7217, MKPMKSKKDEEIDGKVDIKFTPNEIKNKARRVILWNKLRQQTNKEKSERR RDRLKEAKKLGDQAPAKLLPRTIERLRKGDETIVQDEDEEVSEDVNLDEF ASYFDGKEPKVCITTNTRPQGRHITPFVKMIEEILPNCEYFPRKDFKLKD IVKFCANRDYTDLLVVNEDNGVVHTMMMVHLPYGPTVQFRVTNITMPDKI ENCGKMTSHKPELIINNFTTRLGLTVGRMFASMFPQDPNFKGRRVVTLHN QRDFIFFRHHRYEFASNEKAYLQELGPRFTLKLMYLQKGTFDATGGEYIH LHKADMDVDRKTFVL >DFA_07649 g7386, MEYSDNNIYHPTFIGQPHLVNNATGPAMAQLYQPMLTATAPQPIYNYYGQ SHMPQQHHQHQIQPQQQIQQMPQGMMQNTSYVMPSVPSPSMFMNPTTTNN NNNNNNTTSSIYTSNSASTYALAPSSMIDNSHQQQQQQPIPIHQNIQHQQ QNIPILQQQQQQMPQFGYYVQQQPLISPIPSPLFYSHPQQIVNNNPTQQQ LSNSNNNNNTAMEIASTPTTMIQNTSPNSSSLSIGNNNNNNNNVTSSPNN NNNNSGNTGTSQIMTKNRILINPLPLHTLSNLNSNSTSSSGGASSPRSAS TTPKQKPYSPRNTKAKPSPQQQLHSHQLPLSPSAIKAGISSLPTYPAGGQ TSPRQYNRTPRMVDGSLVADSNAFNSPRISSAATSPLSSSPITMNSILTS AQFNLINSSGEITDKMATTMTISDPLTPQQQQQMFYQQQQPNLPLTPKQH QQILSESTTSNICPAATYLSNSGQIAPSLPLQQQQQQQQPTIINNNNGSF LGYQPISVSDNTSNSSLDEYCPMTSSPVQLNADDTNMVGQLAHHLQNTMG GAVSTAMSTGSVPLQMQTLPILTPCGRCGNSVTTNDQAIACYSCAHLFHH VCVVSHSGQQWQCTYCNQIQTDPQVHLMQQLQFNWNDQMSSSPLHITSSL PPPPNHMIMPQQQLQDTTMQQLISPVVSQTPTTMVTSQQAIVLLPKDHDH HSVNDNNSDDDEQDDDDDDEDEEEEEKMDKTSDEDEEDEDDDDEDDDEEE EVMTTNSPASTPRGSSKGHWSKEEDELLKNLVDIHGTKKWKYIASLLTLR NGRQCRERWSNQLDPTIKRDAWTLNEDRIILEAHAKHGNKWAEISKLLPG RTNCAIKNHWNSTMKRKITKNQYDLTLINVDSSTIEAIKNESASKKKVTP LIVAVSNSSNNNNNTIKSPRPSTRQPNTPRQSSDQQQQQQQIPLTSQTTT TTTTMIIPKIQSFDQLQQFQQQQQQQQQTNMNDHDGDLHLPTLVSINTDD QHFSTSTTSTTNNNMTSSSSPFSQQPPSSPPKPCYICETISFIRPKSSDN KVHSLTMEHCAHFNVQYPHNYSTEKVYGLCNQHYVCYKRNTKTTTVDGNS PRLIIEDPVQRELREAGLWPDLADIDRMKMETKTDTSKINLLNLYTVLHQ TKNLPIEKSYLALYEAFNIKAKTKKNQVGAGGVPEKVDDINKKLVKNNIK FKIKNLLLTYPHLQFYGGVKEFHLGRLQKVPEVLLVKNNHHLKMRFQESD >DFA_08024 g7749, MTSRKNTSIDHSNQITTAHSISSLVSSSSSSSFDSSTSTPSSSFIMASPL GAPTMTMSTLMTTTTSTTSPPSTVIAAPALTRTTSRTGSINNMMMMMIPP ESPISTASTLSSSTDSAFSDISSASTVNGGNGGPLKKSKGKWTLEEDDIL RQAVAKHNQKNWKKIAEHFPNRTDVQCHHRYQKVLHPNLVKGSWSKEEDD KVRELVEKYGARKWSEIAQHLNGRMGKQCRERWHNHLNPAIKRDGWSEEE DRIIKEQHVIHGNKWAEIAKSLPGRTDNAIKNHWNSSMKRSKKPSNFKRA PRKRKVITKKEEGDEEEEDYGEEEFGEDGEGGEEQGEEEKSLNLNTTPQK TKLAVDVPSLQNFITNGYIISPKPRVSLTTTLPASPRPPLTPVQAISNLV TPIKPFSNSCKKKSKTTNDNNNNNNNISPLKMNYDYINPEIFPQNFESEL SPIRSNTSLLATTSFIPPPTTPVGGLGIIATTPNSANTTASNGSSSQHLF DHSNFENSLSSPNKLCLLSPYRSNLSDLLHNNNNHINNNNNHINGNSLNP FSNILSPSPKYKTTSSTPTTPGSTIQYLNNVNNHNNNNTTTMVHSPPNHS VNNNSSGNNCTGTNKTLGIQLSDKGIDKTSLQNINSKLKGITTPPKINNA NNNNMMIDNNNENDTITTYSTPSKPSNNNNNNNFYFVPFTPFKDNNVSHI STPLGSTRSSTAKLNNHIKQSEMENFITEDNFDPSTKKTSCISTTTTTTK LNTTGLGNFEGCSSVALKMLNDTSKRSIFDKARKLLMTTENQQQNNNNQN NTNNNNNNKNNGDDFVPNISFLTPSSSNFTPTTPLKQQQYFLPTPYKPPV NMNMNNHEINSINNNNIENNNNNNNNGNNKPNNQTFLDHHGCYVGIH >DFA_08117 g7838, MSSTTTSNNSSSENNNQLDDSKLQEKAKKWLQINSKRYSEKRKFGYVDPP KEDMPPEHLRKIIKDHGDMSNRKFRHDKRVYLGALKYVPHAILKLLENMP MPWEEVRNVKVLYHITGAITFVNEIPLVIEPVYLAQWGSMWVTMKREKRD RKHFKRIKFPLFDDEEPPLDYADNILDVEVEYAVQMELDPEDDKAVYDWF YDNKPLINTKFVNGPSYKKWRLDLPIMSTLYRLASPLLSDLTDQNYFYLF DDKSFLTAKALNMAIPGGPKFEPLFRDMGDDDEDWNEFNDISKIIIRHKI RTEYKIAFPYLYNNRPRQVSIPYYHHPPNCFTKTTNPEAVGFQYDPILYP IPSYKIDRSASIYGDEDDDFVLPEDVNPILLKSAQVNTENTLDGINLYWA PKPFNQRSGLTRRAEDVPLVKAWYQERCPSQHPVKVRVSYQKLLKCYVLN KLHHRPPKSLNKKYLFKALKATKFFQTTEIDWVEAGLQLCRQGYNMLNLL IHRKSLTYLHLDYNFYLKPIKTLTTKERKKSRFGNAFHLCREILRMTKLV VDTHVKYRLGAAEAFQLADGLQYLFSHIGLLTGMFRYKYRLMRQIRMCKD LKHLIYYRFNTGAVSKGPGCGFWAPTWRVWIFFLRGIVPLLERWLGNLLA RQFEGRQYNTTAKTVTKQRVESHYDIELRAAVMHDILDMMPEGIKANKSR VILQHLSEAWRCWKANIPWKVPGLPVPIENMILRYVKNKADWWTNVAHYN RERIKRGATVDKTVCKKNLGRLTRLYLKAEQERQHNYLKDGPYVSAEEGV AIYTTVVHWLEKRRFSAIPFPQTSYKHDIKILTLALERLKEAYSVKSRLN QSQREELVLVEQAYENPHEALARIKRHLLTQRTFKEVGIEFMDYYTHLVP VYSIDPFEKITDAYLDQYLWYEGEKRQLFPNWVKPSDNEPPPVLVHKWCQ GINNLDGIWETANGECVVAMQTTLSKVYEKIDLTLLNRLLRLIVDQNLAD YMSGKNNVVIAFKDMNHTNNFGLIRGLQFASFIVQFYGLVLDLLILGLNR ASEIAGPPQLPNPFLTYKDVETETKHPIRLYQRYVDKLYVVFKFSSDETR DLIQKYMSEHPDPNNENVVGYNNKKCWPRDCRMRLMKHDVNLGRAVFWNI KNRLPRSLTTIEWEDSFVSVYSRDNPNLLFSMNGFEVRILPKCRSPNDQI IPKDSVWALQNINTRERTAQAFLRVDKESMDRYENRVRMILMASGSTTFT KIVNKWNTSLIGLMTYYREAVVVTREMLDILVRCENKIQTRIKIGLNSKM PNRFPPVVFYTPKELGGLGMLSMGHVLIPQSDLRYSKQTDSGITHFTSGM SHDEDQLIPNLYRYIQPWEQEIKDSQRVWAEYALKYEEAKTQNKNLSIED LEDSWDRGIPRISTLFQKNRHTLAYDKGWRVRTDWKQFQVLKSNPFWWTN QRHDGKLWNLNNYRTDIIQALGGVEGILEHTLFKGTYFPTWEGLFWEKAS GFEESMKFKKLTHAQRSGLNQIPNRRFTLWWSPTINRKNVYVGFQVQLDL TGIFMHGKIPTLKISLIQIFRAHLWQKIHESVVMDLCQVFDQELDNLEIA VVNKEAIHPRKSYKMNSSCADILLRAAHKWQVSRPSVLQDTRDTYDGSTT QYWLDVQLKWGDFDSHDIERYSRAKFLDYTTDSMSYYPSPTGCLVGIDLA YNIYSSFGNWFPGVKPLVQKAMDKIMKSNPALYVLRERIRKGLQLYSSEP TEPYLSSQNYGELFSNKIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIF IFNPRTGQLFLKIIHTDVWLGQKRLGQLAKWKTAEEVAALIRSLPVEEQP KQIVVTRKGMLDPLEVHLLDFPNIVIQGSELQLPFQACLKIEKFGDLILK ATEPKMVLFNIYDDWLNSIPSYTAFSRLILILRALHVNNERAKIILKPDK NTITQPHHIWPTLTDQEWIKVEVALKDLILADFGKKNNVNVASLTQSEIR DIILGMEIAAPSQQREDQIAEIEKQKKESSQMTQMTVRTTNVHGEEMIST TTSPHEQKVFSSKTDWRVRAISATNLHLRTNQIYVNCDTAKESSITYVIP KNILKKFITIGDLRTQIMGYMYGVSPPDNPQVKEIRCIVMVPQWGTPVFV NVPNQLPEHDHLKDLEPLGWIHTQPTELPQLSPQDAIMHGKLLADNKSWD AEKTIIAAVSVSWPCTLTPYRLTPSGYEWARANKDSQNFSGFQPSHYEKV QTLLSDRFLGFYMVPDRGSWNYNFMGVKHSANMTYGLKLDYPKNFYDELH RPSHFQNWTQFDQKSSTNQDSEEVTNNADNENLFD >DFA_09027 g8714, MNAGIAIQQHIPPSQQLQPAQQVTNQYQYQQQLQQQQQQQHQTQQHHIIP PHSPIYHSAIPQYLHYQPQPQLQQYSHQYHHQLQQQQPQSIYQTQQQPVH SPYLTAVLSTSNNNTNASNNNLSNSSQMATNNSREYGESTTTTSVIPPSV SSCMTPTMPSQVMSPLIVGTPTSGSVAKRKLEDDGDFSLLKVQPLLVSSP LLTPVTQSPGLTSVQLAFQNTTLSNPPTPLSMSPSLSPSMAPMSPSKKSK NSRSTSKSKWSEGEGSGRWPKVGTIVKGPWKDEEDAKLIELVNKNGPKEW STIASKIPGRIGKQCRERWFNHLSPDVRKTNWTPEEDRLIIESHQELGNK WTAISKLLDGRPANAIKNHWNSTLVKRIGADIRNHPPSTTRDSKSGDDDD DDDDEDIDTQSPALSPISLYPQDNALGKKSQLAHHTMASSTDSSSSSSNY NIPPFVLSDSISNSVPIQTHAQSSSSLLHQQPLTHQQPLQLPQQQQPQQQ QQQQTGTIVAPQIIRLQNSSTTPSSSTSSPPPPTQSNNNNNNNNNNSSQT KKPLDLNFPQQQHEHHSVPQQQQTQSKAGYYQGAGGAGTTGGDITNYPPS HDPTSTTSNYWEMTPSGEMPNHQTLFTADFHVFENPSEFLFFGDSDHQHL QQQPQQQQQQNQHQQQNQQNNMPLTKTDLNPPQHQGSQPYDLNNLFNNNG DISL >DFA_09245 g8922, MKGLEPVTIDTDEIREVWAHNLEEEMALIRELVDDYNYIAMDTEFPGIVT RPVGSFRTPSDYHYQTLRLNVDLLKIIQLGLTFSDSDGNLASNTCTWQFN FKFNLNEDMYAQDSIDLLSRSGIEFKKNEENGIDVLDFGDLLMSSGIVLN EKIKWISFHSGYDFGYLIKVLTCTALPQEEPEFFDLVRTYFPCIYDIKYL MKSCKNLKGGLSELAEDLDIKRIGPQHQAGSDSLLTCTTFFKLRKMYFEN QIDDSKYQGILYGLTSSFSQDNSQQNNNNSNSSSSSGTTSTSTSSTTSTS TSSSTSSPSSSSLPNNNSYTASASSLNGHVITS >DFA_09249 g8927, MEDQEGGTNNANDDNNNNNNSNVEEEDTNRRERDEDQQINNNNNSDDGGG GGGEDDGDDNDDDDDDDDDDDDDDDEDVNIVLDTESVEAGRTSTKNSSGS IKPSFYKSAPVTNLAGVKYQISKQATASQHQPNRPQKSIYDLDLGGFEDR PWTKPGADMSDYFNYNFTEETWKLYCERQIQLRAEQANLGKIKSYESTNK MINGGNDQKIDLPPEFMPQEMNKQQQQGGRGGGGMPTDKRGAVPQQGGRG MPGGWQANMQPPGGGPPGNFYPGGSGGYPPPYGGGVEGPVPTMPPSDERR RERERDREGTSSGSGTRERERERDSGSGSGSRSDRDRERERDGSGSGSSR SDRSERDGSSRSDRSERSSGSDRDRDRERSRRGDDSEYKRKSSEDPNDDR LKRRR >DFA_09405 g9080, MAKTKRNYKPHLQTSKIKPTPENKDDESNGSGTSEVVSDDTIREETPEYT IDGSILEGGGQIIRNCLALASLFNKPIRINKIRNGRDQPGLKAQHRSGVD LLVRLFRAHASGCKVGSTDLYYHPRRLAHEIKDTSIEADTGTAGSITLLL QISLPCLVFFGSSTKLVLGGGTNVAFSPMIDYIAQVFAPAAKLMGVDMDI TIDKRGYYPRGCGQTTVITRPLTTEPLKPIEILDRGQVTKIIITSYFTST RINPSVADRMCQHARKLLKKEFKVEIEERSVDVAKESYGDCAYIFIMAET STGCRFGASAIGEIKVPAEKVAEDATIQLINDLNGGGCVDEFLQDQLIIF MALAKGTSKIKTGLISLHTETSIHFTSLMTGAKFQVIPDPDGKKDVNIII CDGIGYLNGQNQNQNINNCTTSTSTTTTTTTTTNNTTTTTDS >DFA_09630 g9281, MEDNQDGHDVSSSSEQVVAPETTTTTEEQQQQQDNNNNTETTTTTTPTPP LTNTNTDTTTVVVEETSAMDTTTTTAEPPLNSSSEEHTDNTTTTVTNGSS VGAGSNGDVVEEHQKDEQKEEQKDEQSNQQEQSIQQEQKEEQSNQPKQKP TITILKSPPVPLSSIASVMPAIGKRLNVAIDQLEARITSDKYDTEAWTLL LNEVQSQPINIARDIYERFLAVFPTAGRYWKLYVEQEMAAKNNEQVEKIF VRALRSVRNVELWRTYIQYIRSGQQNDREEVIKAFELALEYIGMDIASTP VWIEYISFSREDRAAATNPQDEGHRMNSLRKLYQRAVENPMHDLDALWKE YEQFEMSMSKQLAKTMLAEHLSKFQHARNVYRERKALLEGILRNMLAKPP RASDKEAHQVRLWRRLIAYEKTNPQRFDAVQLRNRVTATYNQCLLCLYFY PDIWHEAACYQVDVGSVDAACQFYERGLTAIPNSLFLSFSHADVLESSKK VDKAKEIYEKLITATAPSTPPLVWIQYMRFSRRHERIEGPRKVFKRAKSS PDCTYHVYIALGFIEYYINQDTKTARDIFEIGLKKFGTDITFVNFYVDFL SNLNEENNTRVLFEKILSNVIPQEKSEAFWRKYLDFEYRQNQDLATVIKL EKRVAGLSPAFEKHSLLQHLNRYKFLNLWPCHPNEIEIMSKNLLRDDGED EDDMDEDGGDDQDTSSSSYSSNKYRNARGGGGGKWDHRGGGGGAGGGDEK NEREDKPTSQTKIPLSTLKASRPDTSAMILYRNEMGKISARGGGGGIGGS GEISPPIQSINVPPNNMPPGIMGGGMGGGVMGGMVNGIPDFLMPIAQFYP PPQQFNGPWVDVDQLMMLIKESQFPPQLLQMMAMAGINIGMNPNMGGGNM GGNMGGGNMGGMNQNINMGNINNNNNNNMPPNQSFNKQQQQQQQQNSQPT TPTTTDRSNQSPPTSGMSGGQQQQPPQQQLLGKRKIDEPDNSNNGNQLSD QQPPNDQPIKPNVIPAPVPSSSQPLPSSSVVQPTSNITSTTTPLNSAQPI SSTISSTKLPDNDIYRKRLASKLSKKI >DFA_09793 g9438, MSLKEKVVVQQQQQEKEEEEKEEEKETQEETKDENGSTIPKPPPLPPVET VKSSLSSYPSSNRVIPDDALRIIQFNIQADIYTHPQRYHYCPSYALYRPY RQYIIPEYILEHNGDIVCLQEVEVEFDRLRKVLIESGYNHTAVLAKETDR QHEQCITFYQTSRIQVIEEHLVNYNTIEKHPELISKEQIASLTNNNVHNT NMYNQLLHTLHHNRHNILLLECKKTNQKFIVVNVHLYWGASSNDTNYYLQ ILQMNMLLIMVQNILTRHKLGSWTTIDDNFETNTPIIISGDFNNGPANYT YRYLAKGNLNVNTNQGLVNYQHPFKFKSAYNLHPNGELKYTCITRDFKGC VDQIFVNDKIKVESLLEVEKYYGECLPTITEASDHILIASTITFLKNKE >DFA_00932 g946, MNEEEDICRVCRNGSTPDNQLSYPCKCSGSIKFIHQDCLLEWIKHSKSSS CELCGYPFRFTPIYSDNTPDILPFKELSVEVLKRSFKFLKRFARISFSFM CFLVMIPALTCITFHLFFGMSTKKMLPYTIFNSFLIGVTLYFFIIATSFL SYIFLTFLNGKLIELEIEEPTTTEQEQQVQDDDEEDEEEDDDDDDDTEED ILYDQDHIFDNGLEQQMNQQPAIPAHIPPPPPPAPQQQQQGEVRHIFRIP GVIEILGVQQQDIPLNPEAVLENNNIVVNEHDDGDDIETLIGLRGPISDV ILRTASFAIYNFIFLLIFLYLPFQIGKQAILVISQIDTGIGGFKLSQLTE GIINLFIGYSSLALVSLYILSECIKHKIAYKISRLLYSFIKISMIFFIEL GIIPFLFGVALDLLTLPLFGGNLESRMNSFTNNKIQYILTRLAFGLFSII GISSSSRVLHQIFRPEVIWFLKDSADPDFSLVKFLIKAKLHHIFFNISMA FLTYVIIGFLIIFLPLKVLSFVPDLLPMDFGDIFNKLGTDICLIVSTSYF PRFHPQFTFNSFIKTTFTFFVTKLGLDDYMLIRKPTTTNTTNNTTEPVQQ PEYNYQPPDQQPEQQQQPEQPQPQQQQQQSQPEYIKPNNYGYKVIIFMVF CWFELFCASLLALSIPVSIGRYLFSLVQFTYHNDTVSLFVGLAVLWLVTK GISTIFSGRTSVDHFFSKIPNVLKLVLMIAIFTVALPLLIGLVFELVVII PLIANYDESYYIFVIDLFNIWGIGVLLLNFWYQWITSRAPMNNIRNRRPP EDELIPHQNGEQDQQGDGDRWMDRFNQLKRNGFMGIDLWFSLQKIVFPIV YFLLKLLTVPYFISKGVVPFFGGSPILENITFIYGYPVFVFLLASEILYF KLKKSVIHFHNIIRDDRYLIGKHLHNLEQQRQQHH >DFA_09953 g9594, MLKVQGSKQFRQRIVCSLLSGRAVKITNIRDEDERPGLADYEASFLRLID KMTNGTRIEINNTGTMVTFHPGVLTGGKLQHDCPTSRAIGYFVEAVVCIA PFSKAPVDIAFTGLTNNDIDLTIDTIRTTTLPIIRKFLGGEEGDTSSLSI KIVKRGAPPSGGGLVYFKCPIVQQLKPLQMVDEGKIRRIRGISYATRVSP AIPNRVLDSAKGILLQFTPDVYISSDIYKGAEAGSSPGYGLTLVAETTTG CCLSAECMASEGEIPEDLGQRTANLLLEEILNGGCIDSNNQSLALLFMIL CPEDVSKVRLGRLTPYTIEYIRHLKEFFGVTFKIEADDETKTIIFTCLGI GYKNLARRTF # Dictyostelium lacteum >DLA_01330 g1210, MNMNMPPVNDDDEEDADSKRMKSKGNPTTTTTTTSTISTNGKNTSNNNNN NNNNNNNNNNINDSNNGPNGGLNGLAQLIESTQELHSSQSMDDFSDISST TSTISFVAPSKPKNKGKWTKEEDDLLVKAVLENNQKNWKKIATNFTNRTD VQCHHRYQKVLHPNLVKGAWTKEEDDKVRELVAKHGAKKWSEIAAHLNGR MGKQCRERWHNHLNPNIKKDAWTEEEDRIIREQHAIHGNKWAEIAKLLPG RTDNAIKNHWNSSMKRERSDASSTSSSSSHSSPVSSPTIIQQKLKLKKAA EKLWKKKPAPNSNNIDSNNNNNNNNNINNNNNNTTTTTTTSTTKKLTKKQ QLQLQQQQQQQQLQQQQQQQGSLHHHIDVDTLHGYLVNGLPITNVNGKRP LGSTMNASTGNIMTSTDQYPYEYSIENMISPVKLFTNSPNDHLKKKQKLD QTPTKQQLSSTQNTVFEGYTSEIYELGFYSPIGKSDKNTSSFLNSSELSP LKSPSKIINNNANQLQQQQQQQTQNGTPCKHFLDSNVCFPYSGGGGTPNG HYININSPVKSFQSPYKNFNNFNLIQQQQIQHLQNQQFLSPFKQQQQQQQ QQLLQQQQQQQQQQITTTSNSLPPRFNGENNKSSKSIIGIHLSASGIDKT SLSTINAKMNGSSTTTAVGSGKSSTEHHQSNIYSNNAITSANNTSTNNTI PYSPISQNDGSFQYQFPLTPGTTKSLSRFSSSHSISPQKQFQNHSDSFEE DDVDVQQPQPQPQQDKQQEQVVQQSPMDPSMIALKLIKDNSSKSIFSKAK RILNNSVSSSPATSVPGKFSTPISSSNSVNIQNNHHQQQQQQQQQQQLQL QNGYLVPSSAGGNSNGMTAVNQSFGSPMNGYGNSYNTHQQQQQQQLLNNT TRSNDTYLLSPATIQPQSPTATTTQCNSSISLNSPNLPASLVLTTPTTTT TNTNINNREPIVFQLNAHSYKSQNISTTPSTPTTTTTTNTTTNTTSSPTT TANNKNAFSFKESDHGILT >DLA_02388 g2168, MEKIIIFPPIPVVHVPITLFNSNKQAIDSKYILECNWFIDDEIVQSFNDN VDHHIGTQLKRTSSTSLLSLVTSIDQPKTLTYIPTLKDANKTLKVTCKFK SKILNLIPDTSIFQYEHKVLSSKSNSQREIYYLDNSLNHPNNNTINNNNF SSSKSNLGASLQNYRVIQYNILADGYVSKFIFPYCEPYALFYQYYRKYLI GKQILQYNPDIIGLQEVEDSYVDLFKEMEECGYVRSPPFSNCTGLPITPG AAQEGCVIFYRSSRFQVISHLLIRYQTINAQTCSVPNSILTLEQYQLLLQ EPIFKPILEKVMPFTDHHTKHVLLLLQDKQTQRIFVAASIHNYWGSISKM EFNYQFQCLQIVILSMILENFLRSNQLPLDTGVVLCGDFNAGPESESYKF LSKGFFSDTAKITIPFQSPFIFKSIYSQLPYGEPRFTTYTKSFQGNIDQI FINNSFQTHSILDISDRSLYNEFGYLPSIVLGSDHILLLSDIELKK >DLA_02701 g2436, MSFEQTFQEEDGDICRVCRNGPTPDNQLSYPCKCSGSIKYIHKQCLLEWI QHSKSSSCELCGHPFRFTPIYSENAPEFIPLSELVFEAMIRLKWYIKRFA RIIYIIFCWLFVVPVVTCWIFHFYFGKQWFLSAYDRKADFTVNSFIYDFF IGTMLFFWILFATVVSYVILDFIHHKHSEIDIQNEMMNYDIQQIQQQMAA NNNQMHQHQQQQPVQPLFQNNNESDSETESDDSDQELIPNNVNEAYNQLP QQQQQQQQPQIQQPQQPQARVIFRIPAVFDILRGEDQPIQQQLQQLQQQQ QQPQPQQQAQVNDININNNIDDDGNDDIEHFIGLSGPLSNIITNCIILVL FNAAFILVFLYLPFLLGQFVQELTLTKLEMSIFLKGSLDIFIGYAVFSIT SLLFLSFLISKNIWLKFTVLLYSFIKVVIITVVELGFLPILIGMYIDFSS LSLFGSSISSRFDYFMSNKLPFLITRWGFGIFFMLNFTYLLSTLHQIFRK GVLWFVRDPDDPDFDLVKDMIKSSFQKHLFKISFSVFAYIFASTLLVYLP SKALSLIPNLLPINIDFGQATNKSTSDVIFIYAISYFPRIDARVTIKNVT KFWVKEASNLLKLDGYLLPQPAVPSNQATNNNEANSQAAAVVVKPDNFGL RISTFIFMGWLSLFLIISTYLSLPVIIGRYLLGPVSGNDIYSIILGLITI WIAGKSIYILVSNGSSINLLQWSIILLKIIFISICCLILIPILTGILLDL IFFIPLTTPYDETLHFDYNEKFKILFQYWCSGALILQFWYRCVTAANYNP NNIRNNRPEDLERPRDKWIDRFEVLKRNGFANINVKYTLTKVVFPIGHYL LTLFTVPYCISKFVIPYFGGTLTLESIAFRYGFPVYCFILIAEKLFMQIK SWILIFNNAIRDDRYLIGKHLHNLDESKY >DLA_02844 g2562, MYNFISTVQKPTAVSHSVTGRFTGPHDNNLIISKCTKIEIYKMGTDGLKP MLDVNIYGQISSLKLFSVNGYDQDLLFLSTERYRYCVLAYDQQRKEIITK LSGISEESTGRPSEPGQICIIDPQSKMIALHIYEGLLKVIPLSTNIFNSS SVSGSGNITTQEAFNLRLEELQIIDLVFLDKCDRPTLAVLYKDTRHSRHI STYEIKIDKDHSPGPWSHNNVEIGSNLLIAPPFGGVLVVGEQVITYLNGK FPVSVQIPFTTISCYEMVDKDGSRYLLGDQSGQLYLLLIKLDNQGNVIEM HIELLGETSTASSISYLDNGVVYIGSSQGDSQVIKLKTEKDTQTDSYLEI LDTFENIGPIQDFCVVDLEKQGQGQIITCSGIFKDGTLRIVRNGIGIAEQ ASIELQGIKGIWSLYDFNQVGSSSHQNADRYLVVSFLTSTKILQFDGEEI EEKEYIGFDLSNQTIYCGNIGDHVVIQITRNGVYLIDGKAQQLLDQWKPS TASGQINLTSRNSNQILLASGNQLFYLEIQQKKIKQISTVEMPFEISCLD LSSFEGQEQSQLCAVGLWTDISVRLLRLPQLEEVCKEILGGEIIPRSVVL LTMEQQHYLFCSLGDGHLFNFSLNINNHTLHDRKKLTLGTQPIILQKFQK NQSMNIFASSDRPTVIYSKSKRIFYSIVNLKEVSHVCSFSSKVFPNCLAI ANQSSLTIGTIDQIQKLHIKTVPLNGEMARRITYSEESSVYAIATLRYSL DDQSSSSTTTTTTTSSNNNNNNNNTQKKTNSTGYGIPEFQLKLLNDQTFE QTSSYQFQPDEYVWALTTCKFSSDHNTYIVVGTSYREKESGPLKSSQGRI IVFSVHESRLVLCEEQPTVEPVYYLLPYQGKLLAAVGKRIQVGSWKFNSQ EENGKLQLSESVYKGHTMIVQLAARGDFILVGDAMKSMSLLSVTADGKFN VIGRNPQPIWLKSIAIIDDDHFLGAETSNNFVVIKKNSESTNEQERQLLD SVGHFHVGEGTNWLKHGSLVTLPEQDQQQRKIPTILYVTINGSIGVIASI TKEEFDFFSKLQEGLNKVIKGIGGFSHSDWRSFANDHHIMPANNFIDGDL IEMYLDLDHDKMLKAIQGMNMSTDEVYKKIDTLMQHIR >DLA_03103 g2802, MTYQFNELPEQFNNEAEDNFFKEMKEIGRGRSRTRDEISKIKRTKLTYFD EKVNVSPFSSIPRVIPPGLNDQQISALILRIRIEEITKKLLSGVFEITDR DRERSPSPPPIYDNNTGKRTNTREQRIKEKVMKERHQLILAAQQISTSYK PPSDYQPPVEKKTCKIYIPIKDHPEYNFIGLIIGPRGNTQKKLEKESGAK IAIRGKGSSREGKSTKPQYQENDELHVLLTADTQEQLDKASILVREFLVP VEEGKNEHKRQQLRELAEMNGTLRERPAYIARSWERADIKCVHCGESSHP SSDCPLRNNNDQQMLSIIEEEYKKCISEVKEILGYDFELNLNDNNNSNNS SNNNMANGYDEFREALDKDEQLQQQQYQQQYYNNNSNNNYMQNNNYNQQT NNNFNQYQNWNQNNNNDGNFNNNSPYGPSNPSHQQSNIYNQNFNTSPYGP SR >DLA_03551 g3204, MSEVENEQDVVMNTDNSNLQDESNEQNTDKQESNIDENNTNDGDENNNED EDDEEDDDDDDDDDDEDDVVLVLNSESVEAGGTARSFKTGQNGKSGFYRA PTSMTPGLNKYNIVKQAGGTTFSSNRMQKSIYEVDLDNFEEKPWLKPGAD LSDYFNYNFTEETWKAYCERQNQMRLELTNQGKIKGYESKSVDSKSDLPP ELLGVEQQQQQQQQQQQQQHQMQGHIPKRVPPYLLKKGPPDQRQNFNPHD QDSQDRSHHHHQGGGGGGRGGNQYANNPNVSGNGSGSGGGGGGGGGGGGY QGRNSVGGQDYRKPYQNNDEDGDRRNSSNRGGSGSDYHRDSRSSTDTRER ERDDRRERERDRDPRDSRGSDERDRRGGDDRRERERDRDPRDSRSSSDYK RKLEDADDDRSKRRR >DLA_00419 g377, MLHSYCPISGQYIMTSQNPNNMNSSQNSNPMQGGREYGESTTTISIVPPS ALTPTLNSQPIQQIQQIQQQQQVISSPMMGSAKRKFEESDSQISYSDYSS MLKNPVLVSSPLLVTPNTQSPGLQSVQMAFQAASLSAPSTPLTMSPSLHP TSPMSPSKKSKNSRNSGKSKWGPTTLHLSQSSIPEELQMPSPNSSTSSLK ANIIKGPWKEEEDAKLVELVNKNGPKEWSSIAAKIPGRIGKQCRERWFNH LSPDVRKTNWTPEEDKIIIDAHANMGNKWTAISKLLDGRPANAIKNHWNS TLLKRVGGESTSSPRTRKQKSDKSGDKDKVSTEDEEEDEDEDENTSPALS PISLYSSDNSVQIQNNIVHTPTQHPNISYNNIPPFVLSSNSTTPYPMLDQ QQIQQQLQQQQQIQLQQQQQQQQQQQQQQQKNTGANGNTVIAPKIVRLQT PNSSPSLGSQQQTQQTQQLSKKEQKKQLQLQQQQQQQQQQQQLEQLQQQQ QEKMHQPPIHQQIEQHLQQQVSQYQIPIQSNIYQQQEYIIQQQQLQYYQQ QQQQQQQQQQDYIYHQNSQQTQNNGQQHQHNSHVLQGSTQEVPYFDPYNF IQQQQQQQLQQQSLQHHLNQQQGQNQQQPTQQNIDPNSHTYAQDYNHDFL LFDSDHQNVNMPNQLHHIKQDQTSYHQISQQHHPNILQSPTQTQQNNQQD QKHVNSYDISGLFNLEV >DLA_04230 g3813, MSGLHEENGITIQEQSGNLKNITIKDVWSHNLEEEMAKIRELVDDYNYIA MDTEFPGIVTRPTGNYKTQSEYHYQTLRMNVDQLKIIQLGLTFSDSEGNL AKSTCTWQFHFKFNLNEDKYAKDSIDLLSKSGIEFKKNEMNGIDALDFGE LLMSSGIVLNDKIKWISFHSGYDFGYLLKVLTCTDLPQDEIDFFQLVKTY FPCIYDIKYLMKSCKNLKGGLSELAEDLDIKRIGPQHQAGSDSLLTGTTF FKLRKMFFEGQIDDSKYLGILYGFTSYLQDSNGNLMVHPVQPQQQQAPPP QQQQQLQQQIQQQQQMQHQMHMQMQYQQQQQQQQQQQQYTMYNNNNNNII INNQQNNYNNIYYPTSSPSSYNRGYPYYPNSPINTNSPSNSNINK >DLA_04254 g3836, MNGVIHSAVKLDDSGNIRGNLIKIRDLPNEMIKSSLYYRLDQIIHSIKPN HILYIGIDGVPPRSKSIEQRKRRFKASKEALDVINKMRSYQKSNGNGTQL VSGNQQPQQESYFDSNSISPATEFILLVNEWVREYCVKLSKIYENLNIVY SDSSVPGEGEHKIMDFIRSYQKSSHYQSEKESHIFYGMDADLIFLGLSTH EKNFYVLRDALDLIQCSVCKSNQHLSYECHSAFAKRKLNFEKTTKITVKN IPIQTTEETLRKLFGFYGTIVSLSIERANTKRSSLSALIEFSDKKIIDEI ASRGGSWFINNDRLTIHINYPKSTKSRGSSGSTSTSNNNNNTDGEESGDE EEDDSNGDIKDTDEDIIDNCLFISNLDILVTPFDLEAYFQEFSLVKSIDI LPSPRYPKQRFARIVFQDQKSARNAFNIGRNSHFFGSDITIKIPIHKPPP VTETPQEKAEKEKQQQLEKERRMLEKEMKVATFLKMADTDTSRDSAYFYL GNSEWDLEKAYKLYLDYDKAPMEIPSKPIKFTFDYVNLSNFREYFRYYLV CGLSMEHQLMIDENRCINDFTLMAMLLGNDFLPHLPALHITSGSIELILS WYRDWLHESIKNKEVRYVTTQDSNNINYQNFYALLSVLSDWESHIYPDKL EKQVKREFKLKNPSLSPPSSPNSSNTKSKVVTLKDLNIIDKEKPKKNSNS NNNNNSGNGNNGEVLEGFEFTYEKDCYYKIKLEQQFIEQKDGLIKSMCES YLEGLVWVLRYYTHGCQSWEWYYPFHYAPLARDLQQYISEKMSGGQLNTE FQFNMGAPLNPLVHLTSVLPIYSSKFLPEPLRSLMVPPSPVSKYYKEDFK IDLNGEDVPWKGVVLLDFIDFQLLRKLAEPIIESNLSDVEKHRNHVGNNQ LIVNSQVLEMPDISVLITSKEFHDEVMSPRSSALTVRDLNWTTRRVFTLS PQTMVAYQEIPQTVAVRAPLHQNATVQMTELQTQFLEWRKTLGINQSLVV LDSIDSKSCNALNLDSKQSIKSLLRVFSNEISTEKDIVIVSQDGDPQMIV NIGFSKPTKVQGIKFISTFEQKYQPKQIKVYINTNLDFSNAASEKPIQII DIKNSSDLAIVSTPITFDSLKFKTIQSLSLFIESNFGGNNITKLEKIVLL >DLA_04891 g4424, MSARLLRMKKDVDTDDYSPSEYKLRRSPNDIKCKSKRMELVGKLMAAKKI AREQSRKLRKKERDNLGDQAPPKQVPRTIESMRRADETVVDKDDGEFDEE INNDEFASYFNGKPPKTCITTNQQSGHEAKNLARMFSKIFPNSGYFNRRT YNLKEIIEFCNNREYTDLVVINETKGKVDELIISHLPNGPTATFRLTSLE FPHEIAGSGKITSHTPELIVNNFTTRLGHTIGRMFASLFPQQPEFHGRRV VTLHNQRDFIFFRQHRYAFESLSKANLHELGPRFTLKLLSLQHGTFNTSS GEYIHIHKHDMDVDRKKFVL >DLA_00493 g446, MKKVNSFENQNNYEFSPPPKIHIDNLVSSGGSGSLSPPSPRSLTSSSESN SGGRLRRASSPLAFLQSLSPKNKKKNKKKTEFDPLSLMGDINTTNSDSRS SSIDSGYYKPLLNKILNINNLPHRILIKIFNYLIIDKFDTVKLEIRKSKQ KPQQLVSLDNNNNNNNNNSKSSSSIKGFGKSKKKSISKDEEDEEDKQNDN IDLESICLVCKLWGLEIAPQVFHYFIVKSPKDLKSLIGLVTQGLQEGGRK FQFYYISMIIDKSSTYQKFMNILKHRMPDKLPDKFVKATTKPILSNLFSK SLFAQFFENSTSTRYFRFYQKWMSKDNFSAIGMALRSNTSICHLSFRNNN LEDVIVEDVIKALYDNQTITYLDLCGNKLGYQTAHELAMVLTKNRHLETI DLFYNNINTDGGSALFKALRINSTLKNLYLRWNHIKTPAAIDLAETIKLN NTLQSIQLDRIEDAGGGCLFEALCENSSIVEINLSDCAFQQKSSVAISKV LSSKISKLSVLNLKSNQLGLLIRPLAISLGSCQHLTKLNLADNRISDDTG YLLGESLGENKSLTSLSLSMNGLSNHFSESLSLALRINQTLLALDISANK ITFEGAKMIAESLQSNSSLKLLNLNQNSLSPQFGPIIAETLKLNQTLTHL EMAYTGLRNEGSLPISKVLALPTLHIKKLNLSENSISDQVGIEFANALAT NQFLQDLDLSYNTLSSKSKEIFEQSLLTNLSIINLTYSSVPLKWKFQI >DLA_05215 g4709, MENVEKDVLSLRQIIDNRQRVLSQRYEPTESNENDNIEDYDEDDEFDDSE VDDELIIDGDDDNNIESTISTQKPMNFTRSLQQQQSNPIHQLDLDIDDED DQDYEDDEEDLIDDIDSRPLKKHKNLPDSSRTISIPPLTTTTTTTTTTSN NNNVNTNYNFDKVPFSEQDFRNYQQILKKNSIQQEPPQLIELNTLDDILP DDILSDLPNHSTTDKQYAKDALQLNREYQQLLKTFQVQIDEAIKRNSQLI KKITTQQKLSYSNMYAGITQKKNERKAGVSYFSYEIQVDGVTQTFYPAEN LDSEMIKKNFGTMPLFFKCRKWSKGDINLLHKGVQDRNKAKQMFRISESN LTRAEYSARMDELNNIAPKEFENYPLTFDDFAMICMDSFAQRQPDEIKLR WDNFENPSINNGNFSKAEDKELLRLALKYEGRQWHEVARELNDKFAPWPP VPEPTRDNPHPARPPRPPIRTPISCLIRYQRSLNPTLMKREWTREEDETL KMAFAIHGDKNWQTIAEYLSARTGQQCLHRWQKTLNPNIKRGKWSAKEDE LLRNAVEIYGYGNWVMVKKHVPGRTDMQCRERWCNVIDPQLNKTPFTPEE DRKLKELIEQHGVGKWATIASALGTRTDNQCWRRWKQVHNKSEDLVKYQE KISKKKQVVVGNFVGREKERSSLSVDDILEVQESLKTNTTPTSESPNLNS NNNNNNA >DLA_00535 g483, similar to H. sapiens REXO2 and S. cerevisiae REX2 a mitochondrial 3'-5' RNA exonuclease there is a second copy of this gene MLKFIFGNYRTTSIFRNYRHYCTNNNKILDMSDNRAHRLVWVDLEMTGLD LNKDVIMEMAVIVTDENLNVIESGPNLVVKVDEDKLQSMNKWCTEHHGQS GLTQRCRDSKITTQEAEKIMVEFIQKHTDKGLAPLCGNSVHEDKKFLNKE MPLFSDWLHYRIVDVSTIKELSRRWYPNELKKAPKKQMLHRALDDIIESI EELKYYRTNVFK >DLA_05449 g4922, MYNFNYQDDDQGNITMIPTPTKVNSKPRLSSAKTTIVSAGLGNEIREVWC HNLEQEMALIRELVDLYPYIAIDTEFPGFVTKPIEAMRMKPDYNYQTLRI NVDALKIIQFGITFSDNTGKLPQPTCTWQFNFKFSLKEDMFSHYAIELLT NCGIEFSKIEKDGIDVSDFSELLISSGIVLNDKVKWICFHGGYDFGYLLK VLTCTDLPKKESEFFDLLKIYFPCIYDVKYLMKSCKNLKGGLSGLAEDLN VLRIGPQHQAGSDSLLTVSIFFKLREEFFENEIDDFKYKGVLYGYNFETH HDEPFNP >DLA_05543 g5005, MIRRNARLRQEYLYRKSLEGKEKDIYEKKRKIKKALDEGKPIPTDLVDFE FQVRDEMKLNGDGEVKPPSIDDEYARCGIVDPKVFITTSREPSSRLTQFA KELRMIFPNSQKMNRGLHILKELVDACRANDVTDLIIAHEHRGEPTGIII SHLPYGPTAYFEIKNCVMIHDIQDSTPPSLAFPHLIFDNFTTPLGERTMN VLKYLFPVPKDDSKRVITFANNEDFISFRHHIYEKDTYKNVILKEVGPRF ELKLYKIQLGTVDQEEADVEWVYKPYMNSTKNRLFL >DLA_05599 g5059, MAISNINELPESILILLLNYVNSVSSSSYWLVNYSLVCKLWSTQILPEAW TDLLIATSQPSEPLIEYNEKGSLSQIASHGGASRAYKFNRLIMRIGKVTE YLDQLKSVTTVDLRNNPSTNSVIDRLCDSLKTNKTIKSLNLYNNRLMQKG GVSIARALEKNTTLTHIDLGLNLLGANGGNAIADALKKNQTLIHLDLSSN QLGFRGVGPIIEALKINKSVKYLILHSNQLRDESTLLLADILRQNSGFIE LGLNDNEIGSKGGIALARMLKTSKTHTHLDFGKNELGEDGGVAMADVIKF NKLITQVRLNWNKLGVKAIKAISEALKQNTSVNWVDLSFNNLTDEGLTIL SDCLKVNKAIRYLDLSRVATSAPGHKALAESIKVNQYITYLDLTNCKISN DGGVAIAQSLQSNKSIRTLILNQNLISSDTIQEFSKTLSVNTTLHQFSLV QNSLDISGLESLFQVLSTKNSTLGVLDLSSNLLGEEGGKTLAKYLSSFKL SEISMANNQLQSSGATAVLANLSQTIQTLDISNNAISADSATQLSKTLTN STTLLKLNISQNKLGDDNVPALVQSLQSNKSLIHIQISSNQFSQSSNNQL LNSIRSNKSIFFYDLVEEGSN >DLA_05656 g5107, MNNQKKKPQQGGNQQQQKQQQQNLSGSGGSGHSKLQIQQQPQLSASQSQM KDRSTFIYMSLIGYYVNVTIKNGVVYEGVLHSVQPTVGGGIGIVLKMARK KETGTITTQPTPTIMIEAKDFVSLVATGVQLEQHSRINGSGSPHHHHMMK GDGTINTDTDISGFDGNLRERELQPWSSETHYDESLEGDSGRENGSDKKW DQFATNEKLFGVKTTFDEQLYTTHLDKASEFYKSNIHLAEKKANEIENDK SLNMHLLEERGHIQGNDYDEEERYSSVVRKGNPNPTTTSPTARVPLIPNT DKYIPPRERQRLLQQPSPTTTTPATNPTTTSPTTSNKPEQKSTTPQKQQP EQQQQTPTKESTTPSKEKDGGDQPQQPTTPNTMISGGTGTANVSKLILNR DKQPNDEPDHGVLGSPRDGLSPRFVEYMKVRQGLTDSKTKNLSGSDTPKS PLIQNRELLNSLSLEVVSGVNPDVVNDFNNFKLGKMNQSMDRQTNFENLK TFQRDYNIKSKSRPSSPSVSSPRVLPPPSHFSLSGSSKDDQKSDDASSTT TTTVATTTEQTTVQEASNQDTKKDIESKPSKSETTSTSKDTTTAQSSSAT ATTSGDKSTSATTPITSTSATPTNSSTGSLSKFKLNPNAKSFTPTVGGTK APFKGSTDDLNKSTSNLQTTDTQTPINDVYYESMKKRQAIQEPSDSVPPY WVDPYGRQYEDDPYYHMRGIPSTMVTMPINAIPFYPNMPPGKGPTIITTK PLPYQPTPPRTYANGQNAFLSYVHQPQPPPGYQYVPQGIPVFNTSPPPPL IGGKRYFHPPPNSTQYSIPLIPNQPPPPQQPGTSPNRILTPPTIYPQPYG TITTTRYQPPHEVSPQHGYHPGNFQ >DLA_05769 g5197, MSNLFHVFQKQVQQSTGVEHCVKANLTSANDINLIVSKTNILQIYTIRYE KIEKPENHSNGDGGGSEDKQKIETRPCLDLVLEKSLFGNIESLNVIRFPD EQRDAIILTFRDAKISVLEYNVDLMDLEIRSMHYYERDEYKFGRQHFKHP PLCKVDHQQRCAVILLYDHSMVVLPFKQAISILDDDEDTTMNINDDDISS IMQYAQQQQQTQNPYYQQKSYSSSLLDPTDFCFLHGYYEPTLLILHEPTQ TWTSRISAKKLTSVLSAVSLNLSAKQTPTIWSIDKMPYNCESLLPVPEPL GGSLVVAPNILFYVNQSSRYGLAVNEYAQTDTGDQFPFPLDNTLNLVFTL ERSTYVFLESDRFICSLKGGELLIFHLISDGRSVQRIHVSKAGGSVLSSC ICVLSSNLIFLGSRLGDSLLLLYTETTVSDSGEEHENFSNPYKKQKTSEL FDLFDEDNDTVQMQKQQQLQKQQEEEEDDEDDIFKEKKSQIKTYQLGICD HITNLGPITDMVIGNSYDMYQQQKEQEEDQDDYNPPSKSSDSATTPHNLD LVTCSGYGKNGSIVQLQKNVRPDLISSIPILDVTNSWTLYYESEILQKTQ HNITGKKRTIDSISTESESPNTEESSNSDSKQSKDSSNDDSNSYHQFLYL SLSDSTLIYEISQDLKEIGKFNQSTLAMGNIFGKSRIIQVTVNSVKLISG ASTVTQELSFPTLKIRQCYIVDPFILIHFQNGSISIYQGNEQVHQLMEFP FLKDRLNITASSLFIDHHNLYFKSTSSSSILDTTNSSQLSTKVNLILIDS QGIIEIYKLESKELLYQYNNFFNEADILHWNENINDYENTISQYLNLNKT NKLTNGQHQPNINNNNNGELKHSKITELSVHFFNQMEWSNPYIIGINQLG DIIIYRGFKTPKDNILFRKFNHGIITRPLENSGGNGNDGKRIIEFSNIGG KRGLFITGKSPLWLFCEKNYLRVHCMNNEGAINIFTPFHNENCSNGFIYF TESSALRICQLPMDMNFENQFPIRKYMVKNTCHKISYDQVSKCYCLILSY PVETGEIPESDQRKPVIVEYKYQVKLIDRRDLSTFIDSFSLQEKETALSM KMVQLKFTDPDGQTRLKPFLAVGTAFTYGEDTQCKGRILIFEIITHIGQQ RLNLLYEKEQKGPVTALSSTNGYLLMTIGPKLIVNNFMSGSLIGLAFYDA QLYIVSISTIKNLIIIGDMFKSIYFLKWKDGKQLVLLSKDYQSLNVFTSD YIINQKTLSLLVADLDKNILMFNFDPKDPNSRQGKMMLCKADFHIASNIQ KFIRLPLRSTSNSTTSTTSNGNGNGNNKNHNSMIIPDQQMVFGGTLDGGL VTLIPMNEQQFNLLSHLQTKLYHIPHGCGLNPKSYRSFKSYQQHYSPSIQ QPQKFILDGDLIHHYLTLNNNDKHLLAIQINSTPDEIISILNQINYSSST F >DLA_06010 g5401, MLKYQGCTHFRQRIICATLSGRPIKITNIRDEDEKPGLRDFEASFLRLID KVTNGSKIEINTTGTALTYIPGIIMGGKSLTHECGQSRGISYFVEGLLCL GPFAKAAIDITLTGITNNDLDLTIDTLRTTTLPIIRKFGLEEGLSIKVLK RGAPPNGGGMVNFKCPIVPQLRAVQLVDEGKIRRIRGIAYATRVSPQFSN RVLDTAKGLLLEFTPDVYISSDHYRGTESGLSPGYGLTLVAETTTGCCLS AECMGSSSGIDSGESPEDLGKKTALALLEEILNGGCVDSHNQSLALLFMV LCPEDISKIRLGKITEYTMEYLRHLRDFFGVTFKIEPDQDSKTVIFTCLG IGFKNMARSTF >DLA_00627 g569, MVFIGLRLKTALIEALTVILSSILIFLPIKFRNFNALILVPVFVTIIACN LSSQSSMISGGIVVISTATSSLVIYAFLKIFQERIWVSFIVGFFFAFFLQ CTVLRGGRWFNGLACKKILLDLVIVYYFSPYPDSEKETDILETFICSLFM LFSIVIVSIIFPVMATKLFHFNLMRTLRTSRDLFRAIGYSVESKLYKDPN QQTSSKIPKNHSFTFPLNDLTGHDDEDGEDDIKVQHTSIEQQLQQKNDMT TLEEKTNESQEKDIVSSSNTMKSSEKNIEKIVLPLSSLVTKKEVTFQLPL GGEKEPKSPNVSGDIKLKKLKKSKSVEILSKSIPIPTDKEIKELQFRLTD EVNRLTLVLKECKEERWNSTLVESYKAILNLVEMSLKHLMSLRISIESGF SQNASRELVSPMEPFLDSLIEEVYLQIGLMIDVLKGKLHLSETPTSSPNS ANAVSGADNPKRKFSRKEQKTVIERNILESSFEETDELIVKLQEFYKQLV GEYQRSGLPGLHESEISRLHFFIFGIIEYARQQKVIYQLVLQIKARIRHE SIRYQVIRYGLVYVLTALPIHWYKVTLFIISKFSKKSDVDKETPNLNQQQ TVRNPDSEVPKKDHILKRIFKFIVNYIYVLCFKNGKWKFPLQIAIAYTSS VIVFWYINGETKGELVIKGVWTCATAILVMSPSVGASLLKGFNRVIGTMG GGGVGFLVSWLCSVIPKGGKEVVILAFTFVWITIISIIQQNPSFSYSGAV SGLTFVLVVYGQYLYGFDYWYALFRSFHITMGVVWVIIICLTVFPYFSFQ YTRIKMVNTTIQMSRTFVNIIRLGLKIETLNQSQEIMMDIDYTDRDRRAK EIRKSLTDQRMILDQIKLSLNDIKSELILMPNKSNAYRKVYKDLSYSYTR LVAAEASFRSSFSDPLLQAMSPINQKIQGIFNELDALAKDLNIFTTLTIS KSQRSQLTVDHEKQLTDSVKALGDSFQEVRVDLLKRRILSTLHPEMIQFG SGMYXEKMSEFGKAGSGGIQSSQYENIDRRERTNRVAMESVDVSKDPYIM TNHLGSYECKLCLTTHNNIGNYLAHTQGRKHQTNLARRAAKEQKDNPNSK IVPTAAKRLVTRNVVKIGRPGYKIIKQRDRDTGQLSLLFQIDYPEIEPGL QPRTRFMSAFEQKVEVPNKEYQYLLFAADPYETIAFKIPNKEIDRSTGPD GKFFTHWDRNKLTFTVQLYFKESTLKSTTSTTNTSTTNM >DLA_06760 g6073, MELNTEIYNSMLSVYPTANGNFIYSNMLPTGQTAYATSIYPSNNNTGTGL CNTTPTTPQYYINQAYSISPTLTSVKVSSPISSPYTSPLPISTTNNNNNN NTTNNQCINSNINNNVSQSSCSTNIEISSSTNSIYTSPPMNSANCSPMKT NNNKRDRDQMSLSVQSPSSQTNSPRKSITSSPTLKPQSPPTSASTTPTFN SLPVGLPSNGVVQQQQQPPPPQHQMYPQQQNYEIHHNFQMVQQQQQQQQQ QPQQFILHPSEKSQISSVIPQQQLSYDDMFLSNICPIAYQIIDVPQQPPQ LQTTPPSFYLYQAPIIASQTPILAGHQLDTHDQHVDKKLKLDVSPSPSVA YYSTQQPGTPTMISTPTSHIHVEKACGSCLTLVNKFMVDPQSIIHCPSCD TIYHRNCLMFNSNTHWYCNVCSQYQYLYIQQQQQQMVPPTPTLPIQITQS TLSSSSGMPPPLPTQLSNQQLNRSDSDDDTDQQDSDQVAANCDDDDDSEE ESDDDVSDDDSDSSSEKKSKISHSSNSTLTSSTSTVSKKKKSSKKSHKSM NGDVKAKGHWTKEEDEKLKNLVDVHGTKRWKYIASLLCLRNGRQCRERWS NQLDPTIKRDAWTLEEDRIILEAHSKFGNKWAEISKLIPGRTNCAIKNHW NSTMKRKLSKKQYDDILLPNSNNSINMSPNNSMEIPIVCTTKTDFANIAV VTNLNATTTSPTIQTTPYDNILHHPYSLSLMDENSSSSLMDYQQAPQTIP VQLDFANSNNNNNQHHLNDFVHNNSNNNSNNNSNNNNNNNNNNNNNNNNN NNNNNNNNNNNSSNVFKSTTNFFLEQKQYFENLLSQQQSSPLTSAVTTTT TNLNIPTINNNNNINNNNNTPATTTTTITTINNSPSLSSPRGKSKHSHQI LNANSNCWICESITFLPPKGSDFKSNKQHPLSREHCQYFEIASPPLDPTM EKKYYICHAHYNSFRRRSNSGKLDSSNNSSGSPLITSSNGGSSYFDPQTV EDQVIQQMRSQKQWNDLESILKMKNENTKSDTGKIDLVQLYQTLIDTQST PLEECFLKLYQVFNIKAKSKKKKSDLSSSAASTNSDKDSTDEPNKKLIKN NIKFKIKNLLVTFPHLKYYGSEKEFQLQRLQKVPEILLVIDNPYLKKKFL E >DLA_06993 g6279, MGIDSLNSFLAGQFPKFKCTTTNFEQTDHAYVDLNNLCYMNSGKVKLNYN QFFFKLIPRLRLFSTVLAPKKTLFLSLDGPGPRSKMLEQRKRRWKRSDKS NFANSINQMNVDDEEDDDNDRFEDDIDSDRLLREEELEDFESESSDEINS SPTTDNNCNGEDKMNVDRQPIVNILEDKEPNTFILRHGEEKFVSNNLTPG TEFMGALKDFIEKFIVKEFTLKRKILDIYFSPADRAGEGEWKIFQHLNAQ NYDPNDRILIYSNDTDLIISSLLSKKNIVIVSRCMGKKYEIIDIPALRKM IIESVRDFEQKDPDRIIDDFVFMAGMCGSDYLPKFTFFSSEKYWLSYKKL KVDEYIYDSNSQQINIENWKSIMSHSLFLPNWFHKPTTPNTNTNTINTEN NNNNNKDMRPEHIPEAVYSLHLNNYINEKFLKASRIVQASLFPGNTDYTS TIYFNHNEMENCLDLVVGHQVVDSEPMDYSNPFGRAKSEKNIMKNLKSRF TDCSHPFWLENGSKILPTLNDRLQEVFKMNPHELVVKKFKVVENPNEKVN NYLYALVWQMKYFMGNCNDFNYYYPYFTSIEMDEIKNFIFIPNYSPTKIA QPLKPLYFGFVVLDSSYPEFFPEAYHPIYKTLPHYGKAKDANNELLEEDG VRVITNLLDKYTEENKSNFTKYQISQLQFQPTIKFSFVRNFILLQIETPK SYIDISVPEFGPPRYMLMRNINFNSTLSEHPFYTNEEINRFSYSKANLFR DRPRLNNSNINIRITTNTKTNFNSNTNIKFTTNPINTSNTKININTSSTK IITNTNNNINSNINSNINSNINSNINSNINSSNTITNTNINNNINSQLTQ KEIEHILVLAKKRDSIQKSINNGTAQPYSTKKLNSTNDKIQKLINMRNVN TDVQALKKMIESNNEKKEGWDSKNIKYELDKKNLAMELMKQNTIITPIGF KAILDRSDAALAGKPHPKDDETSTLKRYPSLRAKQQTEFNLILSKKNKIL SQLNSVDNQSLSTLSDDDKNMLEKLEIELFDLGCICPSVFKVSTNTQNND QKVTVDSAKPNTMNVDNAKPKPQKKEKKLPKIKLTTTTTTSLETKSVIVD SNTTPTKTCATSPKDNNQKPRKSRAKKQEFHKRTLESTERDINDSKKIKI SK >DLA_07314 g6563, MESDKVELNSNSNGTNVNKKKLKFQKKKEQKKKQKLKKILESKTSTNGNH IHSNGNGSVEKKYESDDIGFKLDENDPTFDFYSKLLNHFDGNDQEQSQQQ QQSTDTIDKSEKDNEKSNEIENTDNKTSENGDDSKPTEKDKEKKLSNKER KRQQRANLPVLKQLVDRPDVVELHDVNSPNPGLLIGLKSTRNTISVPAHW CQKRKYLQGKRGYVKPPFELPSFIAATGISKIRDAILQKEQKKSSKQKQR ERLQPKMRSMDIDYEVLRDAFFIHQTKPKLSVIGDLYYEGKEFEVSIKNK KPGQLSQELKRALGMQDNSPPPWLIYMQSFGPPPSYPNLKVPGVNAPIPE GAQYGTHIGGWGKPLLNEFGKPLFEAYTSQQTNVVTMNDNGEEIVREYWG ELIPEEEVEEEEEEEDQQQDDQENQQDGNSLEEMDQSNDGQSSVPSGLET PDLIDIRKHRMNMDPNGPKQLYQVLEQQNVNNNNNSGFMESTHRYQIPQV IRNSTSQSSARGGGATGNRVDIIKSHRTGPVDINFNPSELENLNEISEDL IKKKYEQAVAATQDNYRNKQSKDDYSDLIEEQSKKRKQQLQKEQDKLKKF KF >DLA_07524 g6756, MQNYHNQMNNVHENQQYYYLQQQQQQQQQQQQQQQHQQEDDIKKEIDSVG ASYPPIILKPQAILQNIVDNPQQQPQHKSPELDNKKQTENLKNENPPQPT NPTVKSSSSSPIINSNSSNNNTTNTNNNNNNNNNSNPTTTTTTTTTTTTT TTSTTNNNNSIFEIPDILLTPGTRSPYIQSLSNQIISIILGKTHRDEPTS IIFTNISSVCKLWRQVSVERIQSYIYHLPADKQITNFFHNISHNLYPKLT NLQFKVTTPTSSTFDVSNFVKLLMSNNKIIVNLELSQNGIGNKAAHCIGA CLLENQTITSLNLSFNSIGNEGAEEISKALQVNKTLISLDLSQNCIGLKG SKALGTALQSTIVLQTINLSKNRFGTKGIDSISEAIGKNQSLHSVDFSKN DLCEKSAKIIGEAIRKHPFLQTLNFCDTKLSAEGVKYIAEGIQGSQTVSY LDLSRNEFGYKGLKPIASALAQSHSITYLDLCGDIIGDKGALMLAEAIQT NNTITNLSLAFNSIGYPGAHAIGRAISVNTSLQNLNLSINAEIGPNGAYS ISEGLCFNKKIHTVNFCTTGFGPQGGRYLGDALRFNNTLTDLQLRGNEIS DEGCKAISDGLKQNTSVTEINLSGNGIGNEGARQLMEALWFNHSLTSIQL THNNINPSGVQYMKEVLQQSHLVNSDSYFHPPNSTTTVSCLYVTRSNNTI CRIVI >DLA_08046 g7211, MNSVSTPPIIASTTAPISTNGEISKPQYYGVTEPISLSFPTSVDLKFSQD LENTLKSFGLFESQEESKKREEVLGKLNQIVLDWAKKVSLKRGFTEQMAA EVVAKIFTFGSYRLGVHGNGSDIDTLCVGPKHIMRSDFFDDLSEILRVHP EISEFTAVKDAYVPVMKMVFLGIPIDLLFARLSLASIPEDLNDLIDESYL KNLDDKSILSLNGCRVADQILKLVPNIPNFRMALRCIKLWAKRRAIYSNV LGLLGGISYALLTARICQLYPNAAPSTLIHRFFKVYEGWKWPAPVLLNHI QEGGIFAAKVWNQKKDKGHLMPIITPAYPCMNSTYNVSRSTLYLLKNEFI RGAEVTRKIEKNEANWSLLFEKSDFFTRFRFYLQIDAISANDEEHIKWEG WIESKLRFLILNLEQTPNMKNAFPYPKCFENKVSQTVPTGFKCSSFFMGL AFNFTGENKSVDLTKAVTEFTAMIKATDTKTPTMDMKIHYIKKKSLPVFV KDESPPEEPRTGNAKKRNIKDISVAAAAAAASTATTTNTPPTLTSPITTP ATNNSLEAINKKLKSDLGEPIVSPSSTAAISSTITSISPSSSSSSIPIPP FSVSTPSPISRSTSSSDLNLQPISNITTATITTTTTTVDTQMSDANTSID NISADITSNQDQVQMTKKINTLEVNELDFISGNSVTKEPKPSMKKPGISL IRG >DLA_08247 g7376, MGVPRFFRWISERYPQILQKVLESNPPEYDNLYLDMNGIIHACSQETSKL LSFSEDELIRQVCNYVDLLFHTIRPKKLFYMAIDGVAPRSKMNQQRQRRY LAAFNEEKTKRELLNAGKPIPEVEFKRNCITPGTPFMHNLSEALQFYIQK KISEDLSWREVQIIFSGPENPGEGEHKIIDYIRKNKASPDWDSNQTHCIY GLDADLILLSLVTHEPNFSILREEISFKKESSKKPKKEKPVDFQLLHIPI LREYLDLELRTDNLSFGYDLERIIDDFTLIMIFFGNDFLPHIPMLEISQG GLNSVLELYRNSLEDLGGYLTNGAEIDLDRLQGFMVKLKNFEQSQKVLPD SKEQEENEEMIEDMLLEHDSLEEGEKKRLEQLAMDRLKQHFSDIHVSEDS TTDRDSPSMYENNYYRVHFQDFPETYEEIKKFKQDLVLNYVEGLSWVLNY YHNGCISWNWHYNYYYAPMAGDFVNVPQLLIQFDYGAPVTPFQQLLSVLP PQSSELLPECYRYLMTSQSSPIVDLYPVTFEIDAQDPHYFDGIAMIGFIN HQRLIDATFDQSQFAYTDKEKQRNTLKNAVIIYHDDDIKKHMVPSPNNKI FSDLAESTATTEDIILPIHDSGLKPFRYCDDVLTGVHGPPGFPTLMTQKF QWTMRAGVIDVWGMRTKKDSFIITLENPHIRQHLNREISSTEQLKQLAQQ FMSKKCYVNWPYHTEAKIIGFSTIDQMINSEGQISQFSASQKLMFVDEMK DMKSKYLQHGIDLYEPGKKVDYNYCPGIMVHVNKLVGVDSLPGGGTKKRY SDAESVYPIELMVDYSSLKSDPRFEEIVNVPFESRYPVGKKVIYTKRDKY FGCVGTVTMAYDNQLKLDLNVPSQPIDLEFGHSVAQMDERYFAINEVAKL LEMPISHVNLLTGGLYIAKPSSDIGLNLKFAGRQQQVQGYCRGIVVARTP EGHVQRKWEFSKLAVDLMQSYLKEFPIVYKILQYYTTNKAEVPTQNGYSR TMVDITPLFSNNEEKTATLLKIQEFLDRSEIRKKRIVSCDTMSLSKDLIQ KIQEHYLAISEKCEMTVQSIHTIADNVNDPLSYESIISYERQQNHQHPGN THGQNSPGNKSPKFSSSTPNNNHSNNSKKPSNKFRIGDRVLTSLEKGNVP FGRLATVVAVNDTKVDIVFDQECFSANSLEGFCAEKRGLSISTLRLYNLS NPFSLYQKSNHHQRGYKQKSVDPHEFWEKLNSSKDGKLPQDNESIPTHKL HTPEQVEERLLNTSNQEEEQLSWQQLEMLNGISGSQSNNNNNNNNNNKNQ QSNKNSTYSTNYVANTLRTSSGQQKNVNYLMKRAPQYTQNNFSSEKEYTK QNPKLPTNFYYDQNGNPVQPPPQKQKKQKSNKVNVEKQHRPVENVDPKKQ KQLDFIFQNIKADQNNNSNGGSSVGTSTTPSTSSSVESNPPQTAQAALLL RDIFSSTQPEHSIPPPQYAYPHYPYPHPPPPPPHFQQPYPYPPQFPFPHH LHTSNNNHNNNNNHKLNLNNQNNQRNQNNQKPSNHQIKNFNLKMHKLKLN NHQNQKHQNQRMVQPKRNNLQKIESINQKFLHQNHKIHHQLNNNNLNSSA PSYHHLINNKKKK >DLA_08495 g7593, MEQQQNQTQQQQQQQQNKIENQIYELKKQQELRFEVEFDVKGTIKLIEGS AEYFGTELSLGKTYKVTSSKGAIFTWSGCKIEVSGNVVSYIGQETPMLLY AGIHKIIDDKRTEILDKPSESGPRVIIVGPTDAGKSSLSKMLMGYSCREG YQPVFVDLDPGQGSITLPGTICASLIDKPIDIEEGLSNSVPFVLYYGHTS LDVNPSLFKAMIASLASNVERRLETSEIARASGFIVNTCGWIDGLGYQIL LDSIQTLKANIIVVMDNEKLYSDLANQFSSGGVVVKKLPKSGGVFLRSPV FRKKTRMSKIREYFYGINGDLCPHNVVIEFKDVVIYRTGGGPQAPMSALP IGTQSQIDPLQLSEVSPNPDIIHSILAISYTKQPQNILKSNVAGFLYVTE VNMETKKITALAPCSGPIPSKYLLLGTLKWLE >DLA_08534 g7631, central component of the U4U6-U5 snRNP complex contains the PRO8NT PROCN PRO C-terminal and Mov34MPNPAD-1 domains found in pre-mRNA splicing factors of the PRO8 family MTDVTPMSDDKLLEKSKKWIQLNNKRYSEKRKFGFVDAQKEDLPAELLRK IIKDHGDMSNRKFKQDKRVYLGALKYMPHAVLKLLENMPMPWEQVRNVKV LYHISGAITFVNEIPLVIEPVYVAQWGSMWVTMKREKRDRRHFKRIKFPL FDDEEPPLDYQDNIFGCEVEDSIQMDLDPEDDQAVIDWFYDSRPLMRDQR YVNGPSYKRWRLDLPIMSTLFRIASPLLSDLTDPNHFYLFDDQSFMTAKA LNMAIPGGPKFEPLFKDTNMDLDEDWNEFNDINKLIIRHKIRTEYKIAFP YLYNNRPRQVHTPHYHSPNSCYIKSNDPDLPGFYFDPLLEPIPSYKTAGN YTSNANSEIGEDDDEFTLPDHVEPLLNGYELDSLNTPSGIRLYWANKPFN TRSGRTRRAEDIPLVKTWYQEHCPPKHPVKVRVSYQKLLKCHVLNQLHHR PPKSVNKKNLFKALKQTKFFQTTEIDWVEAGLQICRQGYNMLNLLIHRKN LNYLHLDYNFYLKPIKTLTTKERKKSRFGNAFHLCREILRLTKLVVDTHV KFRLGSAEAFQLADGLQYLFSHLGLLTGMYKYKYRLMRQIRMCKDLKHLI YYRFNTGAVGKGPGCGFWAPMWRVWIFFLRGIVPLLERWLGNLLARQFEG RQQTVAKTITKQRVESDYNIELRAAVMHDILDRMPEGVRANKSKVILQHL SESWRCWKANIPWKVPGLPVPIENMILRYVKAKADWWTNVSHYQRERIKR GATIDKTACKKNLGRLTRLWLKAEQERQHNYLKDGPYISGEEGVAIYTTT VHWLEKRRFSAIPFPQTSYKHDIKILTLALERLKEAYSVKSRLNQSQRQE LALIEQAYDNPHDALATIKRHLLTLRTFKEVKIEFMDLYSHLVPVYDVDP LEKLTDAYLDQYLWYEADKRQLFPNWVKPSDNEPPPVLIHKWCQGINNLD EIWETANGECLVLMETQFSKVYEKIDITLLNRLLHLIVDQNIADYMSGKN NVVINFKDMNHQNGYGLIRGLQFASFIFQYYGLILDLLVLGLNRASEIAG PPNLPNTFLTYKDVETETRHPIRLYQRYVDRIHVLYKFTQEEARELIQKY MSEHPDPNNENVVGYNNKKCWPRDCRMRLMKHDVNLGRAVFWNVKNRLPR SMTTIEWEDSFVSVYSKDNPNLLMAMCGFDIRILPKCRTTTDQIVPNDAV WSLQNVNTRERTAQAFLRVDQDSQERFENRIRMILMASGSTTFTKIVNKW NTSLIGLMTYFREAVVSTKEMLDLLVRSENKIQTRVKIGLNSKMPNRFPP VVFYTPKELGGLGMLSMGHVLIPQSDLKYSRQTDTGITHFTSGMSHDEDQ LIPNLYRYIQPWEQEIKDSQRVWAEYALKYEEAKAQNKNLTLEDLEDSWD RGIPRINTLFQKSRHTLAYDKGWRLRTDWKQYQVLKLNPFWWTNQRHDGK LWNLNNYRSDMIQALGGVEGILEHTLFKGTYFVTWEGLFWEKASGFEESM KYKKLTHAQRSGLNQIPNRRFTLWWSPTINRKNVYIGFQVQLDLTGIFMH GKIPTLKISLIQIFRAHLWQKIHESIVMDLCQVFDQELDTLEISVVNKEA IHPRKSYKMNSSCADILLRATHKWQVSKPSLLNENRDTFEGAITQYWLDI QLKWGDFDSHDIERYSRAKYLDYTSDSMSLYPSPTGCLIGLDLAYNIYSA FGNWFLGVKPLVQKAMAKIIKSNPALYVLRERIRKGLQLYSSEPTEPHLS SQNYGELFSNKTIWFIDDSNVYRVTVHQTFEGNLTTKPINGGIFIFNPRT GQLFLKIIHTSVWEGQKRLAQLAKWKTAEEVAALIRSLPVEEQPKQVIAT RKGLMDPLEVHLLDFPNIVIQGSELQLPFQECLKMEKFGDLILKATEPKM LLFNLFDDWLNTINSFTAFSRLILILRAMHVNMERTKIILKPDRNTVTQP HHIWPTLSPDEWVKVEVSLKDLILADFGKRNNVNVASLTQSEVRDIILGM EISAPSQQREDQIAEIEKQKKEASHLTSQTIKTTNIHGETMISTVTSPHE QKVFSSKTDWRVRAISATNLHLRTNQIYVNSDFAKETGFTYVFPKNILKK FITVADLRTQIMGYCYGVTPPDNPQVREIRCIVMPPQWGTPVFVNVPNQL PEHDYLKDLEPLGWIHTQPTELQQLSPQDVITQSKIMSDHKSWDPEKSIV ISVSVSWPVTLTAYRLTPQGYEWGKSNKDSLNYHGYQPQYAEKVQILLSD RFLGYYMVPDRGSWNYNFMGVKHSASMTYGLKLDYPKNFYDDCHRPSHFQ NWSLVSDTSSTTTSTTTDSTENQGPDSENLFN >DLA_00904 g824, MGKNKHNNQKIKQSYKKHKEEPTTQSNGGDASKNEKLTWDQVKPDFEIDG SLMEGGGQILRNTVSLATLYQKSIKIEKIRYNRDQPGLKMQHRTGIELLS QLYKADTIGCTHQSTQLYYKPTKEHVDQVEIDADTKTAGSIGLLIQQTLP CLLYSQHETKMVLGGGTNVDFSPHADYIVEVFQPIFTKHFLEGTNAQMDM SIEKRGYYPRGGGCVKLNIKPTQQALKPITLLDKGNVILIQVKAYTSGRV TPLVGQRMTQQARKSLKKEFKKVDIECEEIDCTNRSFGDGCFIFIKAITD TGCIFGGSSIGSIGVPAETVAQNAVDSLVKDLSDGGCVDEYLQDQLIIFM ALASGQSKIKTGPISLHTNTSIHFTSLITGCSFQIEKVPKEQEQPGEDTF IITCNGVGFIKSSNNQQTNINENNNVTTTSTTTTTTTTTSTN >DLA_09295 g8329, MPKYYCDYCDKYLTHDSPSVRKSHTTGKQHNMAVQLYYQQFEADFHQEMH EKNLKELASGKMPIIPQFFPPGLLPVPYFLGPEGAATPPGLFPPPNPQQH QQQFQQMQMQQLQQQQQNQEQHHMQPQQQGMYPPQMQQHIQQQQQQQQHH QHHQQQGYQQHHMQQHMQNYQD >DLA_09379 g8410, MEVDKNQVSEWDDSELKKKPSTVSATPRRNRWDETPVSSSGATGGFKGTV ADTPNNKRKSRWDETPLNVTSQTQATPMYNLGGATPKYDGSQVAMTPNYS GLVKQTPMIGGQMMLDPQQLQIQRDIEERNKPWTDEELNSLLPSDGYEIL TPPAGYVPLVTPARKMALASQTPVSGFFIQDENRKQDYGIDTQGPVDGLS MKPEDKVYFEKILGSGDGEGEENLSPEEIKERRIMKLLLRIKNGTPPMRK QALRHLTDRAKEFGPSALFNQILPLFTSQSLEDQERHLLVKVIDRILYKL DDLVRPYVRKILSVIEPYLIDQNYYARIEAREIISNLSKAAGLACMTAKM RPDIDSPEEDIRNTTARAFAVVASALGIPSLMPFLKAVCKSKKSWAARHT GIKIVQQIAILMGCAILPHLKSLVEIIGHGLEDKEPKVKTITALAIAALA EAATPYGIESFDPVLKPLWYGIRLYKEKGLAAFLKAIGFIIPLMDEGHAS YYTEQVMLTLINEFKTSEDEMKKIVLKVVKQCVSTNGVKPQYVREKIVPE FFKHFWVRRMALDKRNYKLLVETTMELANSVGGGEIVALIVDDLKDESEA YRKMVMEAIDKIISTLGAADIGPRLEDQLIDGILYAFQEQSDESSVMLNG FGTVVLAMGTRIKPHLTTITTCIKWRLNNKSAKVRQQAADLISHIAVVMH ACGEEQLMSHLGLILYEYLGEEYPEVLGSILGALKSIVNVIGMTKMTPPI KDLLPRLTPILKNRHEKVQENCIDLVGRIADRGADFVLEREWMRICFELL DMLKAHKKGIRRAAVNTFGYIAKAIGPHDVLATLLNNLKVQDRQNRVCTT VAIAIVAETCAPYTVLPGLMNEYRIPELNVQNGVLKSLSFLFEYIGEMGK DYIYAVTPLLEDALMDRDAVHRQTACSTVKHMSLGVVGLGCEDSLVHLLN FVWPNIFETSPHVINAFLEAIEGLRVALGPTIILQYTLQGLFHPSRKVRN IYWKVFNMLYISSQSSLIPSYPKVLNEGPNTYQRYELEYII >DLA_09523 g8542, MSMKRNVDSIYNNNNQNGNTKNIKRTKPQEEDDPLDSILGEIKTTQQGKY NNSSRSTFSLENVISKIMEMTNNGSNCEKLEVEGRVGLIQPGMNGGINFK PGMIQDDWERLREYLASRLSDKQLIKETDYIYDNHRVTYSEDQKKCIRKE AKTSKITYDQSSSLIYDFRISLCWEESSPPPLEVPTDWKSKREKMRYTFR DRDWKIDLTRTMVYDQFSQIIENPYEVEIELYPQSIKSCIGNRNLPVMMG NFIQEVRNLIAIIQPPGAMTFPDVLMEKVTVPKEIDQLRDFVFAYLPEAN KFKYEMFPGSMPINFGKKHIYNVQSNEYYVSEKTDGIRYMLLILASGSYF IDRKFEFYQIQNYSVLDETFGNGTLLDGEMVRHLQNRKPVFQIFDILGID NQSVCQLPLSERLKIIGAKVIQPLRQVLPPNTEVPFTLLGKVFLPKHKIA DLFARIRDHHSGERIFSDDDKRNHFTDGVIFTPNTAYMPYTVQNLYKWKY LDKWTIDFKVTERNRVWYLCCVGSGNVEVECREVNFSQEDLDRLKKEFLR ARDISCIIAECSFQPKYGTWKFHQVRPDKRKGNYISIVMDTMESIAENLS TEELKYRIPLKPDQDQWDEEFQKLRSTMLLNISKNQQRK >DLA_09626 g8633, MGIPTFFRWLIDKYGNLLSETIEPREADGSRSRVDFTLPNPNGEYDNLYL DMNGIIHPCAHPENGPQPTCLQDIVDSIYEYLDLVFAIIRPRKLVYMAVD GVAPRAKMNQQRSRRFRAALDSRLSREKEEREWRLRINSGNATEEEYEDF KKQKSLKFKFDSNCITPGTEFMNHVALSLRSYVDEKISTDPAWKDIKVII SDASVPGEGEHKIMDFVRHQRAQEGYNPNLKHVIYGLDADLIMLGLATHE VRFDVLREFIPKMKCHKCHQSGHFSVNCRTIPEDIEEPSEKEFLTKNYQI LHLHLLKEYLELELKVNTPNFKFNLDRLIDDFIFLCFFVGNDFLPHLPNM EIRGGALDRVSKVYKQLLPTFEDYIVDKGDVNMERLSSIFQELSKSEIEL IKRNANRERQFLQRKQSMARISEPPKSLDLNQAVTFEHRQAANNLLAEIF QPVTDIKDGTEERPAKQLKSNKQAALEVKQEIKAGGKKSTIMISNKEAAD IIKNQLLQKQQSIVEKESKKGSKKRARDSDEEEETTKGSEVPVNTAAFKE KDDETRSIFFDNSRNIRYEEEGWRARYYEAYFDIKDEESDKIKDICKSYI EGLIWVLRYYFRGCCSWGWYYPYHYAPFIADLAKYCDEIEYPTYSLGQPF KPFSQLMSVLPTASSQFVPKPFQTLMGITSDGKETLDGDSPIIQYYPLEF KIDRAPHQPEYKGVCHLPFIDETLLLPTLEKYESLLTQEEVDRNSLGHDI MFFSKDDQVSQQYLKLKESPKVVHFTIENSEILGFVCENSELIQKHMPQL NKSMAYKYNHPALPKGYTFKYTTLKGAIIPAKTITQLKSQSMNQNSAADR LVNGAQHSSQYKNNQGYNKQHGFQNNQNNQYNNNNNYNNNNNNNNVNYNN QQQQQYNNNQQQYNNYNNYNNQQQYNNQQQQQQQQYNNAYSNGYNNNQQF NNYQGYDMYQNYNYSGNDMQNYNNNQQFNNYNNTGNNFAAYNQQKFDAYS NNQNNYLNNQNYQQYLQQQQYNGYLPNNSNNFNQQNQQNQQNQQQQQQQY NNYNGNNNNKNHSNNGKPQQQKNITQNQNNAKQPMKYNPFGGSNRK >DLA_09996 g8974, MSDWAEATTEDGKKFYYHKKTRKSLWDKPEEMIQYENALNSAKKSATPPW ASSNSSVANNNQVSSGSVNNNNSSSNSSNSNNNGPIWKEYITKDGKKYYH NLVTGHTSWDAPDFYQPAILPNNVQQKQQQQQQIQQSSGTNMNQSSGSTS GREIFIELLKENEIGTTWTSDRAFRLVATDERYQALKTMTERKLVFSEYI AEKKRTEMEEKKKKEKKNRDDYLALLKETPEINPLTTWRHASLILDGNPR FEALDSERDREDLYKLYLDDMEKQEKDETLEKKTENMKLLKQKFEQNPSI TFSTQWRKVRDEYETDPLYTSLDNFDFLSVFEAHIRELEKKQDDLKRLER EKQKRESRKDRDAFRQLLADKYQSQELHALTRWKEFHSKIQNLPEYEKLS QQTSGSSPLDLFVDFKEDLEKKYEKDYKKLKDIVRALDFNYVPQQTTIES FKEAILKHEKISTVSPQNFLPFLEYLRYKEESKEKSLAKKKKKQQKHFQQ LLSDQRNINAESTWQQVKQQIQNEKYFEELADEDERERLFNQHLEYVKKY LEENPPSTATNSNGTIELEDGEEGELIEGESNDLKKKRPHNFDYNNNNNS NRMDYDQFYDRNYMVYDIDDRPFKKDKRR >DLA_10273 g9222, MVSFFIVPETLVVNVPTYICYQQTNSTKTVIPESAIKSIQWFVNDKEVQH YPSVSDDHKEPKKESLFQKIFKLGKEESGSEIKKNWFVPDSSHVGGTLKV RVVLKIKELHEKSEHVVEVLSNGTFTTHPTRQMYIYNSESDENDFGFTTY NIMADCYTHPGRYKTPEYALFRPYRKHLMAKYINFYKSDIVCLQEFEPEF TEVIKEMDEEGLTSTPTIIRDGSRYQPPDQCISFYRRSRFHLIQQHIIDY NIITSSGLISKEQIEKLKSNPVTNHFLEGVLKTNHHNRFSFLHLTDLKTS KPLIVINVHLYWGAPTEDWNYKLQLMQFYIITLILDDYTAKHSPNTPIPI ILCGDLNNEPHQKVNKFITQGVYNENHEKDLYTFKHSYKFSSVYANHPMG ETSFTIATKSYQKCIDYIYITKSNITVKSWLEVGNHYSETLPSVTEPSDH ILLKANLNLNSEKLNTTEKKIDS >DLA_10308 g9253, subunit of the splicing factor SF3A required for spliceosome assembly contains PRP9 domain characteristic of splicing factor 3A subunit 3 expressed in pstO cells MSSTLLERTRNLHENIERYELLIENEMNIPPPNFKETILQNHRVNHYLES SIDCAKELQKIYNDDDESRKNELESMSGKDIFGSFYGKLKEIKEYHRKYP DLKDQRNNSSLYFNATVPFLGNEHFGKYLDLNEIYDVYLNLPFLQNRIDY TSYLSKFYEFQYANIVRMKYPVYREYLEKLYQYLISFLERTQPLFDLNQN LERYEKEFNEKWDKHEYDPKMEAEGDRKDDDEDGGDTSSPLYCKACRKMF SSESVFKGHLGGKKHKLNQTKMTSDRDSHSYLNLKQRKPTTFLEFKISKL GELLSDQTHATKEMVLRKQSRSATEINVEEEPVEDEEINIDDMDTVDEPT KLKIANYPVDFSGKPIPYWVYKLNELGVEYKCEICGNQSYWGRKTYEKHF QESRHSYGMSCIGVPNTIHFHEITKIKDALELWAKIKKQNNEKTFKSDRD EEYEDENGEVMSKIAYEMLVKQGIIRKRKNM >DLA_10395 g9327, MNTHNTNYKWKFSNPLKTNWFLKLPNEIIQLIFSQFFNRLELPNNTQFLR LLLTCQHWNSIATEMYTQFKLDRVKKLPKVGDFQYLKRYKRSLQSIQILG GQSLHMGYINIIVDIVQINQLTHLNLFSNKMNDYCIIRLIQSLQYQSSLR FLGLSDTGLTSHTGPYFSDLFKCNKSLREVVLSHNNLGEIGAVAMSKGLE MNDSLQILNLSYNDIGDIGAREIGRSLQLNKSIQELDLRSNCISPNGSSF LSEFIELNQSIHAIDLWGNSIGKDGASDIGKALASNSSIRSINLTRNSIQ SAGIKFITAALVSKNCNLKSIDLSSNSLCSDGAKDLSEALFRNQSIQSIT LSSNKIDHVGIKALCKALRHNQSVTYLNLAFNEISTLGSRYLRKLLKRNT SLRCLDLSSNQLGAESVPVIESIQLDQSPLESLILSHNSINITSIQSIYH HFNVTSQTTPIANLKQLRLEFLSPTNQDKEKAASLKLLNRIHNNLVIKLF >DLA_10564 g9478, METQNENINIKVEPNTDDNTSDITTNSSNDQPIPEIKNENLDSDQQQENQ QQEEEEEKENKSLVQTEEIKIPIVKKEEEQEDDTKKVKSPTSSSSISTTI TGTLQPTTSQLINQPKKSIQIEILENRISIDLYDTEAWTLLLGEIQSQPI DNCRAIYERFLGHFTTAGKYWKVYAEQEMQARNYDLVEKIFFRSLRNVRN VELWRTYITYIRQHKSQNQREEIIKAFEFALEFVGMDISSTQIWMEYLNF LKEEKTNNTFEEGQKMTNLRKLYQRAVENPMHDLDIIWKEYDQFENSINK VLAKSLLQEHHQKYQHAKSVYRERKSLLEGILRNMLAKPPGSSDKEEHQV RLWRKLLAYEKSNPQKFEQATLRNRIAATYNQCLLCLYHYPDIWYEAAVY QAETGSWEGSNQFFEKAIQALPKSLFLHFAYADSLEGQKKIPQAKELYEK LIASVQPVDPLVWIQYMRFARRTERIEGPRKIFKRAKASPECTYHVYIAL SLIEYYVNQDPKMARDIFEIGLKKFPLETPYINFCIEFLSNLNEENNTRV LFEKVLLLPNHENKTIDLFWRKYLDFEYRQNQDIQSIQKLEKRYLSSFYS SNINSLDKSGVLQALNRYKFLNLSPCPSLEIEVISKNLQPSDGDDNSTQQ QKDSESGQHQNLKEGKGKKQKKDKYQQQQQNESSSTSNISTTSYQNPDNP EKPTSSTIIPVSNWKVKRPDITNMLPYRGELSKFNSNNNNNNNNNSNNNN MIGNQNNSPTSRSPQFDIPEFLFPLLQILPASSSFNGPLVDVDFLLMTIK DSPLPIINPNMGQQIPQQQQSQPQQQPPLTLSPTTSTSNNLLNSPTNVQM SNISSPTVQQPMQPSSQPQQNPHKRKLEDEDQISQPQPQTQSYSFQNTKP PTNDLYRKRQASKLSKKA >DLA_10567 g9480, MDIENEDKSMNNFSDISSDETSDSDSSVQHDVEHSSNAYEIDILEIKLRE NPYSFKEHLNYITAIKKLYMASNCKDLQLFQRLRTSRETFQSIFPLSESI WLEWISDETSQKNNSEYLQNLYNKALNDFLSVSIHLSYCKFIEKINNNEL EVIRNQYEKSIKICCNDIVECKKLWSSYRIFEQMVFGSLDNIDKQKNQIK LIRNIYHGQLSNAHFGIEQTYQDYLVWEQSQEPNNINPQLDEKLQESYKR ALNETTERLPFEESIKYEPTHGDKHEIFLKWQEYLNWEISKKQKDRIITL FERAIRIYYNSKDVWLKYLDYILESEEKDNREMLNNLYERLLRSIYWSGE IWSRYLRFLQKDNRNYQEISAVFEKSLVSGLQSSQDVLVVFNSFIDYCWR ASRDLIRVKGENYQIAIQSLREQFQRFSEYFKENKDIYSLEELQYYWAKL ELTEFHSPESFKVIMDQIFNYQTSHYKNYQQVIRHELSLNHFDKCRKLFI KAIKSVQSIDLNRVWDDYNQFERQYGNLDQYELLLFESNKIFKQQLQQQQ SNNNNNINNNNNNNDNEKRLLKRQERQDKKKKIKLEKVEDDKVSDEANNL IFVSSLPYEYDESKLEQYFNNITNNIKECRVVRDKYGKSKGIAFIEFNDI DSATRSLSLNHPIVIDNNNNNNNNNNNNNNNNNNNNNN >DLA_10612 g9520, MRNNSKGGVWRNTEDEILKVAIMKYGLNQWARISSLLTRKTPAQCKARWY EWLDPSIKKIEWNREEEEMLLHLAKIFPSQWKTIASKVGRTAAQCLDHYN KLLDEVQQQQDGTSSERPQRHSEMDPNPETKPAKPDPIDMDEEEKETLSE AKARLSNTQGKKEKRKFREKQLEEARRLAFLQKKRELKAAGQYLHQKKKI ERGKFDQSHEIPFFKKPQAGFYQVPDEEIINDPNKDREFIGKRVDQLEKK KYLEEQEKNNKLEELKKKKKEITNLPGLLMEVSKLNDVQQIKNRKKTQMF LPLPQLTDDDLEEIAEFERVNGGQELEVQLQQQRTKRTPMQQDNIMIEAQ NLYNLSVASTPLKGGQTPHLVNTNLQITKPVNSKDSQQTPSVGKTPNPLL QIAQTPKRKFDSLQDREEIERDNQVQQQKNKSSLLESLRNLPKPKHEIKI SLPDVEPEDIDMDTQSVGASSTGGASAMELDESEVHIRKQEELKHKEQFR QRNRSNVLKKNLPRLYEPVEISSSQDLIQKMVSIEMNKIIKNDNQLYPVL AVNGSGSKQKQKQQQQQPEHQLQYEYYTNKEMDQVNQLINEQIKSSGMNK DMVLNVILQELDSLQENFQVVPGDNKLVDRSVVTSKQRIETLKMDYEMIV KDLKSHQMKSQQLEKKLTVYNGGYQNRSKQLVQSIEELYSSIQKANIELN CYQDLRTLELNSLENRIKSVQHDLYDQVETENHLQLKYSKLLQEKKEILK NKVTKFLYQEQKNKEN >DLA_11108 g9959, MEALRAQLDEFLGKDRNLLPKDRVKTESHFTDADICKYYLCGLCPNELFT NANIHDLAPCSKLHIEGCVKQYQNSKDKEVYDYERDWVRLLENLISENDK KIKKNKERLAANPNENIQDEDLELDRELNQRIEEMDKQIQIYLKMVEDLG EEGKITEAQQTMEIVEDLKAKKIELQREEMIAHEKNENKKMSVCEICGAL LFVGDKEKRSISHLEGKKHIGFERIRKVMEDYYKTKNRQPRHFGGGGGYY NRDRNYHGGGGGGGGGGDGGNYHGGGGYHNRENRDHHGGGGGNRYEPYGG SNRGQGDRERRTYNFEYRDDNRGNSGYRGGNGNDDNRSYRNDDHHFNNNR GRDNNSNGSYNHDNYQRSSRDREDDRRKR # Polysphondylium pallidum, PN500 >PPL_10728 g10191, MGVPSFYKWLTEKYPKITCTNYLLENIENNNNNDNNNNENGNGNNNNNHG NKEKKKLRLNNLYIDMNGVIHNSTHAKNSTTLSPVESDDVCRMNLLKNLD ELVGTVQPTNLLYIAMDGVPPRAKAIEQRKRRFRSAKDAKDALSKRLPSD PVFEPFDSNCISPATEFMCKVNEWVLTYAQALVKRMPSVSIVVSDASVPG EGEHKIIDFIRAHREHWPSDTSHVFYGMDADLIFLGLSTQLSHFFVLRDL GAQIYCSTCKNNGHLCYECECAVAKKRMNDPERSSRLSIRNIPIQADEEY IRSFFGRFGKVLNVRLERALTKRPSLTAYVEMDSVESARSVLAYGANYFI NDTKLSVHYVVEKVVSPATAAGGSTTPPAAADQEDIDNVPLKAIFIPNLD VRTQMFDVNAFFSGCGAIEDSTFIKSPKDPKIKFVVIKFVEEESAKRALA MNGVDFYGTSLIIRKSRPMTKEPKAPLTDQQKKQKDQEKEDKKLKKLEVV NEFLTKADPNTNIDTAYFYLGMAEWNIEKAFETYLAYGKSQHKLLSDTPT MRDDDSFDFVNLDYLKEYFRYSVLSGLSESVASRVDCNRMINDFTMMCML LGNDFLPHLPTMAIKEGSIELMMTWYRDWLSSFKDESSTIQYLTDGPNIN YKPFHKLLLMLSKWESLYFPDKLEKMYKKEIFRLDNLLKTSPHLESHRRN LDGDINYYKLKIGVTIDKVEETANDICKAFVDGLAWVLRYYTVGCPAWSW YYPYHYAPLISDLAKFVEKQSQVYNDTKHIVFDYGAPLEPYIHLLSVLPH YSSKLLPPKLADSIVREPAPLSRLFEERFRIDPNGEEVSWKGVVLLDFIN TNVLKEHASPIIKDQLSAEELERNKFGKDKYIRFNSEILDINQHLDVVKQ QSTDREQGKHKLEIDNHYQPLSQLDLLWCERKFYTLSPNTPDSYEQLPIA HPMINPPTSTLPTEVVISDEQNQFLEWREKQSHVEGSLETLMSSVDLNKS FTVGVNKKLSTPLDRVLSNEISKTKHIQLVSDVDQQMLIHIQFNKQTKLN GLKLISTISKETTPKVLKVYFNQQSVDFSSLQSLKPAFSFELDELSCSLI SDYQAFGQTKIHQANSITLFVESNHSSSDDCKSIIEKIVLI >PPL_13621 g10431, MNYYCSKTFINTTTTTSRFISSYNNHNHQCSFSTININCLSSLPKQSIFE NSNNKFNSRSSQFLKRQQQQKSIIYKNFYTSSSLSYSDTNSNYNDNNNKM SSTTNDRANRLIWVDLEMTGLDITKDHIMEMACIITDSELNVVEVGPELI VHIDDKDLNSMDQWCTEHHGQSGLTEKCRQSKLSIQDAEKQMVEFLRKHV DKGQCPLAGNSVHQDKRFLLKEMPLFADMLHYRIVDVSTIKELVRRWYPS VANGLKKRNLHRTLADIEDSIEELKYYRSTVFKQQLP >PPL_11028 g10461, MSEFGKAGGGGLQSSQYDNIDRRERLKQIALETIDISKDPYIISNHLGSY DCKLCLTVHNNIGNYLAHTQGKKHQTNLARRAARDQRENPNKTTFAPKAR IQPKKTIKIGRPGYKIIKQRDQETGQLSLLFQIDYPEIEHGLQPRHRFMS SFEQRVEPSNKDFQYLLFAAEPYETIAFKIPNKKIDRTTGPDGKFFTHWD RTHLTFTLQLYFEESVNVEDIDPSQQQ >PPL_11702 g10956, MATTTTAASTTTTSSASTTSPTNQITYGVTVPISFSNPTPADLKLSTELE DTLKSFNLFETPEESGKKEEILGKLNLIVRKWVIDVSLKRGFTEQMSLEV VAKIFTFGSYRLGVSGPSSDIDTLCVAPKHIMRSDFFDFLGEALKVHPDI TELNMVKDAYVPVITMIFSGIAIDLIFARLSLSSISEDMNDLIDDAYLKN LDDQSITSLNGCRVTDKILTLVPSRATFRMALRFIKLWAQRRGIYSNVLG FLGGVSWALLTARICQLYPNAAPSTIINRFFKIYETWRWGVPGPTPVLLC PIQDGGIFAAKVWNQKRDKSHLMPILTPAYPSMNSTYNVSKSTLSLLKDE FARGNQIAQKLESGEANWNKLLEKSDFFSRYQFYLQIDCSAQNEEEHRKW EGWIESKLRKLISFLEQTPKMKFAIPFPKSFENKPTPAATTTAANGEAVT DANNSNVCTSFFMGLGFNFSNAPGADKSVDITKAVIDFTHLIKDWAGKGP TMEMKVHYIKRRQLPAFVKAEAPPEEKPKAKKRGSANSADVAKKKNRTDQ QQHINSPTGSSTTTTPLLNADSKTSAINKSSDSISSPPIVVATSSTTPKT ATPISSPKALSPSQQHQQPVVNITNNNGSPVAATSSTTSTSTTIITPVPS TTGMDTTTTTAITSNHQTNESPTDTTNNTNTESTTTLSPTPDTDNVNNNL VPVLSSTTNNNNNNNNNNNIINEVDELDFISSSSSNNNNNSNTDKKPAIK KIDLIRG >PPL_00003 g11381, MPKERAKRQPHFSDHEICKYFLCGLCPNELFTNTNIRDLGPCTKLHDEDC LKQYNASKDKDQYDYERDWVRLMDQIITDNDKKVKKNKERLILDAAKLAA EEGLQDTPSELKVAITQMEERIQALLKKSEELGEEGQITEAQDMMTQAED LKKQKAEMQIEEDARSHDKKMSVCDICGALLFVGDKEKRSMSHLEGKKHV GYAKLRAHMEEYYKTAKRDYRLPRRDNYNNNNRDYNNNSNRDNNSSNNYR DRDGRRSDYGGGGSGGGRYRDSRDRDSRDGRDSRGSPYSRDRRGGGDYNR EYRDRDSRNHREERDRERDRERERDDRSYDRDYDQRDHDRRY >PPL_02616 g2309, MFENNPDSPKDTQTQTPQITWVPYTLKSEDELRFEIDKEAKIKLADGTAE YFGTELALNREYTLNNVKGAIFSWKGCKIEVTDNVKAYISNGTPMLSYAN IHSIMDQHRMSILSQKNQQGPRVLIAGPTDVGKSTLAKILMGYSARLGYN PAFIDLDPGQGSITLPGALCASLIDRPVDIEEGLSNTVPFVQYYGHTSLD INPTLFKAQIQSLGISVDKRMEQSDNARVSGMIVNTCGWIEGLGYELLRE SINLLRINIIVVIDNEKLYSDLSREFSSGGGNNSSSGMKVMKLPKSGGVY LRSALFRKQTRMQRIREYFYGIQGDLCPHITIVDFKDVCIFRTGGGPPAP STALPIGSTSVIDPLALQEIQPSPEMLHSVLAISYAKNSQSLLRSNVAGF LYVSDINMETKKISFLAPCPGDLPSKFLLMGTLKWLE >PPL_02883 g2563, MPKYYCEYCDKYLTHDSPSVRKSHTIGKVHQQAVTLYYKQFEAEWFKSQM QQKGGQVPMMPPFGMQPGLLPPNMVPGQFNIPMMPPGQFPFPPPPGQPMG GMPPHQQQPMSFNPHHPYPPPHLQQSAQQFNSNSPPSNNDQ >PPL_02974 g2639, MYCNMNHLPPHQQQQQHYHQPQQMIYHPGLQMATAHNNMNREYGESTTTT SIIPPSVTSCLTPTMSSQVVSPIIVGGVPKRKLEEEDFSLIKNQPLLVSS PLLTPVSQSPGLTSVQIAFQNASLSNPPTPLTMSPSLSPSAAPMSPSKKS KNSRSSGKSKWNQGSSDDLSRWQKTKSPGIVKGPWKEEEDAKLVELVQKN GPKEWSTIAAKIPGRIGKQCRERWFNHLSPDVRKTNWTPEEDKIIIESHL ALGNKWTAISKLLEGRPANAIKNHWNSTLIKRIGADGKSHQPSPSKDLKD DEDDEDEDSETNSPALSPISLYPTDPSSAANHAHLTGTPVQTTEQQQYNI PPFILSGNHQVENNQMLNTNLPNDLYRQGTIIAPQIIRLQQQTTPSSSPQ MNSQIKKSDPSQNLQRQIPANQSPQLHHQQQTHQIQQQPIQQQSVQQQQQ QPIQQQPIQQQQPIQQQQIQQNTQQHQQYNQQQYNQQQALHQQQTQQQHQ QQYNQQGQQQFNQYQVSHEVPYYNQSFWGQPTAENIAAGDHLVTFPQNPF ISDFNFEHSDFLFFDHGDHSQQNNIKNIDTNQNQSQQPQQQQQQQQNNSQ NNYDINNLFNVEI >PPL_12977 g2705, MVTLATDEIREVWAHNLEEEMAIIRDLIEDYNYIAMSEFPGIVTRPVGSY RTSSDYHYQTLRLNVDLLKIIQLGLTFADSEGNLANHTCTWQFNFKFNLN EDMYAQDSIDLLSRSGIEFKKNEENGIDVLDFGELLMSSGIVLNDKIKWI SFHSGYDFGYLIKLLTCTALPVEEPDFFDLVRTYFPCIYDIKYLMKSCKN LKGGLSELAEDLDIKRIGPQHQAGSDSLLTCTTFFKMRKMYFENQIDDSK YQGILYGLTSSFTQDNSSSNSSSSNSSSNTTTTTTTNNSSGTGTSNSTNS TPNSSHQSISNYSLLQNITQSISPLSSSASSTTTTTNNNTTNAMNGHVIS S >PPL_03919 g3458, ortholog of the conserved splicing factor 1 binds to the intron branch point sequence (BPS) of the pre-mRNA necessary for the ATP-dependent first step of spliceosome assembly MAVEQQDNIGSDFNDDEDDFFRQINEIQNDYERGRPRNREEIKEEKRTRK NKWEPEKTQLGLPGVPKSLPPGLTDDQLASLIIRIRIDEITKKLTTGPID IDTKDDRSRSPTPVYDNTGKRTNTREQRAKDKISKERHNLITQAQQINPQ FRPPADYQPPNEKKTMKIYIPVKDHPEYNFIGLIIGPRGNTQKKMEKESG AKIAIRGKGSMKDGKSTKPQYNENDELHVLLTGDTQEQLEKAAVLVRQYL VPVEEGKNEHKRQQLRELAEMNGTLRERPTFFGAGGKSWQPVDIKCIHCG EVSHPSSDCPLKGQDHNMHIIEAEYLKFIEEVKDLIDLNDRVVDPYDELK ASINNNNNGNGNVQNENNNNQYQHQQQQQQQHSSPPNHQQQWNQYSNNNN NNNNNNQYQQQQHHSSPPYEQQQQQQQHWNQQQQQQQGGYQQQQHHHHQQ QWNQPKQHFNQNNNSSPYGPQSSYY >PPL_04210 g3727, MKSTIKGGVWKNTEDEILKVAVMKYGKNQWARISSLLVRKSPAQCKARWY EWLDPSIKKTEWSKEEEEKLLHLAKIFPAQWKTIAPLVGRTASQCLEHYN RLLDQVQARNDAANPDASGAAAGDDPRRLRVGEVEANPETKPAKPDPIDM DEDEKETLSEAKARLSNTQGKKEKRKFREKQLEEARRLAHLQKKRELKAA GIIVHDKKKAKEKRFDYSQEIPFYRKPMAGFYDTAEEQKQAPDKDKQFIN QRLDKIDGESRAAELERANKLEELKKKKREMTNLPDAIKQINKMNDPEMT RKRNKLVLPEPQLTDDDLQEIAEFEKQSKSYSAASGDGSGGELTATTALV GGLMRPPTEVPQSRLNMVARTPMREDVVMMEAQNLLAMTTAQTPLKGGAN PVLNPSDFSGVTPKPQNMASRTPLRTPNPLAQGMTPRQQKQQNNEDAMAT KHSIANGLKNLPAPVNKFQISLPDEPTLEEIDEDGQQILDQSEQEIREQQ ELKHKEQFRLRNRSLPLKKSLPRATTLPTQTAGLAIIGKAEEEQQQQLLQ KEIDNLILNEMCGIIKHDDKCYPLEGGTNDDNDYEYFSEKELKEAQKLLV TELNVVKQEQQEQLDEKELIDKFVNIWTSVRDDYVYQSNQFVERASLSTE QKIGSLKQEYDAIVNAMKTSAKKAQLIEKKMTTELTSYQASLAKVLKQID EISQQIEQSSIELSCFQQLRIIEQRAIESRVKFVQNQVYDQCDRENRNQM KYSKLINERNTLLSQQQQQNGNNNNNNNNNNHPK >PPL_04213 g3730, MEEEGDVCRVCRNGPTTNNPLSYPCKCNGSIKFIHQNCLLDWIKFSKSSA CELCGHPFRFTPIYSENAPDVLPIREFILEAIIRLSGFLKRLVRVLYVVF CYLFLVPFFTSWSFQTYFYLKLPDSIYDVNTIARDFFLGFMLFFWIIIVT ISSYLIFDILDHKHSELDLEENLDNNNNNNNNNNVNNQDDGDDDDDATDV EEEDDDEDIRYNQQEWLGPQHQQENQHANNHGQPLFGGVFEVFQQHQAPQ APQQVPANGVDQVDQMDLNTLIGLSGPKLEVIAKGVCLIIYNTIFLVVFL FIPYFIGYLSTNAVSTLFDIQLPASIISKYLLNISIGYIIASTLTMFILS NFIKNFIYYKYSRIFYSFIKVCIIVILEVGVFPMLFGAFIDYASMELFGG TFDTRLQGSLHHILPFIITRWGVGFFCIINISSLCKILHQIFRRKVIWFL RDPNDPDLDVIKDLVKVPFVKHLININLSLLIYCVVTILLIYLPLKALSL IPNLLPVDFGDPLNKVGIGADVIFFISTFYFPKFHPQLTFTNFIKYFNNI ITRTLGIDEYILLPPAIPNQQQQQQQQQQQQQQQQDGQEQQGAEQPPQPQ PIRNPQDFPDVKPTHYKFRIIGFLFLWWLLLFTIICCFIGMPITIGRSLA GLASISNPNDIITFFVGVVVVWVLSKLVNLVIFHRSTINIIQWIPVAFKV LILGFSICIFLPVLVGILFDLILFIPFVSSYDETFYIFSSDIFYSWCIGA LILKFWYRWATAVPNEGNIRHNRIEDEEQTERDRWFDRFETLKRNGFANV DLMFTMKKIVFPIAHFLMVLFTVPYFVSRGLVPWLGGSAILENFTFRFGY PAFTVLLIIESLYNKAKVYLIKLHNSIRDDRYLIGKHLHNIDTTN >PPL_04436 g3925, MSLVCRLWRKRTSQTIKVIDLSDIEIYNTPWVDTFFQSFTSLLVLRLKQC TLELNQLSALLSYFQSLIEVDISFCKLHSVEHAMGIVDTGLSVFLERLEK HPSLENLYLSFNHLDTKLTQDLLVLSRIPNHYKINLTINLMFANGFNIIC EIMKSNQSIIKALNLNKSKIGRETNSLVAFSDVLKLNHSLTSLDLSSNQI SDSAAKILSESLATNDTLVQLNLSFNEIKKEGSVALANALKSNRSIESLN FSYNFLGEEGTRAFSDLIATNTTLTDLNLSANKITFFNVPQIANALAANK TLRSLNFLRNMIDQVGAEYISQGLHYNQSLTSLNISSNKFGNLGAVLIAK ALSSNRDTKITEINMSSNCIEDEGAASFAAVVLHNNTVTSLDLSVNWINS DGVVEIANAFLENPNSTITSIDLSCNTICPKGARAMAEALSVDCALRHIN FFSNNIETDGAYELSKSIIKNHTLTSLELSTNLIGNEGIKYLSQALLENN TIVSLSLSQSLIAYEGIKYLVSLISLNHTLTFLDLSYNFIGPKGAEELSL SLENNKTITSLDLSSNSIGDDGATAIAGIFPKNNTLQRLSLYNNKIGPKG AKPIVENLLKNHSLYSINLLANRIDAYILKPIVKRLEHLLPAPS >PPL_04777 g4227, MYNFVSTVQKPTAVYHSVTGCFTSPNERNLIISKGTKLEIFTLTPEGLSP VLDVNIYGRISDMRILTATGDKQDRLFILTEKYKYCILAFNSESRELVTI ATGDAEGTIGRPAEAGQIGIVDPECRMIGMHLYEGLFRVVPLEHGQPVRE SFSMRIEQLQIVDMVFLKQCAKPTLALLFKDTRDARHIVTYSIDVVTKEL IEGASQDSVEENSTMLVPLDNGAMLIVGEMAITYMNLKGNSQPVTISIDH THIVAYEQIDRDRFLLADDCGSLYLLHITLDSSKQTALNMKWEPLGETSI ASSLSYLDSGVVYVGSSSGDSQLIRLNSHIDPNTGSYISVIDQFTNLGPI TDFCVVDVEKQGQGQLVTCSGTFQDGSLRIIRNGIGIAEQASIELPGIRG LWSLSNNSNPSSLHRHLIVSFINSTKVLTFSGEEIEETEIAGFDSNATTL YCGNTTENNHFIQIATSGIYLVDSSSLMRLDQYTPEKGSINLASCNGSQI LISQGSNLTYLEISDSKLIIKKEAQLQYEISCLDISLLDGFTSSPVCAVG LWTDISVRILQLPNLNEVCKETLGGEILPRSILFITFEGTNYLLCSLGDG HLFNFTFDVVENLLQERKKLSLGTTPILLNSFKLKNSTNVFASSDRPTVI YSNNKKLLYSAINMKVVSHVCSFNSEAFRDSIAIATESSLVIGTIDEIQK LHIRNVPLGEMARRITYVEEYHSYAVITIQRNDGNNNNNDNDNFNNNNNN GVPLTNYVKLLNEQTFETTSKYALKSFEFGWSIVTCRFKNDDALYVVVGT AFHNEVESQQSKGRILVFRIEDNRLILLDEVALPACVYCLLPFNGRLLAG INKRVQAFNWGVDTNKLTKAESYSGHTLSHSMVSRGHFVLVADLMKSMTL LVEDQQGAIKELARNPLPIWLSRIEMIDDETFIGGDNSYNLIVVQKNAEA SSEIDNELLDTVGQFHLGETINKFKHGSLVTSPDMDSPKLPTILFGTVSG AIGVIVSISKDDYEFFEKLQKGLNRVVHGVGGLPFENWRSFSTEHMTIPS KNFIDGDLIETFLDLRHDKMLEAIKDMNISIEDTYRRIESLMHHIR >PPL_05118 g4515, MQNYHQQMNNVEQHPYYYMQQGQDQNPNGQPDHINIKKDDSISDQAASYS AHHNGGANTLADLSDLTDKSVLLKHENILHHDKDSTAKMEPQDPSILQQL GYQQQQQHMSNIIHSSSSPLSNNTPITTSTTTSSHGNIGQLQQSGNSPQQ QSQQQPLVYEIPDILLAPGTRSPYIQELTDSIISIILGKAHKDEPISQVY TNIASVCRQWHRLSVDRITNYCYHLPPDKKITNFFSNIAHNKFPNLHTLQ FKVLTPTLFDVTSFVKMILIDNKLITTLELSQNGIGNKAATCIGTCLVNN TTITHLNLSFNSIGNEGAEEISKALGTNKTLTHLDLSQNCIGLKGSKALS TAIQTTKTLHILNLSKNRFGTKGIDVIADSIGKNTCLLNIDFSRNEISEK NAKIIGDVIKNHPTLQSLNFCDTSLKSDSMKYISEGIQASQTLNSIDLSR NEFGYKGSKSLAVALQHSNSLAFLDLCGNDIGDKGAIPIAEALADNKSLT NLSLAFNNIGTQAAQQLGAAIKVNNSLVSLDISINAEIGPIGATSISEGL CYNKRLTQVSFCTNGFGPHGAKSLSEALRFNNTLTKIELRGNEIGDDGCR YICETLKTNASLTEINLSANGISNEGARAVCEALWYNRTLQQIILTHNNI NQQGVQTMKDTLEQVFVVTFDSYIYPPNSQTILSNLYITRPNHIVCKVTI >PPL_05455 g4817, MNKVGMLKFQGSSQFRQRIICSTLSSRPVKITNIRDDQERPGLTDYEVSF LRLLDKITNGTKIDINGTGTQLTYIPGLLIGGKLQHDCPVSRGIGYFVEA LICLAPFSKLPLDITLTGITNNDLDLTIDTIRTTTLPIVRKFGIEEGLHI KILKRGAPPGGGGSVIFKCPIVQQLKPIQLLDEGKIRRVRGIAYATRVSP QIPNRVLDTAKGILLKFTPDVYISADHYKGGESGQSPGYGLTLVAETTTG CCISAECMGAAGESPESLGERTANFLLEEILNGGCIDSNNQSLALLFMVL CPEDISKIKLGKLTPYTIDFIRNIKEFFGTVFKIETDDDSKTITFTCLGT GFKNMARKTF >PPL_05669 g5029, MIVSLFPPVPVVNVPLRLAYRETVDPNKVDSAKTVLFSSFKNFIHYNCIE DLKIYIDDVEYVKESGVGLTADEHEQQQQHHSKTILSKISKFFKSDKSDE ESEAAQIKSVYGVTDSNIIPLNEENALFIPRVEHANKQVTLKFTIKKKPF ELKATVQHRHPRVWSDIKQLDLVSLENEKIEEEQTAMTNSSFRVMQFNIL ADCYTSPANYVGCPVYSLYRNYRQWVLPEYILEHSPDVVCLQEAEVRMER LTKKLVEAGYLHTPLCDLARYEEEQSITYFKTSRYQPIELQMVHYKNLKN LLTPAQLDPLLKSSITAKYLDLLSSSMHHNKFSLALLQDKQTSSSILFGS VHLHWGSPDFDINYIAQVIQLHIFMMVVGNLLDKHSLPRDTPLVICGDYN NGPTQKAYTLMDYGQYELNGYTLSHSFKMSSAYSHRPDGEPKYTIRTNHF TGSIDQIWMSEKLRVSKLLEIGDHYPRQLPSLTDPSDHIMMLADLYVSKH PRVTLTAADNNQS >PPL_05958 g5309, MVETDKKEISEWDEPTNKSGLGAVGATPRRNRWDETPQKMASSVVAETPK RRSRWDETPVQQMGAQTPRIGMAGVGGITPLGGGVTPLGGMSMMTPLPGS AGSSSVALKIEREIDDRNRPWTEEELNAQLPSDGYEILAPPAGYVPIMTP ARKLMSTPVGVAGTSGGFFIPEDQPRVSGGAGEYGVDQTPGGLPMKPEDK IYFEKLLNEDEEETLSPEEAKERKIMKLLLRIKNGTPPMRKQALRQLTDK AREFGPAPLFNQILPLFTSQSLEDQERHLLVKVIDRVLYKLDDLVRPFVR KILSVIEPYLIDQNYYARVEAREIISNLSKAAGLASMTATMRPDIDSPEE DIRNTTARAFAVVASALGIPSLLPFLRAVCKSKKSWQARHTGIKIIQQIA ILMGCAILPHLKSMVEIVEHGLNDDQPKVRTITALAIAALAEAATPYGIE SFDSVLKPLWYGIRQYRDKGLAAFLKAIGYIIPLMDARYASYYTKEVMTI LVREFKTNEDEMKKIILKVVKQCVGTEGVEAQYIRDEVVPEFFKQFWVRR MADRRNHKQLVETTVEIANKVGGAEVIAKIVDDLKDESEPYRKMVMEAIE KIISSLGASDINPRLEEQLIDGVLYAFQEQSTDETLIMLQGFGTIVLSLG VRVKPYLTQIAGTIKWRLNNKSAKVRQQAADLISRIAVVMQLCGEEQLMS HLGQILYEYLGEEYPEVLGSILGALKSIVNVIGMTKMTPPIKDLLPRLTP ILKNRHEKVQENCIDLVGRIADRGADFVLEREWMRICFELLDMLKAHKKG IRRATVNTFGYIAKAIGPQEVLGTLLNNLKVQDRQNRVCTTIAIAIVAET CAPYTVLPGLMNEYRIPELNVQNGVLKSLSFLFEYIGEMGKDYIYAVTPL LEDALMDRDPVHRQTACSAVKHMSLGVHGLGCEDALIHLLNYVWPNIFET SPHVINAFLESVEGLRTALGPTIILQYTLQGLFHPARKVRDIYWKVYNML YISSQDAMIPAYPRAADDGPNTYTRYELDYVI >PPL_06218 g5553, MGVPRFFRWVSERYPQIIQNLVDSTAPEYDNLYLDMNGIIHACSQEISNS LLTFSEEELIRKVCNYIDKLFHIIRPKKLLYMAIDGVAPRSKLNQQRQRR FLSVFLEDKAKQKMISEGKEIPEVIFSRTAITPGTEFMSNLSDCLQFFIK KKINEDMSWREVEIIFSGPENPGEGEHKIIDYIRKNKASPDWDPNQSHCL YGLDADLILLALVTHEPHFSILREEISFRPTKRQLDFQLLHISLLREYLD LEMRNDSLEFGYDLERIIDDFIMILIIFGNDFVPHLPFCEISKSGLNVVM DLYKKLLPDLGDYITDGAEIDLHRMQSFFNAVAQFELKQQNVSSGLDDEE ELDDSAAVAAAIVEGEDPEDAEARKEFERQALERIKHHFNELDFKDEELE DDISSAWKNAYYRAHFEDFPDNYDEIPQFKRNLVHSYLEGIVWVLNYYHN GCISWVWFYPHYYSPLACDFVDIASIEVNFEPGSPVTPFQQLMSVLPPQS AYLLPAPYRRLMESDDSPIADFYPKEFEVDTSDPHYFDGIAIIGFPDLAR LVEATKDEDSYDLTPRERMRNTLRHAVIIYHDAQIEQSVETPNAKLFDDL EKSNATIEELILPEPNENSLKPFRLCDGVLVGNHSPSGFPTFQSIDFDWE YMNGVANLWGNTSRKDSMIVNPPLPELCTLRDLKPLIGKRCYVNWPYHIE AKIVGFSDSNQHITRDSTIDYVAFQKIQYLDQLKKLPFDYLRRGINIKDL LENTVLVHVQKIVGVDTEVGGRRVKRYSDKEDTFPLQLMVEYDRVKADSR YEECEELPFEKRFPIGKQVIYVNSDHYGSVGTVLHHFDNTLELELKVQKM DTKFGHQCAREEDEYYPIQQVCKMTNLSTQQISLLTGALFIDKPMTDIGL NMKFTGRQQQLLGYCRGITMSDKTGNSFKKWEFSSLAVEVIKKYLAAFPV LNSILTMYADDKTGKKAIDISSLIPEKSDRVALIKKIDEFMKESKIRHQR LVPCDTITMKREAIKKVEDFYLKQSKHEKVSVITRTSPEHVIEPASYESV ISYERHQSQTHLQQRNNNSPSLASSNISNGGFLESNKSKIFNLGDRVVTL LEKGNIPFGLYGTVVSVQEHKVDVVLDQECFSANSLDGFCLEKRGICISK WRVYNLSQPTAVTNYRYRSNKPNYSIDSYEHWKKYDHQNNNKSNNNNNNN NNNNNNNHNNNNNNSASIQQKIKEEMPPNLNWQQQQEYYKQLHRKNYYNR QEQYTKNFTPEELVHYAKKYPHVHQSMLWQHHQQQLQIQQSKQLKNNNNN VDKKPPGLVQQQGQGQQAKPPKQKKDKKEKKEKKNNTSGAEPAKQSDSTS QILQKMFESQALPDPPASPPITSFFMKAAAQSQEGAPKAAEVGPAETAPQ QPDQQQQQPPQGPPSQLLQMIQSSLQSVPQSGQPPAMPMQYPPPPHMGMP PFPHPPPPHMGMPPFPHPPHMGMPMFPPPPFGVHPPPPMPMVNQRPNQHH HQQQHQQQFPQLSESNKQQKPKNNNKQNKVKQPQQQVSKAQPNTNTNIHT LNDLKQNNKPKKDKANKSQWVQKNKPTAGSSSAVPENNDNKPNNNNNNDE NNTNNGDKDQLNWQQKLEK >PPL_06222 g5557, MSEYKSSIFKSISSPTTTANQYNNNNNSNNSSSSFNSFNYGYESSMYYSD INKLEGMNNNSNTNNNSSISSNNNDVSYGWSYNDIVNGADELKTTAYHSP TAAPTSGTSMNSSPIYYNKDLIDKLSDDSQPPPHHSPLNLYGKSLRNSYQ NGIGGGHSRNNSGSGSSHSRNNSSGGGSSHSRNHSRNGSIGNNSTTAAEL QLLDHDDNNGNSHFNDDNDDNDDSSDVDQTDNDDNGYLSTLPPTTTTSSK SYSASNLSIFVENNYSELPITKPSGSCTPGGGSSSSPSSLQVSPKLSSKT PPTAFPSHFPPLSPSIDVPSPSSSSSPPPVPSRTHKRFHSTSNLPNNNNN ESNSTNNSNNNSSKLNVFSNQRANLTTSANNISQSLSSSFGSISSKISQS PLKSKLLKVITNVKNSSPVESILTNLSISTSSLSPRNSSDISIQPIASST SEPSSNYPSLQSSTNSTVDIEYEQQQNQQQQQQQSQDGNNIKLQDSIDKI VNEFDFDLIKIEDNSQQQQQQQQQQQQSSQQTNSNNINDVNIEKTTLTKK PQQFLSIDEKSVIQNSRRSSLLHLSYKSSLLNQISQQQSRSERYRFSLPL LDISPYPKLNNDFNATNNSSNNNYNNSNNNQFISYDELFDNYKNSSSSNS NGNNIYSYPSLKNTNGSTVKQDITLQRFKPLPPRPPTTQHLQHQQPSNSS NNNNKMSSLLQFISISLPDIITKNGESSAQSTSTAMIDDEQYRNFSQYLN ESGDRHKIYQHIQWHLEQPVVNIEIVKLLGTHYGFTDSCRAISWMLMTGY LPPNKDQRQSALQSKKLQYRDLVKKYYGDCKLFESDENNFERNTNKLLNN VAVLWSNNTQSGQQEKEKFNELVQQVHIDVIRTRPDGFYDLFELKEIEQM SERILVIWSSENKDVSYFQGLNDLICPFLIVFLDYAIEVSKVTQDSFPSY PSLVDTLIDDEVLLSKKIGDGSLVKELIEKKRFDILSRVETDVYWCLSNL MNSCKSYAANTGCGLPAEGMMKNLESLIKESNEELYLHFKKHGLDFSHFS FRWMVCFLIRELSFETGIKLWDRYMCDKNNEGFSILHICFCASILSYWSN DLLNMEFMELVTYLQRSDILPRDQLDPILRNKEREREMVKPKQNIDPEEA VVRHTPHDIKCKARRVLLSQKLRAQRKKAKESGRKERQKERQKLGEDGPA AQQPRTIESMRVADDTIVDEEDPEYQDEIELDEFSKYFEGKEPKTCITTN EKPGGKAVGFAKLFPRILPNAEYFPRQHFELSEIVKFCKNRDYTDLIVVN EDKGEVNTLMICHLPDGPTALFKVTSITMPDKIPGGGEMTNHKAELIVNN FTTRLGHTIGRMFASLFPQEPNFRGRRVCTLHNQRDFIFFRQHRYMFESK SKANLQELGPRFTLKLMSLQHGTFDTKSGEYIHLHKAGMDVDRKKFVL >PPL_06397 g5715, MTENSNNNNSDTTTNINNSKPKSFENVMEITKQLVERLTTQNDQKIRSLH NHKKNKEDNNSSNSTNNNSNIVDKKKNQYIDTKYITILSFSKKLIQLSLF QYRSFVNDENLLVLFKKYGRYIGSLDLSKCNHFTVEALIDMLEYLPNLNA LKLQFCSQLTCENLERLLQLQKERCSIKHLDISFNAMRSLIRSRLQLNPN LSLISLMHSHISDDDGVALLDSVKGNESLFSLNLSFNSISDKTMHAIAEL MSRDSTLRELNLATNKISDVGMLEFGAALAYNNHIQILDLSSNFIQDRGG VAIAKSLALDSSVQKLDLSANDVGPQCGIEFGRSLLVNKTLTSLNLHRTM IDTEGGLALCQSLATNQTLLYLDLGMNQLENVVGCAIGESLKKNRSLHTL ILKRNQFGDQAAHAIGDALQTNHTLTSLNISGNQIGHKGAKSIAYSLPLN KTLRDLDLSYNMIGDGGGKLIGEALGTNSSLIKLNLAANRIGSESCKSIA QSILNSTFNPNTQQHIDQLNNSGNLTESGSTQSPPHTTQSVQCSHLQQQI TQQLAANMRISSSHVNLRGLNNNNNNNIGNIGIGNIGNNKTVVRALFPTL VWLILDSNRVGDEGAIALSQVIANNPPLQTISLVSNLIGESGGRAIGESL KYNTNLLSLTLDSNRLGPDGAKFICQALKNNGTLTHLGLSGNHIQDQGGQ YIIDALDLNSTLKSIFIANNDICDNIKETLEAIPQCTES >PPL_06763 g6038, MSDVDKLQEKAKKWKQLNNKRYSDKRKFGYVEPQKEDMPPEHLRKIIKDH GDMSSKKFRHDKRVYLGALKYMPHAVFKLLENMPMPWEQVRNVKVLYHIT GAITFVNEIPLVIEPVYTAQWGSMWVTMKREKRDRKHFKRIKFPLFDDEE PPLDYSENILDEEVEYSIQMDLDETEDAAVIDWFYDSKPLSNTKYVNGPS YKKWKLDLPILSNLLRLASPLLSDLTDNNYFYLFDDKSFFTAKALNMAIP GGPKFEPLFRDMEDDDEDWNEFNDINKIIIRHKIRTEYKVAFPYLYNNRP RKVAIPFYHAPNICYAKSTDPDLPGFYFDPVLLHPIPSYKLDKSQPQTAY GDEDDDFALPEQVDPFLQETELDTETTPAGIQLYWAPKPFNQRSGLTRRA QDIPLVQTWYKEHCPPGHPVKVRVSYQKLLKCYVLNKLHHRPPKSLNKKY LFRSLKATKFFQSTEIDWVEAGLQICRQGYNMLNLLIHRKNLNYLHLDYN FYLKPIKTLTTKERKKSRFGNAFHLCREILRMTKLVVDTHVKYRLGAAEA FQLADGLQYLFSHIGLLTGMFRYKYRLMRQIRMCKDLKHLIYYRFNTGPV GKGPGCGFWAPMWRVWLFFLRGIVPLLERWIGNLLARQFEGRQNDTVKTQ TKQRVESDHDVKLRAAVVIDILDMMPEGVKENKTRIILQHLSESWRCWKA NIPWKVPGLPIPIENMILRFVKSKADWWTNVAQYNRERIRRGATVDKTVC KKNLGRLTRLSLKAEQERQHNYLKDGPYVSAEEGVAIYTTTVHWLEQRRF SSIPFPQTSYKHDIKILTLALERLKEAYSVKSRLNQSQREELVLIEQAYE NPHEALARIKRHLLTQRTFKEVGIEFMDLYTHLTPVYDVEPFEKITDAYL DQYLWYEADKRQLFPNWVKPSDNEPAPVLVHKWCQGVNNLDSIWDTSDGE CVVMMETQLSKVYEKMDLTLMNRLLRLIVDQNLADYMSGKNNVVINFKDM NHTNSYGLIRGLQFASFIFQYYGLVLDLLILGLNRAAEIAGPPNLPNPFL TYKDVETETNHPIRLYTRNVDRIHILFKFTQDESRELIQKYMSEHPDPNN ENVVGYNNKKCWPRDCRMRLMKHDVNLGRAVFWNIKNRLPRSLTTIEWDE SFVSVYSRDNPNLLFSMSGFEVRILPKCRATNEQMIPKDSVWSLQNMNTR ERTAQAYLRVDRDSMERFENRIRMILMASGSTTFTKIVNKWNTALIGLMT YYREAVVVTREMLDMLVRCENKIQTRVKIGLNSKMPNRFPPVVFYTPKEL GGLGMLSMGHVLIPQSDLRYSRQTDTGITHFTSGMSHDEDQLIPNLYRYI QPWEQEIKDSQRVWAEYALKYEEAKSQNKNLALEDLEDSWDRGIPRINTL FQKSRHTLAYDKGWRVRTDWKQYQVLKSNPFWWTNQRHDGKLWNLNNYRT DMIQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEESMKYKKLTHAQ RSGLNQIPNRRFTLWWSPTINRKNVYVGFQVQLDLTGIFMHGKIPTLKIS LIQIFRAHLWQKIHESIVMDLCQVFDQELDNLEIAVVNKEAIHPRKSYKM NSSCADILLRAAHKWQVSRPSILQDTRDTYDGSTTQYWLDVQLKWGDFDS HDIERYSRAKFLDYTMDQMSFYPSPTGCLIGIDLAYNIYSSFGNWFPGVK PLVQRAMDKIMKSNPALYVLRERIRKGLQLYSSEPTEPYLSSQNYGELFS NKIIWTFEGNLATKPINGAIFIFNPRTGQLFLKIIHTDVWLGQKRLGQLA KWKTAEEVAALIRSLPVEEQPKQIVVTRKGMLDPLEVHLLDFPNIVIQGS ELQLPFQSCLKIEKFGDLILKATEPKMVLFNIYDDWLNTIPSYTAFSRLI LILRALHVNNERTKIILKPDKNTITQPHHIWPTLTDQEWIKVEVALKDLI LADFGKKNNVNVASLTQSEIRDIILGMEISAPSQQREDQIAEIDKQKQEA SQLTAVTVRTTNIHGEEIISTATSPHEQKVFSSKTDWRVRAISATNLHMR TNQIYVNSDFVKETGYTYVVPKNILKKFITIADLRTQIAGYIYGISPPDN PQVKEIRCIVMVPQWGTPVFVNVPNQMPEHEYLKDLEPLGWIHTQPTELP QLSPQECITHSKIMSDNKSWDGEKAIIISVSVSWPCTLTAYKLTPAGYEW GKANKDSQAYQGFQPSHYEKVQMLLSDRFLGFYMIPDRGSWNYNFMGVKH SANMTYGLKLDYPKNFYDEAHRPSHFHNWTQVSSEINEDNEAADQENLFE >PPL_07062 g6321, MDYRDAKGPCLARYLCQTLYRGERYYLQIDSHMRFIKEWDEILINQLKQC PSSKPVLTAYPMGYTLPNNLPNYTYPTLLVARSFGSQDKMLRLGSRLAAI PKLKSPVESLFWIAGFSFSYGSMITEVHYDPHLCHLFFGEEMLMSARLWT SGYDFYSPTHAILFHLWKRSHRPTFTELSFEEDRQKSHHRLREIMGIPSD SNDIGRPDIEKYSLGTQRSIDQYQEYCGVNFSSQSISEKALKGGHNDSFF LNEIIEMAIKSQQDMTDIDNNSNTFESNDQHENQDQIAQLHDETAIEMND DPSAEHDAQQQSENDGDDENRDDNEDVGNGEEDEDDDDDDDDDEEDEDDD DVNLVLDTDIVESGRTARSAIKPGYIKGVTSITPGSGAKYDVTKQTNTNF QTNRPQKSIYDVSLDSFDDKPWNKPGADITDYFNYNFTEDTWKAYCERQN QIRAEQNNLGKIKSYESKNQDNKNDILPPEFLMMTDNNNGGTNSNNNNNN NNIASNSNRDMMAKRAPLKRENAPWQHDNMNHVPPHIMRQQGGYIPPYAG GVPPDYSRGPGGFRGAAYNDNPNQPDDRRRDRDRERDRDQRDTGTVGRDR GDRERDRGDRETRDRDRDPRDRDRDRDRDRETRDRERDRETRDRDQIRER DRDDYNRSDDRKKDDRRRGSTERRSSRDTYQSRDTDYKRKLEDQEDERSK RRR >PPL_07415 g6630, MSYKRINGVDESDNRKKRRNDDFDAFDSIIGDMHKSKEATTQGKTARSTF SSEAVIKKIIDLTKENDVMNLEVEARIGRLTSEFKSGVIQEDFNTLYSAM RSKFGEPTKVTETDHIFDDYRIVFCEDTQKVLRKESKVDKNTFNLPTNLI YDIRISVSIEQQLPVPIYLPEGNRPRRYKTRYTFTDKQWKIDLTQVINYT PGVETQVSSLEVEVELLPNAIEGCTNVDNLKSLLNRFLNEAKSLISMIQP KQTLSFPDVEMEKVSNFQEIMDLKKTLFQFMPGSNENKQDTFPGSMPINF GKKYFSHVQANDYYVSEKTDGVRYLLLIAKDNVYLVDRKFDFYSVKFDKL IEIYGNDTLMDGEMIRQLRTKKPIFLVFDLLSCRGVCVAGKDLSGRIEAI RNSITGPFMHKVENQHHQTPLPFLIWGKNFFNKTQIESVFKSIKQRGEDR QYVDHKREHNTDGIIFTPNTPYTPYTQNDLFKWKYLDKWTIDFKIMDKGQ KGWYLTCIGNGNSDVEIRSLNFSRDDIENLQRDFKRARDPNTVIVECSFQ PNTGKWKYHMVRADKFKANYISIVMDTMESIAEAISSEELQYRIPLKHEA DTWDYEIQKMRAAMLQNLKNKKKAGNSSSSSSSSSSSLSSNNTSHSRPPN SNNSSANHQHSGQRPNNNNSNSSSSNNNNNQTNSAYYQGSNFEGDPFGTD EDIEPFQHDGQPIFEDTTYDREDDLDGEGDDE >PPL_07750 g6931, MYTNPPPNVGYGMMQPAMMVPVFGHQYYNPQQALQQQQQQQQQQQPTGLS NSTGMPQPPPQQYVTTNQLSQSTPQFSTNVHAYPTPQYIYQPQSQPIYTA NVQQQQQQQQQQHSPNGTPITSQHPISQPIMMDMHHAYLPYNNFLVSQPP NMQQQQQSPMMSSPTAKSPSTSPISNSQYISNATPAPATQSISQPLTQPI SNIQQQAQPAPVAPLKPKFSLPLHNLNNQQSNTGPISPKSTISTTPKKPY SPRSSKSKSSTLISHQLPLSPSTLKNAGLSSASTPLFTNPNQQHVTPNTT PRSPLSGSLELGCSTADSSLGPYQSTSTFNSPRNLSSAANSPQQQQSPKS LIIPQAISMVNDLQQQQQQHQQHQDQQQQHLQYFNNVSKICPVIYQQPTQ QQQSHQPQPIDQSYMYQPNTSTSSLDPMDSESIPLFSQIAMPIPSGLNCV RCMGSVDPQMLLSCIHCHVHFHRYCVFGPEQQNHLQPQQWICMHCQNIQT DFGLIPTTTSNTATNWMNIPMPMVDQQQINNNTNLDLQQIKQEMNVSPNN NNSSSDSEVSDSENGDDSSDEEEEDDIQPLSRQMKDNNDDDDDDEDEDEE DDSQPTSTESSPRNQTTPREGKSKGHWTKEEDEMLRALVEKYGTKRWKYI ASLLGLRNGRQCRERWSNQLDPGIKRDAWTLNEDKIILEAHAKFGNKWAE ISKLLPGRTNCAIKNHWNSTMKRKISKKQYDISLLSLESPRPSHSSSPPS QFVQSNNNNNSTIINNNNNNNNIISNSSEQAVDQYATADQHRMNHSPRVD MLSTTSTTTTTTTSSSSSNINSNNNNNNVQLVKKVGLTLPCYICESITYL PPKGSDSTNKQHILSQEHCNYFDLPFPVGLGEKKQYCMCHAHYNSYRRRQ ASKCGAGINPVVPPLDFTRKEDETILAFRSRREWPDLNDILMMKNENTKS DTMKIDLTVLYDILLETSSLPIEECYQKIYEVLNIKAKSKKSSTSSAGSS SLLVNNNNNNNSSGNLDNDLVNKKLIKNNIKFKIKNLLVTFPHLKFYGSI KDFQLQRLQKVPEILLVKDNEHLRMKFLES >PPL_07881 g7065, MFGRRRIALVHQNGIKLLSGHSNITQEIKLKSVKMAYIVDPYVLILHKDG TISLYQGNTGITQLLEYELPQPKDGVMSCSMFHDVKSFFSINNNSHTEQS NNNSSSSTFNFDTDDEDDKDGDVKMNDKDQSSSSTSTSSSSTTLQQQNVY LIILTKKSTMELYRLDTKELIISAANVSKEYDILGVASHQFTMNQQQLLA QQTQHHNINNNTNGNGVNQQQETQPKIVEIVIHYLHNSPHSSPYLMILNE FGDILIYKAIKYKDSMDNTKELIRFIKHTDQNLHSKQREYSYGIDPSSES SFYIRKIVAFDNIGGHKGVFMCGKRSLWFFCEKNYLRAHPMNFKDPVTSF TCFHNINCSYGFIYFTEKGVLRINQLSNMMNFENEWAIRKIPLRMTCHKI SFHQEFKCYVLVISYPQAPQSDEEEEEKEKSKKPLILEEKFQVKLIDPSM NWSIVDSFSMSEKETVLCAKIVHLKYADVDGIKLKPYLCVGTAYTHGEDT VCKGRILVFEIISHREVQDDTGEEKKRLNLLYEKDQKGPVTALAGLNGLL LMSIGPKLIVNNFSSGSLVGIAFYDTQIFIVSLSTVKNYILVGDMYKSVS FFKLKDQKQLILLGKDYEEMNTFSSEFIIDQRVLSIIVSDREKNLRIFSF DPNDPESRGGQMLLSKTIYHIGTNTNKFLRTPLRLPDGTLRNDMHLLFFG SLDGAIQVLAPLDKKQFQFLQQLQSRLYLLPQTAGLNPREFRQKNDHQYF TQPGHYIIDGELLTLYQFLSKDDKTLISQSLGTNINEIDRQISILNNSYS IFS >PPL_13379 g7066, MESYLFNKQLFPPTGVEHCIRAKLIDDNAVNLVIAKTSLLQVYTIRYDRI EQQQQQQQQTNEQQSQQDTLKPWLELNLELQLFSIIESLNCVRLPGDDID SLILSFRDAKVSIVKYNKATEKLDIRSLHYFEGNSELKGGRKTFRTPPLI RVDYQQRCAVMLLYDRHLAVLPFPRSFSILDDEEEEEEEEAAVVADQQQQ HDENEQQQPQDDQQQQQTSEKNKKKKQSESYVISLNSLGIENVKDFCFLH TYYEPTLLFLHEPSQTWTSRISSKKFTNVLTAVSLNIAQRQQPVIWSIEH LPYNCERLVPVPDPLGGAMVLTPNILFYFNQSSRYGLECNEYAQIDTGDQ FQFPIDSSSTNLVFTLDCANFIFLGDRLLGSLKGGELLIFHLISDGRNVQ RISITKAGASVLSSTSCVLTDNLLFLGSRLGDSLLLQYTEKIIDVDSSDN VENLSNPYKKKKTSEVFDLFDDEERNSKTGASDADGNGQSLFDDEDDIFN DKKNQLKSYRLNICDHITNIGPVSDLITGVSYDHASVSNDESFEQRSLEL VACSGHGKNGALTILQYGVRPELNTSFELPGVRQSWTLYYDDPLAASQSG SSASNAAASAASKKRQHEEEYIRCQLASLFVLIDG >PPL_08452 g7562, MDDNDQDQQEQQLEQQRLEQQQLEQQQQLEQQQQLEQHIQNESNNNNNNN NNNTMQSDNSVIEQNVDVKMTNLPTDSSDSSTNIINTDQPPTTAAITTTT KQNDIYNPEQASLEGNNNNNNNNSSNNNSPQLNAINKSSPVSTTLSPSLN AVSSPTSTSHTQTTPTTSTGNASTSMLSPSTPTSTTTTTTTTISSVMPAI GKRLNVQIDNLEARITNDKYDTEAWTLLLNEVQSQPITIARDIYERFLAV FPTAGRYWKLYVEQEMSSKNYDMVEKIFLRSLRNVRNVELWKCYITYIRQ IKGDSNKEEVIKAFELAIEYIGLDISSTPIWLEYLAFLKEEKTATSAEEG SKKNAIRKLYQRAIENPMHDLDQLWKEYEQFEQASGNKNLAKNLLAEHSS RYQHAKTVYRERKALLEGILRNMLAKPPRATDKEEHQVRLWRKLIVYEKS NPQRFEQAQLRQRITATYNQCLLCLYHYPDIWFEAATYQADSGNHELATN FYERAIQAIPNNLFLHFSFADFLEINKKVAQAKEVYERLVTPSTLAELSH NPLVWIQYMRFARRTERIEGPRKIFKRAKSHPECTYHVYIALGFIEYYVN QDTKTAREIFELGLKKFSHEIPYVHFYVDFLTNLNEDNNTRVLFEKILSI IPSDKSEIFWRKYLDFEYRQNQDINTIVKLEKRFQQLSPSNEKMSIMQVL NRYKFLNLWPCHPNEIEIINKNLIEEDQDIAIEEQSAENQQQQHHHHHHG KKKDKHDRGKGGASGSGGGGDGGSNDSSTKDGKSYYNDKPSTSTKIPVST WKTTRPDTTMMITYRLNEMGKISTPSGGGLGNGTGNDSSNSAGGSGGVGG GNQPNMMRNDPRNNNNGANNNNQWSNNDNNLIPDFIKPLLRILPAPNSFR GPWIDVDQLMMLINDTPIPNNSPITMGLGGGIGGGVGGIGGMSGLDSPPN VMMKPSGGNTGGGNVINKNINKSMNQPHKRKMESNNNNNNNPDNNSNDND DSQHQPTTQTHQPHQPAIVNKPPEHDIYRKRQASKLSKRS >PPL_08524 g7627, MTKDDNGALVETGVNKADERKKAKHQKKKEQKKRQKQKKISQLQDQSNNN TSNSSSNNNNNNNNRKKQENGNGIHKDNIENNANGDDHVDEDMGEQEEEF LIDESDPTFELYNKLLKHFDNPTGYSEEDEQQKEQDKAEQEEEQQKEIAI KEEPKDIDDNGESDEEENDNDKPSKMSNRERKRQQKLNLPILKQLVDRPD IVELHDTNSPNPSFLISMKSTRNSVSVPTHWCQKRKYLQGKRGYVKQPFE LPEFIAATGITKIREALLEKSAQQKTKTKQRERLQPKMRTMNIDYHVLRD AFFIHQTKPKLCIQGELYYEGKEFEVSIKKTKPGVLSEDLRRALGMADNY PPPWLIHMQTHGPPPSYPNLKVQGVNAPIPEGAQYGFHAGGWGKPPADLQ QQYANANSHTNAIIDSLTAPVEKEHWGELLAEEEYEEEQQEDEEDVDQQE DEEPEESDISEGISSVPSGLETPDTIDIKKGRQQQQDAGQPRQLYQVLDQ TSRTIGSGIMESNYKYNVPSTIKTSTTTTTPGRGSNKVDIIKSQRSAPVD ITFNPSELEDMNELDEDLLKKKYEQAVAAEKGPQKPKEDLSNVADDHKKR KMQSSKDDKQKKFKF >PPL_09354 g8389, subunit of the splicing factor SF3A required for spliceosome assembly contains PRP9 domain characteristic of splicing factor 3A subunit 3 expressed in pstO cells MSSTLLERTRELHESIERYELMIVAEQSEEPKTQKDSVIQSHCVNHYLEQ SIKCANDLKKIYQDEDGQRKADLSAISGQGPAIFSNFYDKLRELKDYHRK YPTLEIERIGSVLNYTPTLSFSGNEAYGRFLDLNEMFELYLNLPFVQKNI DYITYLSLFSKFNYNDISRFKNAKYKQYLDKLYQYLASFMERSQPMFDMK SMNESNEKEFEDKWNNKEFDPSADNNNNNKSDSNGHNNNNNNNSNNNDNK NEETTADESMDTKETTTAATATATATTDDTSSPLYCKACKKLFASENVYN GHLKGKKHIKLEELLQKSQSENGGLVIDMVAFNHKSRKPTSLLEYQISKL GELLDDQVQETKESVIKKQSRSIKEIEDDMNTIENEIDDIEIDDEPIKLR IANYPVDWSGKPIPYWVYKFHELGVEYKCEICGNQSYWGRKAYEKHFQEP RHSYGMSCIGIPNTLHFHHITKIKDAMELNKKIKEINASVSFKSDKDEEY EDENGEVMNKKTYEMLARQGLIKKRKAN >PPL_12522 g8876, MLTDHFNILVFNSTRIYGILVTQNDSFMLSSGSNNSSGNGVAPTSSSSAA VANSTPTTSSNNTYQISQHNTPAKTMIFHNNNNNQNQNNNNNNNHSGSLN GSGSVGNGGYQMIFSPALSSSSSSGSLIMSDTASTINENYTQDEDTEDDY DYDEYYDDDDLSSNLSSSTTITNNNGGGSSGKGRLNSSSPQPKEKNHSRG KWTPEEDEILRKAVSDNNHKNWKKIAEQLPGRTDVQCHHRYQKVLHPSLI KGAWTKEEDDKVRELVAKFGAKKWSEIALHLKGRMGKQCRERWHNHLNPN IKRDAWTTEEDKIIKEMHDRYGNKWAEIAKHLPGRTDNAIKNHWNSSMKR VTTKKETTTQKKSTGTNGSSRKRKTDSHDNNNNSNNNNNNNNNSNNNNNN NDSNNNNNHNNNNNFEMSTHLVSPFKEANLNLSLDVPSLQQYITGINSSP KQKESPLRINNLTQDHHFQNIITPIKPFSNPGNSKKKARIDFSPSKQTDI FNSDLPLPDLLLFSPIQKVHDRSILETSELSPLKSPFHNTFFDTPYKNAS LYDHYFDSPSKFQPFSPFKSNPSHSLPSFNVLSPSQPNNNYNNSFYHKNP STPNAILPSSPYHSLSLMSPSKQYQALPQPTTTTTTATSSIISAPTSQYS SLSGKFDSSHRKIIGIQLSDKGIDKNSLNTINSKLKGIDTSATSTPSSIS SSDQHNTSGNNLFVPTTPFKDPLPLYHNSPSAQFLSNNSSAFSTPGSSRD RSRMSQKYQQPLFFDQDTNPSPYKKSQQQQQQSYNSDTSSNHDNINNSVQ NNNHNNNNINESVNKNMNIMYNEQSDCSSANTTPSDCSFEALKLLKDNSK HSIFTRAKQILERSNDQTLKISNISISDIQVPSSPSTSVSNFIYILTPSL MRKSSMTENNNNNNNKPHQQHSTTTTTTNTNNNNNHNNSNNNNNHNNNNG NAQPNSLATAPQQQQPNTNNFNYNQSLHTSIIST >PPL_12567 g8917, MCDWAEAVAADGKKFYYHKITRVSVWEKPEELKNYEANFQQYTAGGGAGA SSTSASSNQHHRHQHQYHHPSSASQQLPPNWKEYTTPEGKKYYHNELTKE TKWELPTAITNVIPSSSTSSSSFPPISTSQPENNNNNNNNSNSSSTSNLN SSSNNNNLKESGDGNNNNSSSSSSSISSNIGNKEMDKDSANKIFKELLND NDVGSTWSFERAQKIIINDDRYQVLKTMSERKMVFQEYLVDRKKFELEEK RKREKRNREEFVKLLKESPEVTLTMSWRRAQLYFDGDPKWDAVESEKERE DLFRSYMVDLEHTEKDEREQAKRDQIRQLRHKFESDPTINLKSQWRKVKD EYEADPLVVAMDRFDVLTTYENYIKDLEKKEEEIQRKDRERLKRDARKYR LLFREFLNEKYQNGELHAATKWKSFYKKYNGLSVFENLSTQTTGSTPLEL FTDFQEEMEDNYDKDFKKIKDIIKDLNYQYKPKTTLESLKEDLSKHEKYN SILPANLPPFLLYLEEREEKKLREIEKRRREAISNFKVLLEETSSISKHS TWSEVRPLISGASDFDRLEDEQEREKIFNQYLEYLSNEESDEEGIIKSDG DDNGSRRESFSTKKRLSSAIDDSDRKKKKERSSSHY >PPL_09416 g8993, MSHQQKDNKNKKFGGGGGGGSGGGGGYNNSPQQKHQSPKQHHHHGGGGGG HHHNNSPQQQSHSNNNNNSSSGQLDDSTKRMKERTVFMSMNLIGHHVAVQ LKNGDCYEGILTSTNTSQVGWGCALKFARKREVSPPSIITTAPIPQLVID AKDFLGLTATGIVFDNISQSAFGKEAQFGFHTDTDISGHDGVIRERELTP WVSEDGGENLESVKINPANANWDQFATNEKLFGVKTSYNEDLYTTTLNRD SDHYRTRIKDAERLAAEIESKQSNNIHLMEERGLIRAADYDEEERYSSVI RNGTSSSSTTSPPNNNDKSKNMMPTSNSNVYIPPSKRGSVTGAAGTSSPS IPPLSTQKVSPTTTTAAASTTSPTTKQSTTTAATTPVAAAAASAKPTASV EEKSTTTTTTSTTTSTAAASTTSPSNNSPSHQQQQQHFKESTSGSTGSLL KDSPVTKLRLRDRAGSIDHNDSLLNSPRDGQSPRTLQSYNKIRAAIVAEK LRNSAEPRSPLFSPLVSDPVGLSALSLDVSKPNISEDTIKEFNEFLLTKS TTEQPPSADRKSQIENLKNFSRDLSRSRPGSPLIGPNSPRPMSNLSSISL SGALSPRTSASDIPVIRPIATAPGAATTTTTTTTTSTTTDDKEKEKETTT DATAVSSSADTKKADPATKSETTTDKPADTVAKPISKLKLNPNAKEFTPV SLNSNAPVFTPKNFIPATVAKQGLLGSGNIEFYEANSRANTYPNISINDL YYESMKRRQQNPEQSNAPSTYWNESYGVRGSSQYGADDDYMPPAQYPPAM RPPFIQMGVPAIIPTYYPPPVANVAPGAPGVPVPVKSMKPIYNPQPRGQP YAPPPPLLQAPGAMGQPPPQYVFPPQFQYVPQVYPPPGGPHTMPPKRSYY PGQNPSNGYQPIQPHGIMLPQNTSQPPSPQHQSPQIPSPTSPPHHSRMVP SSPQMINPMYPYPMIQRYPPHGNDPNATYPPPYN >PPL_09824 g9376, MGIPAFFRWMVDKYNGIIVPTKEPRHQDGSRVVCDNSEPNLNGEFDNLYL DMNGIIHPCAHPEQGPKPRNTQDMMDSIMEYLDLVFAIVRPRNLLYMAID GVAPRAKMNQQRSRRFRSALDARLTREKEAKELMENIANGKMSESEVEAL QKAKDEKFHFDSNCITPGTMFMDLVALTLRSYVAEKVSTDPAWKNLKVII SDASIPGEGEHKIMDYIRKQRAQKDYNPNQKNVIYGLDADLIMLALATHE PNFEILREFVQTKGRKQQSTPKEMQGGDDGEEEKKDFLTKDYQLLSLNLF RDYLDSELKCSPAFGFDIERVIDDFILICFFVGNDFLPHLPSLQINEGAI DRIMKIYKDLLPTFDDYLTEDGEFSIDRVGKIFAKLAMVEEDIMMRRKSK EEAMIRRKARMDGNFQQLQEDPQSLNLNSGTTKQHQEAAKNLLNEIFTET DDTQRPNKKLRVEDGAANKSAAAKIREELLAQKKGTTSNKDAAKQLQQSI VEQSTTAPADGKKKGEKRGAKVIEIMEVPVEEKKLPSGNKKRKHGEEEEE EQSNSNGGEGQSLESSTFIDRMKDVKIGTKGWRERYYNHHFESEAKEDDN VVLHVCQSYIDGLAWVLKYYYKGCASWGWYYPYHYAPFIIDISENIELIK PATFDLGSPFRPFEQLMSVLPKESGQFVPKPYQKMMGIDDGSGDVSPIIH FYPSNFMIDVQPSQPIWKGVCLLPFIDEKELLSSLKSLENKLTDEEKFRN SQGAELLIVNKDLKLTNDENAADKDHSSIDSKVSPNFLGDISKLPERITK KLPPMQSAIAYSYENPAYPEGYVFKSEMLPNAVKAPRTDITSLRFSIQNS AANRMINHAVDSNQYKNKHFNRNQGYNNQNNNYNKNYNNQNNNYNNNNYN NNNNNYNNQYQNNNNNYNNNNNNYSNQNYNNNYIQNNNPFNNNNSNYNQN NNYNNNNNYNQNYNNNSNMNYNNQNMMNYNNNQNNFNSNYNQNNNSYNQN NNYNQNNYSNQSVMSYNQQQQQQQWNQQQQQPQQNIMNNSNQQQQQFRQN QNYNNNGQHQRRNNNNNNNNNGNNNQQPQRSKYNPFAKLKK >PPL_01155 g991, MGITGLSAYLSEELGFQSSTISSNSNNNNNSNIVDENSNNNNINNGGKKF NTNRANEPRKADHVFIDMNGIIHKQVRRNSNSELTVDRVKRDLIDTLKNI MKGSGVFYHTKSIQFIFDGPGSRSKILLQRKRRSKKIEDLVDTKVNASLI TPGTSFMGEMKQLLLDYSKKLLRESQNLNLKDIHVSGSDRWGEGEFKIFE HINSMNWKENTNVSIFTCDSDTILYALLSDANIRIHDLYDPKSYKDISKL KQELSLLVPQRDKKQVFVDFVLINLFRGNDLLPALQSFNFDSVWQAYVSS PDKLGVYNLETSEINWELLLDLLSKNRIINSPQTKISIVGEFRTLLAVLA RDLKREEKLDFQLSMLEVGEKNVVDIVGVFDGQTFKEERVKSDTQTKTRF LKKFFDIDHPFWTKYKPLLTREQIDGLVRRMATSMANYVTANAEEPSIEQ YVEGIVWQMKLMRGQCTNFQFHYPYFFGPRVDDIKPVAIPKDTGVSVPPL LPLQFCLALTHAQAKNNVNQIFHPMFEELPHYSLLQHVSTDAWKHPEALD QLNQSFKKYVDTSKLTEFENSQLSFHPTLHLSKHNNSIFLREEILNSKQY TSPKHLETIPYKPKALNQNINNNNNFNSNNSYSSNYKSNYNINRVSKGAM QYQSRFGIKDSTTTTTTNTTTTDSQVNQGVINAVRNYFTTNRTPTIQTTT SLIPTIKYSSSIRSLLFKLIK >PPL_10444 g9918, MAKNEDVNKEVEHINSNNSIINNNNNNSDNNNQQNNQNTLKISRLPPPLP PTSNNNNSNSNSNNNNISSLNNVTITVQPPPQQTTTQQQQQTTTPLPPMS QHQQFQHLDFYSDGESDRMTRDNNNSNASEHTKTRPRRPSSPLAFLRSLS PKHKEKEKDKKKEKHLKSQMEQLSVSPPRGMGVHNYYNHESTPSTSPNIS LSTTINNNNNNNNNSISLVTSFNNNNNLVTSTGNNNNNNLITSNRKFSGA SLVDSGMSSGGSSLMSSNGDSRFNSLPHTVILRILNFLVQSRIDGEKSEI KLEYRSPPMPHQKSAAGGQESSLSSSGQPLSNSLQSSINNNSGVKVNPNQ DLQTMSLVCKYWAKEITPQVFHHFVVKSPKHLKSLIRLVSKGIIDGGRRF NFYYVAMILDKSSSFQKFINILKHTMPDRLPDKFYKATSKPIMTNVFSKS LFTSFFDNCSTMEYFRFYQRWVSPDNFEAIGHALRTNHSITHISFRNNNL DDEFVVDIIQALHENNTIQILDFRLNKLGNQTAISLAGALLKNRSLTCVD LFYNAIGPEGGVAIANSLRTNRTLRKLYLGWNHINGQTASILSESLKVNN VIESIYLDRIDDHSGSLLAESLAVNTSVTELNLADCQLKQFTAKALGSAF KVNKSLADLNFRCNQLGADLKDISQSLSVNHTLTRINLSDNRINDESGRL LAESLKTNHSITSLSLSLNQLGNKFADEMGVALLENTTLKLLDLSNNQIE FTGAQHIANALASNSTLKLLNLCQNSLSSKFGPLIAYSLTQNKSLTHLEL AYVGIGSAGAVSLAKAVKDNIHLRKLNLSENQIGDDGALAFADAIKSNQF LYVLDLSYNNFTYRVKEVFEKIQEQNNTLQFSISSVPLHWKFQL >PPL_05499 rtc1, ortholog of RTC1 which catalyzes the conversion of 3'-phosphate to a 2'3'-cyclic phosphodiester at the end of RNA MTKTKRNFKPHNKKTKVPQAPATSSTNSKDDSSLPEVEVNPSFVLDGSIL EGGGQILRNSIALSSLLSKPVRIEKIRYNRDQPGLKAQHKAGVDLVSRMF KAHTDGVKQGSTVLYYHPRISTTNIKDQSIEADTGTAGSITLLIQIALPC LLFTPKSTKLDLGGGTNVDFSPAADYLMNVFFPIAKQFGINSNMEVLKRG YYPRGGGKVSLITQPIKGTLNPISILKKGNLVKFTIRVFFTSTRISAEVG DRMLNAARKMIKKDYKKVEIVEELVDTAKYTFGDGCCIFITAETDTGCLY GGSANGAIGVPAEKVGEDAATSILNDLLHGGCMDEYLQDQLIIFMALAKG TSQIKTGPISLHTETSIHFTSLLTGAKFQVKPAEDKQRGEDTFIITCEGV GFENKSSETKSDEEITEQNGNDDNTSTTTTTTTSSSS