TitleGenColors Logo

Gene list

Applied filters:

COG category: RNA processing and modification
Gene type: CDS

Number of genes found: 149

Free access
Sort by:

 



# Dictyostelium discoideum AX4, AX4

>DDB_G0272997 DDB_G0272997, 
MGIEGLNLFMGKQFGCVEKVKEITTDHVYIDLNNYLHKSVKRGDNSKLNT
EVNVFRILKSFIDGILRKVRVKHSVFFGIDGPGPRSKMILQRERRLKNGN
IDKLKYFINRNKEQQQQQQQQQQQSPQLHDYDYLNENDLKSFDSSFSTLE
FTPGTTFMGKLKDFLIFYTKNKLYFAKKIFISASDRIGEGEFKIFEQILN
SNYPINDSFTIVSSDSDILLFSLLSKYKNVYIYNKDSEEIIKIDKIRDKI
YQQCYKKKQKKQKQQKKSGGEVEGEGEGIATEEKEEGVQIEEEEEEEEEE
EEDISKRRQAIVDFVMLTFLMGTDHLPKVSSYNIGSAWSEYCKIKKPLYN
EETGLINLDLLFRLIGKSSQPRLFYRNRFVNNNNNNNNNNNNNNNNNNNN
NNNNNNNSDVNNNQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ
SPQILKSLFNNAGNYYCLSKGISQPTQNFKCEKDKGNGSNSKIKTRYNII
IENKIVDSSNDIDYLIRSFLSINHPFWESNINIIKEKYIYKLYNLFTSVK
NGCNDNDADDNSEEEDDGCNENDEDEDDNQDEDEDFEENEIENENEIENE
NEIENEIENEDGDVKMNEKETITTTTTTAAAYSKKDILLNHYLYGLIWHI
NYFGGKCNDFNFSFPIKSVISSEVFKNFPIRFLNNNNNNNNSNNDNNFDI
CEIERELLKIQNYSNILKNPLPPVPLLFGILLIDYSNKHLFPTIYHPIYK
ETPHFNIMDRKRMEKEFRKSDAIDTLTTLFNTKTDKSKLTPYQQSQLQFS
PTLFFNILPNRSIEFYEELFNNNNSNNNNNNNSGNISPLKYKLINTFRNN
HFFNSELKLQSSDDIKDCLNNLYNNNLYNNSSYNNKNSIFNYNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNKHYQNKSNRFNNFKPELEMKAQQVQYQTT
QELEDQDQALIQQHLKQYQQQQQQQKSTQTEQPREQPQPKLPKQPKQPKQ
QQQPKQQQQQQQQQQQQQQQQQQQQQQQQQQKQPKLPREQRAKQAKPPQQ
PQHPKQPKQPNEQQQQPKQQQQQQQQQQQQQQQQQQEIENNFQNIKPIYQ
KSQHPQLKKQKQTH
>DDB_G0279513 DDB_G0279513, 
MMTNINELPIPVLIKLFGFVNKINSSSYWFVNYTLVCKLWTTQILPTAWN
EMTIATSQPAEPLIEYSEDGILSQFPSTTSEPYKFNKLVMRVGKVSEYID
KIKSLTTIDLRNNSSTNYVIDRLAEALKSNKTLTYLNLYNNRLMQKGGTS
IANAMKKNQSITHLDLGLNLLGANGGNAIADALKVNNTLVHLDLSSNQLG
LRGAGPVVEALKINKSIKYLILNSNQLRDECSLPLADILRSNIGFIELAL
NDNEIGSKGGIALAKMLKSSKVLTKLEFGKNELGDDGGLAMADVLKNNKN
IKVVRLNWNKLGVKAIKALSESFKTNSTIIQLDLSFNNFGDEGLVCLSES
FKQNKSILSLDLSRVASGLVGHKALADSLRVNNTIQTLDLTNCKITNEGG
VELAKSLVDNKSISTLILNNNTFSKDTVSELAKTLESNSTITSLSLVHNQ
LTIDGVEDLFKSLSTSTNKSLQTLDLTNNLLGSDGGNIIAQHLTKSNLSE
LILTNNQLSSQGASSILNVLPQSNLQTLDISNNSIEPDVATSLCSAISNS
QILKLNISTNKLDDTVIPPLIQAIQTNQSLISIQISANQFSKESNNKLLY
SIQQNKSIYYYDLVEEI
>DDB_G0284217 DDB_G0284217, similar to H. sapiens CNOT7 and CNOT8 and S. cerevisiae POP2 which are components of the CCR4-NOT transcription complex
MVTLHTDEIKDVWGYNLDEEMEKIRNLVDDYNYIAMDTEFPGIVTRPVGN
FRSTSDYHYQTLRLNVDQLKIIQLGLTFSDSEGNLAKPTCTWQFNFKFSL
SEDMYAQDSIDLLSRSGIEFKKNEANGIDILDFGEQLMSSGIVLNDNIKW
ISFHSGYDFGYLLKSLTCTVLPLDEADFFGSARTYFPCIYDIKYIMKSCK
NLKGGLSELADDLDIKRIGPQHQAGSDSLLTSTTFFKMRKMFFENQLDDS
KYLNILYGLSSFGPDGTPTNIHTGAGNNPPPQLSNSGSTNYGYSPLSQQN
NPNNITNYNSTNPASQNQPTYYNNYPTTPTRYHNSSGPYSPTGNNLSMSQ
PNTPSKNNSNDYYHKKS
>DDB_G0285269 DDB_G0285269, 
MYHNFNYIDDENGHIKMLPSPVKPNGGKLRTPTSKATILTADIGHEIREV
WAHNLEYEMSLIRELVDIYPCVAIDTEFPGFVNKPIESMRMYPDYNYQTL
RSNVDLLKIIQFGITFSDSTGCLPVPTCTWQFNFKFSLKDDMYSPYAIEL
LKSCGIDFQRIEDYGIDVNDFSELFISSGIVLNDKIQWICFHGGYDFGYL
LKVLSCSELPKSESDFFDLLRIYFPCIYDVKYLMKSCKNLKGGLSGLAED
LNVVRVGPQHQAGSDSLLTNSTFFKLREEFFENEIDDHKYKGILYGYNVS
QNFHHNGHL
>DDB_G0285705 DDB_G0285705, contains a RING-finger that is a specialised type of Zn-finger domain which in some cases has been shown to be involved in ubiquitin E3 ligase activity contains 14 putative transmembrane domains
MQQQQNQEEEDFCRVCRNGSTPDNPLSYPCKCSGSIKYIHQNCLLEWIQH
SKSSSCELCGHPFRFTPIYSPNAPEFIPSHELFYEALIRFKWYIKKISRI
LYIVFCWLFIVPTVTCWIFNFFFGQKWLVPLGRVVMENSGFGSSHHTAVT
LFYDFFIGTTLFFWIILASIASYMIIDFIHHKHAEIEIQDEFEFDSDDTY
LQNLQQPQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQL
QQLQQQPPLQQQNIIQQADNNNNTTNEENVYNNQDLGFQITDDMPIRLRQ
RLRQHQQQQQKHLDQIRLQELQEQLAETISRDDQVGQAMIQDQITTLQLQ
IQRNQQPQQQPQQPQQQQQQQQQQQPVAAGGEPIQLQPQQNQRVFFRIPG
VLEMLRLQDVPEEHENDIENDPNQIDDLEHFIGLSGPLFNILTHCFILIV
YNAIFLSTFLYFPYLIGHTFIEKLPFHVKDILKLVDGSISQCVAGVAVGY
TILSFTALIILSILIKEKIAFKISTLTHSFIKIGVITIFELGILPIVVGC
FIDFCSLRIFGGSIEARLSFALSQKMTFLFSRWIFGIFFMVNFTNLCSIF
HQIFRKGVIWFLKDPSDPDFDPFKDMIKLSFKRHLFKVFVSLCAYAIIGL
LFVFLPALFLSTIPGFLPINLQVNDPITKGSADILFIVAASFFPKFDTKI
TIKNIVKFWITKSSKILSLESYLLPKEQQQNVQTANQQQQQQQQQQQQQE
EEVYNEEEIEEDDDEKEPNQQQQQQQQKQGQVQKQQDVIFKPNNFKKRIS
LFIFLGWLTLFLAICAYISIPVLVGRLILGPFSNDNDIYCILVGLFCGWV
LSKIAFLLFSPSSSINIIQWVSIGLKVLLIGFIMVIVMPLLTGFLFDFIF
MVPIMAPYDESFFIHFGDIFQNWCLGALLLKFWYRWINATNQNPDNNRNN
VIEDLDQPRDRWIDRFKQFKRNGISNIDLKWTFSKIIFPICHYLFTLLTV
PIFFSKFLVPLFGGSLILESISFRYGFAVYCFILLFEKILHKIKQWSSRF
PNMIRDDKYLVGKQLHNIDQQQPLKLDDSGNGSNQTIF
>DDB_G0289199 DDB_G0289199, 
MKRSNNDSNGPNKKSKQDDDPLSSILGDIDQSQHGKLQSTSIKSTFSLDS
VMNRIIELSKQHGGIDSIEVEGRIGLFSNTSNGNTFKPGMVRDDWNDLYH
HLKLKSEPVATTETDYIYSDSIRVAYDEHSKKCLRKDKKTDKTSFDQSTN
LIYDFRISTSIEEKFPPPLSLPPGYIIRREKQRYTFTEDQWKIDLTKVIV
RPDFNATVEQELYEVEIELFPEAIQACQEKTSLTELLNDFLNAIKGLTNI
VKNGGETSFPEISLDKVGNVSEFYRLRDLVFKYIPSAPQRKNDTFPGSMP
VNFGKKYFIHVQNNEYFVSDKTDGIRYMLLIDHTGCYLVDRKFDFYQIQG
FDILVTLFGEGTLLDGEMVRNLQTKRANFLIFDVLSVKNELHHQKLLKDR
LTEIGNVVSTLRSNLKVDTPFDILGKSFQLKSKIVNLFKNIKEYPNGERV
YSDGKRCHNTDGIIFTPNIAYSNYTVHTLFKWKYCDKWTIDFKVRDRGQK
GWYLSCVANDNIEVDCREVNFSNDDLQKLRREFQRARDTSTVVAECSFQP
KWGTWKFHQVRHDKKKGNYISIVMDTMESIAENLSSDELKYRIPLLPHDD
NWEEEMSRIRSQLINNIKPSKSSTSTYQPFPQ
>DDB_G0289461 DDB_G0289461, putative ortholog of H. sapiens CNOT6 and S. cerevisiae CCR4 component of the CCR4-NOT transcription complex
MEETDKNNITTTENSIKNEELPSSSTPPPPPPLPPQSTIVTKSKVKEEPL
KIILPENFKFGEAEKSFSIIPGKPITYVPFIFTFKNQNSKLKYSKAIISS
KWMIDGENIEKLLHYSNNSSAPTISFTPKKEHSGKELIFEIKISPLIFEQ
HENKSNNIFSKLFNKSSSSSSTSSNNSNNNDNEIPIIIEYKHKILFEKSR
ELLKINEPLNNNNNNINNQYRIIQYNILADCYVSDSWYTHSASYSLRWNS
YRSYLLIEQILQYKADIVGTQEVDRLYWQLFKEMNVRGGYDYYPSYANDS
NESPQTTMGGFNNSYREGCFIFFKKDRFNLLQGLEIDYTKLNRPDQKLLK
KELVEILIQDPIYKSCITHFLEHSSHHVHHALVLLQDKQTKQKMIVVSKH
MYWGSQGYNYHIQCVQIHLFTMILSNFIQVNKLENNIPIVVCGDFNSSPD
DSCYNFMTKGLMMNDDHHLTLAGKYPPAFNSSQFDNHPEIKSIKHDFNFL
SSYSLRPDGEPKFTIVSRAFTGNIDQIFVSKDRFKVNNVLEIGEKQDYKM
LPSLTLASDHILLMTDLELLPSS
>DDB_G0289921 DDB_G0289921, 
MGIPSFYRWLIENFPKVLENNLNGEIKFNNLYIDMNGVVHNAIKLDHSPT
SSSSSSSTTTTPPTTPTTTYSKDKEVVLMLKSELSDEKLKERIFYRLDQM
VNNVNPSSLLYIGVDGVPPRAKAIEQRKRRFKSSKETVDVIIKQLKSKSK
PITRDSIIEQFSLIFDSNSISPATEFIEKVDDWIKDYCKQLSLKRQNLSI
ILSDSTVPGEGEHKIMDYIRQNHPILKKDGMSHCFYGMDADLIFLGLESH
LSNFYILRDPISLISCSTCKANDDHSNYECRSAIALKKQFQKKSTSVIVR
GIPNQCSENDIKQLFSYYGNIKIEKIEKSNTKNKTLNAYIEFENEDIVNE
VSTRGSTFTINNERVSIHVEYLDDIFKPKNEKGEIIKDEDEELAEQENPS
TEKANQQLLTAEEIAKKQKEEPIPENAVFIVGLDSAVSKFDIITFFEKFG
KISSFQLSPSPRFNKQQFVMIKYETQESARLACKSTDIEFFGTIITIKRA
QLPKTESNTTINSNGSSKTPPLTEEEKQAKEKVKEKKRIIKDSKIQQVLA
KFDPNTPKDVAIFYLGLSSWNVNRAFENYLLFNKAGLEHTILNNTTDNDN
NNSNGSGSGKKKKILFDYVNIDSFRTYLEYYFFEKIEKSKREKINFNRVI
NDFTVLCFYLGNDFLPHLPSVGIQSGSIELIMCWYREWIQNCLNGDDDIK
YIVNEQSTHLNFENCLPLLESLADWESSLYPENLEKQVKKDFIKVNKNST
SNESSSIVMVPPEFKYDDLLYYRVKFDLIGKPDDQVKKLVDDMCYQYTLG
LHWVLRYYVSGCQAWDWYYPYHYAPLAKDLLNYQKRLSKIQDREMVNNQF
NFKLSSPLPPLIHLASVLPRNSVAFLPDSMKHIVNENSPFTASYRDDYKY
DFNGENVAWKAIVLLDFMDIEKFKEYLLPIVNNELTDNEKKRNLIGNDIK
FKNGEIIQLSPLCNNTTTNNNNNNYEEKEIPCINITKNLSIKDLSYPTRN
YFTRSSHPVIDSNYISIPTQQASTVDHNQLVPLTKEQADFLQWRKSKTGF
NQSLEILQSIDFSKCGCVNIDKKQISQFSGSGAATATITCWLGRVLNNEI
SSDSTIFIQSIDDSQLIINIGFKQAVKLTSIKFVSSSNRVPDRDSVPKVI
KIYTNNDQPNIDFSVIESLTPKCTIEFSSPSELESYSSSTPFSFASGTTT
TNTNFKSVNNLTIFIESNFSKNQDKVSIIEKIILS
>DDB_G0291836 DDB_G0291836, 
MSDINEDEYNDEEMKAVLKEDSSDSSDDYENNNEDLSNSSDDDDDDDDDG
DDSSDDDDDDNMESKTDYENSSEGLEVGLLEIKLREDPYSFEKNLNYINA
LSKFTKQSNYQTLREAREKFQSIHPLSQDIWLAWFSDEQKYMKTDNDKQY
ILSLYEKALNDFISVKINVSYCKFIIKINTNSGGLINNVKEIRKQFERSL
EQCGDDIIESPLLWSEYRMFEQMLLSQIKDDKEKQTQIKIIRDLYHRQLS
NPMIGLHSIYNDYQQWEHSQSIDNNNNNNNNQEKEKEEKEKEEKEIKLKF
EKSLKQFKEREPFEIALKEKKYLDQRKWKEYIEFEKQQQHNDKPMRVATL
FERQLKSFSNHFSIWSFYLTYLEKFTNFKDLHLKVFSRSLRSIYYSGEHW
SKYLLLLEERVHNDNDNDKRVKIEQEFQRSLVSGLKSEYDYQLVYNTYID
YNWRSIIKKLNTDSNANSNGNGNNNNNNNNNNNNNNNNNNNNNNNNNNNN
SISENDKQLMKSLFETMNNQMSTIDVNNYTTVDRYMYIAQFEWRQFNDLS
RYREIVDYVLSIDPSQYWIWCQYISFEMEQKQFQSVRELFKKASSHIRFD
DPSSRIWQDWFTFERGYGDINQYRAVSDRYSIIQNKYNKEQERYLQQQQQ
QQQKQQQQNKRKEKDDGKNKDEKRISKKQKNENHKEKGDQDEFKKPLPPT
SKKEKEKEKDLPKKLIILNLSFDTAEPDLHKIFDKYGQIKSLKLVLDKNG
KSKGICFILYKSHESANKALEMDQQIIKNRTICVQYSKDQQINDHVESNQ
TLTTTATTTTTTNEIDFENHIGLTVFINNLSPSVNKEKLEQFLRHNGVTG
IKDIRVVLKARPFAYIDLIDKENLKKALSLDKKYFLSKLINVNLSKPPSS
ISPANNNNDNNNNNNNNTITSNGNDTEFIKEIPSRKPTLLVPRGIKNKK
>DDB_G0269682 atxn2, similar to the human ataxin-2 (ATXN2) defects in the gene cause spinocerebellar ataxia 2 (SCA2)
MSQSKDKKKFVGGGGGGGGNNSGGGGYGSPKHNNNNNNRNSSNNKSPHQS
HHNQQHHQQQQQQQQQQQQQQQQPFDSLTAMKERTVFMSMSLVGQNVSVT
LKNGDVYEGILHTTSTSTGSSGGGWGVALKMARKKDTNNRVITTLPLPLV
IIEAKDFLQITATGVVLDHYRDSFMNRDQQSFITDTELSGFDGNLKEREL
TPWTPDPSVGESLDDFAANSEAKKPANWDQFETNEKLFGVRTTYEEEIYT
TRLDRDSEFYKINQSVAEKKAQEIENEKSGNIHLLEERGFVEGADYDEEE
RYSSVVRKGLLPTSTTSTTTSPPTQNPTPSSSVYIPPSKRNNNNNTPSTP
SVTSPPIVDKKHQQTHQDKKQTQQQQQQQQQQQQQQQQQQQQQQQQQQQQ
QTQPTTTATATASTSTSTTTTTNESPSSSSTSSTPSTPSTPKNITTTTAT
TSTNNTPTATNTNVNSPLGDRESPTISKLRLHQSTIDQDVMGSPRENLSP
RSVAYTRYRQILSEPTNKSMNKSGSNISTTPVNGSGNVGPNGTPLLSSVH
SDQAPKSPVPTIVSNHGLVKALSLELATPTVPEKFVNDFNNFKLKINNVD
RGSETQGLKSFSSNLVIKSKSRPGSPLIGSGSPRPTPTQLSLSGSSTSTN
TSTTSPPTTNTTTTTTTATNSTTPSTTEDDKSTTTPITTTILTENKSDDK
EKEKEKEKEKVDEKEKEKEKEKSDEKDKDQSSTLVEKKDESSSSSNTTTT
TTNTTNNNNNNTTTVTKLSKLKLNPNAKEFVPVVVNKPQPSFKSTTESNT
DSVTPINEIYYDSMRKRQLQPESPDQVSLYWVDPFYPRYEEDPYAAAYQM
RAHHHMVGHQPPPQLQFNPQFYSQQGHPQLQPHHHMVPPQLQQVPPGVNV
HTMKPPGSLQPGGGGVVQPQGIVQPQGIVQPQGGVVQPSAGGAPKTMYQQ
QQQQQQQTGQPGGPMGVQRGGHLPPQQQPQQQQQQPPPQFIQGIPPGANL
VISNGPPNQPFVFQGAHPPYAVPHPQYPMPPQGIQGGNKRFYQPPPQGYP
QVQPMIIPQQGQVVSQNSPQQDSPSNRLNQQVPPYSYMTHPPRGYHPNEN
QYH
>DDB_G0285829 bxdc5, ortholog of S. cerevisiae RPF1 and H. sapiens BXDC5 a nucleolar protein involved in the assembly of the large ribosomal subunit
MVKPKKEVDKDDLTKEELKLRRSPTDIKCKAKRVLLVQKLQAAKKTAREK
ARKQRKKEREILGDAAPAKEVPRTIESMRRADETIVDTENDKEFEEEINK
DEFESYFDGRVPKIVVTTNQRSTKEAVEFAQVFTKLLPNCEFFHRRKYHL
KEIVQFCNNRDYTDIIVVNETKGIIDELTISHLPNGPTAVFRLTNLVMPE
DIPGGGEMTSHKAELIVNNFTTRLGHSIGRMFASMFAQDPNFKGRRVCTL
HNQRDFIFFRQHRYIFESKEDANVQELGPRFTLKLKSLQKGSFNTSTGEY
IHLHQHNMDVDRKKFVL
>DDB_G0279311 cdc5l, contains two Myb DNA-binding domains ortholog of human CDC5L and yeast CEF1 may play a role in transcriptional activation and in mRNA splicing
MRNVKGGVWKNTEDEILKVAIMKYGLNQWARISSLLTRKSPAQCKARWHE
WLDPSIKKTEWSKEEEEKLLHLAKIFPSQWKTIAPLVGRTASQCLERYNR
LLDEVQRQQDNENGGGSGGGGTTTTTTTTTGENDPRRLRMGDIDPTPETK
PAKPDPIDMDEDEKETLSEAKARLSNTQGKKEKRKFREKQLEEARRLAFL
QKKRELKAAGINYNPKKKGKEKSWDISKEIPFYLKPKAGFYDVPDEELRD
EPNKDASFIGKRVDQIENPNYLQRQEKLNKLEDIKKSKKEIFNLPQLISE
TSKSNDVEHSIKRTKLQLPEPQLTDDDIQEISDYEKLNGSGSGGGSGGVG
VGEFPLPAPRTASISSTAANNNTNNIRTPMKQDTIMSEAQNLLALSNAQT
PLKGGAGPNVSQTPLPKSVNNSTPFRTPNPLANQTPTQHNKKQSLNDSNE
FAIEDKFKRQQGKNQLLSNLKNLPSPTIEYKLELPSELPTIEDDTTLELD
NSEIHIREQQQLKHKEQFKLRNRSTVLKRNLPRSRNLFPINKNNNNNNNN
NINQDELRILKEINRIISHDNKTFPNDSITPSSTFDDDDDDDNHHHHHDD
IDNNSINDNDEKYENYDYFTNTELEFADKLIRDEIEQIKQELKQPLPSSN
EILEEIDQIRSQFIYLPKENQFIEKSNANQTQLIENLQFEYDKTLNKIKN
SSMKSVNLEKKLNIYNGGYQNRSNTIIKNIDDMFDQLEQSEIEYQCFVAL
KNNESIQMEKRLKSIENQVYDQCEIESRLQQKYAQLLNEKNLLKKKLSIF
>DDB_G0285507 clp1, ortholog of the conserved CLP1 protein S. cerevisiae CLP1 is involved in both the endonucleolytic cleavage and polyadenylation steps of mRNA 3'-end maturation   Mammalian CLP1 is a subunit of cleavage complex IIA which is required for cleavage but not for polyadenylation of pre-mRNA
MSNDNSVNINNFSSMNGGGGGSDIQFPLKPSQQQQQQQQNSINQSTIRTL
EITQELRYEIDFDQNGWMKLIEGTAECFGTELSLNKVYKLSGTKGAVFTW
TGCKIEITNNCQPYIGEKTPMPQYAGVYQELDAFRVSILDEPKKSGPRVI
IVGPTDSGKSSLSKILLAYSARSGYQPLFVDLDPGQGSITIPGTISAAHI
QNPLDIEEGLAGGIPLAHFYGHTSLDVNPDLFKALCKNLASFIDKQLDSS
NISRISGFIANTCGWIDGLGYKILLQNIDVFKANLIIVMDNEKLYSDISS
HYSQKDNSIKIIKLPKSGGVFIRPPVFRKKTRMNRIKEYFNGINDNLSPH
YIVLDFKDVSIYRTGGGPAAPASALPIGTSSQIDPLQITEVYPSLDMCHS
IFAISYAKQASNIFHSNVAGFLYVSDIDMETKKITVISPAPGPLPSRFLL
LGTLKWMEN
>DDB_G0281585 cpsf1, ortholog of the human CPSF1 and yeast YTH1 the 160 kDa subunit of the cleavage and polyadenylation specificity factor (CPSF) complex required for 3' processing of mRNAs human CPSF1 involved in the RNA recognition step of the polyadenylation reaction
MSHHQVFQKQVLAPTGVEQCIKANLINDDSINLVLAKTNVLQIYKIRYEK
IEKYENVSDSQPQQQQEQEQQQQDITQKKKIELKPSLELIIEKKLFGNIE
SMASVRYPNSERDSLILTFRDAKISVLDYDSDLLDFEIRSLHYFEKDEFK
GGRNHFKHPPLLKVDTQQRCAVMLLYDRNLAVLPFKKTSSILDDDDDDDD
NNDDDDDENNEHDENENENENENIIKKEGDQQTKEKESVDDEFDLLFEKD
SSPPPPSTAATAETTTTIKKESNNNQDKEKKNIEIENVKDFCFLHGYYEP
TILFLHEPIQTWTSRIAVKKFTCQMTAISLNLLTKAGSFIWNVSNFPYNC
EMLVSVPEPLGGALVITANIMFYVNQTSRYGLAVNEYASIDTSTIIGSQP
FDFPIDDTLNLVFTLDRSNFVFLESDKFIGSLKGGELLIFHLISDGRSVQ
RIHVSKAGGSVLTSCICVLSNNLIFLGSRLGDSLLLQYTEKSITDDQLEH
ENFSNPYKKQKTSEVFDLFDENSETNNNNNSNNNNNKENQEKSSSSSIAS
KLLEEIEDEEDQLFKEKKNQLKSYQLGICDQIINIGPIGDIVVGQSIDPT
YDETIQPNQPEYVPKTLELVTCSGYGKNGSISVLQNNIKPELVMAFELPG
ILNVWTVYKEEIEEEHIEKEIKKNTSKKRSRDENNNNEQEDNEQEDNEDN
EEEEEEEKMQKDKNWHDYLYLSLKDGTTLIFETGRDLKEVGKFNFKSLDI
GNLFGRKRIVVIYQGGIKLINGFDRVIQEIQINEPIKSSYICDPFILLQF
HNGTIQIFKGIDEENQLIQFSINSISNNLNQSIFSSSLFFDRNKSFLNIN
NKNQKLKLQQQQQQQQPSNEKKKKKDKSRGFLDSDSDSGESSEDEEMKDI
KQENENENENENENENENENENENENEIEIKDQDNIYLNIYTTNGSYEIY
RLTSQECIFKVSDIKFEYDILGINTNVSQNQILEQVLTPKSSLSKKQLQQ
HLQKQKENGINSKNNYNQIQNSEILDIVEISLHNFNNSDPYLFMFNKIGD
LIIYKSFKREKNGELRFKKYNHSFILRDSVTEFYQKQQEKELLNGMDDDD
DMDDEKKKKKEEEEEENLNRQKRIFEFSSISGKRGLFIGGKKPIWAFCEK
GYLRLHSMDSSDNSNSNNSNNNNNNNSNTVETFTSFNNISCQDGFIYFSK
EKDVIKICTLSTLMNFENDIAIRRIPTKNSCHKIAYHSEAKCYVVIVSFP
QVTQELQEDSKKPILTDDKFQIKLIDPTIDWNWKFIDSFSLQDRETVLAM
KIVSLKFTEPDGITRARPFLVIGTAFTFGEDTQCKGRVLVFEIVSHKTQF
ESEELGEKRLNLLYEKEQKGPVTALSSVNGLLLMTIGPKLTVNQFYTGSL
VTLSFYDAQIYICSICTIKNYIVIGDMYKSVYFLQWKDNKTLNLLSKDYQ
ALNIFSTEFIVNQKTLSILVSDLDKNILLFSFEPQDPSSRSGQINQEING
NNKNDNRLPKKEQLVIFGTLDGGLNVLRPLDEKIYLLFYHIQSKLYYLPQ
TAGLNPKQYRSFKSFSQNFHFSPSTFHQLPKFILDGDLISKFLSLSQSEK
RLISNSINSTSDEIIESLKDVFESWNLF
>DDB_G0278799 crop, 
MDAIRAQLDEFLGKDRNLLPKDRIKVENDFNDPDICKFFLCGLCPHELFT
NANIRDLGPCSKLHDENCVKQYQNNKDKDKYDYEREWVRVIEGLISDNDK
KIKRNKERLLQNPNGDANHHGGPIQQQSISQLDDEEGGLLPDKEQNSKIT
ELDLKIQELLKKAEELGEEGQITEAQALMTEADELKNQKVELEKIEQEKN
ENKRMSVCEICGALLFVGDKEKRSISHLEGKKHIGFQKIREVMEEYYKSG
RRANLGRTDFYNAPPPPRDSYRDDRRSSSSSYHDIDGRRDHRYGGGSRDY
GGSDRRGGGNYNNGRGSSRDNYNNINNSRDYRNDHGKDYDRKRERDYYND
DDRRKRDRNY
>DDB_G0286645 cstf3, component of the CSTF complex which in mammalian is required for polyadenylation and 3'-end cleavage of pre-mRNAs
MEDENKEVMDTSQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQPSQDTIT
STTTTTTDNSGPTTTVAVSPPTTTATAAVSPTPENQNVTTPTPISPSLSS
VNVSPTTAVTITASPTTSTTTTASPTTVAAPTTVGSTPTTTTASPLGNIL
SLNMPAVGKRLNVQIETLENRINNDMYDTEAWTLLLNEVQSQPISIARDI
YKRFLSVFPTAGRYWKLYVEEEMKEKNYDIVEKIFFENLRSVKNVEFWKS
YIAYIKQIKGDKVENREEIIKAFEFALESIGMDISSTSIWTDYIQFLKDE
KASTQFEEGQKMTAIRKLYQRAIENPMHDLDNIYKEYEVYENSINKTLAK
ALLSDHQGKYQHARNVYRDRKSLLEGILRNMLAKPPRSSDKEEHQVRLWR
KLITYERSNPQKFDAVTLRNRVIATYNQCLLCLYHYPDIWYEAATYLADC
GDSSGCIAMFDRSLIALPKNLFIHFAYADYLESQKKQPQAKEIYEKILQA
NPEPLVWIQYMKFSRRTERIEGPRKIFKRAKSTPDCTYHVYIALGLIEYY
INQDTRMARDIFEIGLKKFPSEIAFVNFYIEFLTNLNEENNTRVLFEKLL
TWPSLEKSESIWRKFLDFEYRQNQDVSSILKLEKRYQVTVNSNTDKSGVL
QALNRYKFLNLWSCHPTEIEIITKNILDDHSDQNKDDHHSHSHAHHHPPH
SRHNANKDSTEKDQGAIDGKEGAVAAKLHKKGKGKEIKPVPQIESKPTFS
TIIPTSNWKVKKPDITQMVPFRGEIGKFTQSSVVSIPQQQQQQQQPTPIS
SQPISQQQQQQQQQPTPLSSQPISSQPSLQTNTQQQGNQPPNRSGLPDFI
FYFLQNLPSNQSFMGPYIDPEQLIGIIRDTPLPIQFLNQQQLQLQQQQLQ
QQQLQLQQQQQLQLQQIQQHQQQQQQANRTSPTLSNETLIIPNKPQQPQQ
PQPQQTQTHKRKQPDDESNNEQQQQPPPQQPPQQQQEQQQQPPTTTTATT
SVVSPITTLNSTPISAPTTVSPITTTTIPSTSSPTTTSTTVTAKSQQSAS
DIYRKRQANKLSKKS
>DDB_G0290485 dlrA, 
MQNHQNQMNVDHQFYFISGQDQSNNLETQIKKEDIDISDQASYHQPGLDK
NNNNNNNNNSNSNSNSNSNSNSNSNSNNNNNNNNNNNNNNSNNNNNNNSN
NNSNNNNINNSNNNNSNNNNHPINHMQQHHIHQHLQNYHFQSNNNSNNNN
NNNNNNNNNNNNSNNNNSNNYNNHNNNQGNGNQQEPTSSPQISHHNNNNF
NKNDNSITNLTNSDSIEESTSSSSSNLMAGSGSSASNQSPRVTSSTTPPN
NNNNNNNQSAPNNITYETPEILLKPGERPPFIGSLADIIISNILGKAHKD
EPNSQNFTNICSVCSKWRKISVGRLVNYTYQLPPDRSITNLFRNLSNNVY
PNLINLQLKVSTPTLFDVSSFVRMLLTKNTTITTLELSQNGIGNKAAHCI
GECLLANKTITHLNLSFNSIGNEGAEEISKAILVNTTLINLDLSQNCIGL
KGSKALGQALQSTTILQTINLSKNRFGAKGIDFIVESIGKNSSLTEVDFS
KNDLNEKSSKYVGEAIRKHPCLASVNLCDTKLSPESMKYISEGIQASQTI
AYLDMSRNEFNYKGLKPLAAALSMCQSITYLDLTGDSIGDKGAVQLGDAL
AQNHSIINLSLAFNNIGASGATSLGNALKTNRSLEILDLSINPEIGHLGA
IHIAEGLAMNKKISKLSMCTNGLGPIGAKRLGEALRQNSTITDLQLRGNE
IGDEGCRALSDSLKQNQSITELNLSGNGITNDGAKALCEALWYNQSLASI
QLNHNNINTQGVQFMKELLLRSYLVNLDSYFYPPTSTTIVSVLYVTRGRN
RGSNNRNNNLNTNLRIIV
>DDB_G0288797 fip1l1, ortholog of yeast FIP1 and mmamalian FIP1L  component of the cleavage and polyadenylation specificity factor (CPSF) complex required for 3' processing of mRNAs directly interacts with poly(A) polymerase
MSEVEEQTLIKDASMNEPTDKTTTEGDNGGENENENENIAEGGENQEDNN
NNNEGEEEEEEEEEEEEEEEEDEESDDDDVVVLLDQESVEASSSKPGATF
RTTPNKFSYRNPSSITPGSGGKYMLTKQTPTGGGGGGSGFNSAKSNQKTI
FEFDIESFEEKPWLKPGADISDYFNYNFTEETWKAYCERQNTMRMELNNL
GKIKGYESNKPNIGGNTTGGTGGNGIGDKPNITNGNLGVGVGGGIGGGSS
GSGGSIVGDLPPELQGDKLQQGIQRPQFKRQPSRSDINDESNLDQQQQTD
GRFNRGGQQQIPPQQPQPQQQQQQPQPQQYQTNYQRDRIYNQDYRGSGRT
YSTTNQYEDDSNNNNNGSGSGGGSDRRRNESSSSSSRGDSDRGERERERD
RDDRDRSERERSDRDRSERERSDRSERERSDRDRSERERSDRSERSDRDK
TSSSSSSSNNNSSSSTRGSDRYDRERGGDRDRTSESSSRGSDRDRDDYKS
RSSSNTGGSSSSLRSDRDSRTPSNTSSNYSSSNPSSSSSSSDYKRKNYNT
DSNDRSKKRK
>DDB_G0302418 imp4, ortholog of the conserved eukaryotic IMP4 component of the 60-80S U3 small nucleolar ribonucleoprotein (U3 snoRNP) required for pre-18S rRNA processing forms a heterotrimeric complex containing IMP3 and MPP10
MLRRNARLRQEYLYRKNLEGADKEDYEKKRRIKKALDEGKPIPTELVDFE
FKHRDEMKLDKLDGNRPKSIDDEYARAGIQDPKVLVTTSREPSSRLIQFT
KELRMLFPNSQKMNRGAHVVKELVDACRANDVTDLVIAHEHRGEPVGLVI
SHLPYGPTAYFEIKNCVMIHDIDEATPPSLAFPHLIFHNFTTPLGERTEN
ILKYLFPVPKDDSRRVVTFSNDNDFISFRHHIYEKDGYKNVILKEIGPRF
ELKLYKIQLGTLDQDEADLEWVYKPYMNSTKNRLFL
>DDB_G0293900 mybA, 
MQPKRMSHNLTVGEKAGSPFNVILNAANNTNSDNSSNNSDNENDDNQNNN
NNNNNNNNNNNEEEEEEDDDDDDDSQQNRHNISIPFTINKNSINNNNNLI
NNINNNINNMNNNMNNNNNNNNMNNNINNNGNSNISNNNTPKVEKKKTKG
KWTSEEDQILIKAVNLHNQKNWKKIAEHFPDRTDVQCHHRYQKVLHPNLV
KGAWTKDEDDKVIELVKTYGPKKWSDIALHLKGRMGKQCRERWHNHLNPN
IKKEAWSDEEDQIIRDQHAIHGNKWAEIAKFLPGRTDNAIKNHWNSSMKR
VSNNNVHLKSHAIEHSLSSQDNQDSPKSIITSSSPIPTTTTTTTTTSTTL
ITPPPPPLLPPPPSINKKEKKIKQPKKRNASEIEQTLSQPHINQHESSPI
VFENISNGNNKIDIPAAQYLMTNGISCINNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNINNNNNINNNNNNNNNINNNNNNNNNNNNNNNNNNHSNV
SANTNNNNNTINNNVSLQPPSQLNSNIAQLPSTPKNLAHVNIANKLNSPG
ELMANIVTPIKFFQTATVSMNSSSKNYNNENTNNNNNNNNNHHHHHHNNN
NNNNKRPRLDFSSATPTKNNESFSADLCSQFPDILFSPIQNKNNKESFLD
NSGLSPLRSPLHTNFFETPMKNYEYLDFNSPKIPNTISPLKNFNSPFNKI
NNHSNYNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNHNPNHNSH
NHNNHNNNHNHNHNEFAQPQPPQGNYQQSPYKNNSTTASSTLSTPSYSNN
SSISSSSCSSSSSNSASKATKVIQLSSSGIDKNSLLTINAKLNNNHHKNY
LDSSSSSSSSSSSSSSSSSSSSSSSSSSSSAASSSSTPNNQSDLATVPFT
PDDNVFNNNNNNNPPTPGKTKFKSRFSPNSKPYSYPQEYDNYGQSSSTPQ
NINNTYNSICLGTNNNNNSNSSASNSFENNEENNNENDNNGSSSGGDKVP
QMDSSFMALKLLKDNPNKSLFSKARKILGLGNISSSSLSPSSFVQQISNS
ASASSTPTSSSSTPLSSPTTTTSSVAAAIINKISSTPKYFSNQNQNQNNN
NNNNNNNISNNSNLSAFSTPGGNDHVPNFESNSFLAQSPFQDILNSQKLD
QLHQLTQTNQLSKSKVLLRNHKNSNQTESNSRSTIESINSNGSNNGNNSG
SSNSGSNSDKNNGKPISFRLMESSKPIEGL
>DDB_G0268368 mybAA, 
MELTTEAPFINNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTPNNTSYSNY
PQSGYVYQNYPLNNNNGGNNNNTIYYNQPQQYDPNYQTSVTSNSYPQYSY
FASPNIISPIPSPCLGSTPSPIPSPTIYQYSNNSNNYCAINTPPLTSVPS
PILNCNNKKRPEFNNNNNNHNHNNNNNNNNNNNNYNYNNSNNNQQQQKQQ
QQQQQQQQQPQQPQQQSQQQQQQQQQQQQQQQQQFKQTNINTTPKNLSPV
LQSVNSSASSTPQIQSYFQQPQYQQQYQQQYQQQYQQYQQPQQLSSANTT
PQTNERPNYSSIIQPNQLFSQMVPSFINGAINNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNTTTNNNNNNNNNNNYNNITYFQPYTPFSIVDNSSMI
VPDKQPQQQQPQQQPQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQNYN
DDSNKNNNNNNNNNNNNNNNNNNNNNNNNENINSSNNNNMYNICPAAAYQ
NIQFIKEQSNSSLSSSQPIPPFNLYNEQPQQQPQQPQPTQSQPILSSSST
SVFDINHHHHHQQQQQQQQQQQQQQQQQQQQQQQQQQQQQPQPNLSSSSY
ADNNNSFQSSSGNVWESQSSPIQSSVQISSPPQSNQSSIAPAPAVNLSAS
ASSVATTTKQSNVKKQKQQQQQQQQQQQTKRQELSDSEDDTDNGDDIDED
DEDDDEDEDDMEDEDTSSSSSSSSSSSSLSKKSPAVKKSGLKKSGRSKSS
SNESKAKGHWTKEEDEKLRSLVDLHGTKRWKYIASLLCLRNGRQCRERWS
NQLDPSIKRDAWTLEEDRIILDAHSKYGNKWAEISKLLPGRTNCAIKNHW
NSTMKRKLSKKQYDFSSLPPISSSIVSDNSSSLSTPTDSISSSPSTSPIT
LSSNVVVNDFDSQQQQQQQQTYQQPPPQSQDSGNNQFNFNNNNNNNNNNN
NNNNNSVESIKLYTNVNISYI
>DDB_G0289319 mybQ, 
MLYCGSTGGHYMIMTTNNNSNNNNNNNNNNNNNNNNNNNNNINQNHQHQH
QHHHHQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQNYGESTTSTSMIPP
SITTSLTPLTPTLSSQPQNIQQQQQQQHHHQQQHHHHHQQTQQQQQQILS
PMMGSKRKLEEDMGVGQPTSNLTNEYLMSNNNSLKPILVSSPLLTPLSAS
PGLTSMQMAFANASLSAPSTPLSMSPSLGPCSAPMSPSKKSKNSRSSSKS
KYQNSEERWQSTTSKDNGKPSSPGIVKGPWKDEEDAKLVELVNKCGPKEW
SSIAAKIPGRIGKQCRERWFNHLSPEVRKTNWTPEEDKIIIDAHASLGNK
WTAISKMLDGRPANAIKNHWNSTLLKKIGGDSKSLNKEKDDDDDDDEDAE
DGSSPVLSPISLYQSSSSTTTTTTTTTTNSSEKSNIPPFALSGSTTTSTN
NLNNSTNSTNSINNNNNNNNNNNNSSNTNTAITTNEPLVAPKIIRAQTTP
NSSPSLSSKKTHDKQKVPQSPKNSKQQQTQQQTQQQTQQQLQQQQQQQQQ
QQQQQQQQQQQQHIQQQQMPVISQQPHIQQLPIDQQIQNAQFQQFHLQQP
SMPSSTSVNIIPNQSSMEQHHYQQQQQAQQQQHQQQQHQQQQQHQQQQQQ
QQYLMQQHQQYQQQYQLMQQHYQQQLSQQHHQQQHHPQQAQQHHPQQVQH
HQQHINTTHNQHQQQQQQQQQQQQQQTNNSQVNSNNTDTTFSNSHPIEPH
EVPYYYDYWGSSTSGSQTLIPSDNTTNTYTIENNNDFLLFDDNQHRIQPL
QQHQHIKQEHAQMSHLPYHPTQPNNLNTTTTTTNNNNNNNNNNNNNNNNN
NIPTPNMSASTTGTINHHIHQTHHHLTTPLSQSTPSVSTDQNNYINYDIS
SLFSLPNEI
>DDB_G0288259 papA, RNA polymerase that specifically incorporates ATP at the 3' end of mRNA
MNKNGGPPVANITTSSTTITSTTTTQAKSQLPSSLSVNNLHTTQGSTDQP
TILGVTEPISTAPPSSIDFKLSTELENTLISFNLFESPEESRKREEILGK
LNQIVREWAKQVSLKKGYPEQTASEVVAKIFTFGSYRLGVHGPGSDIDTL
CVGPKHIMRSDFFDDLSDILKVHPEITEFTTVKDAFVPVITMVFSGIPID
LIYAKLALTAIPEELNDLIDESFLKNIDEKSILSLNGCRVTDQILKLVPN
IPNFRMALRCIKLWAIRRGIYSNILGFLGGVSWALLTARICQLYPNSAPS
TIIHRFFKVYEIWKWPAPILLCHIQEGGILGPKVWNPKRDKAHLMPIITP
AYPSMNSTYNVSKSTLQLMKSEFVRGAEITRKIETGECTWKNLLEKCDFF
TRYSFYIEIDCYSMNEEDSRKWEGWIESKLRFLISNLESTPKMKFAVPYP
KGFTNNLHKANNPDQICTSFFMGLSFNFSNTPGADKSVDLTKAVTEFTGI
IKDWLRTQPNPDTMDIKVQYIKKKQLPAFVKDEGPEEPVKTTKKRSSTGE
PSATRKKLKSENSDNKLNSPKSPITTNINSTPTTSTPTTTANTTTNTTTA
TTTTTTTTVPITSTPTSNISSPTMNSTELTTPTSTSTTTSNDSITTPPTT
TTINSVQPPSAQPTENGSSTSNSPTSTSINNTALPPNPTTNSESTIETTI
TLPTTLESQTSTLKDSNEISTNGTAVATEPTITSPSVNINESSTSTSTTT
TTTVTEQQIQTAPTTATPINKTIVNTMEVNELSFISSSSETSQSKPPPKK
PTISIIRGN
>DDB_G0283543 prp40, ortholog of prp40 which in yeast is a U1 snRNP protein involved in splicing
MSSDWVEAIADGKKFYYHKVTRVSVWEIPEDLKSPAPSSNDSNSNNQPVI
IGDWKEYKTDKGQKYYYNTISGVRQWDAPPEFQQKLASTTTSTSTSSPQL
SSSGSTTITTPIQPITTSATPQQPQQVNSNNNSNNNNNNKDLKESKDSNI
NTNNLDTQQQQQQQQQQQQQQQQQQNKEDPIQTFKNLLTDNSISSICTFE
KALKSIANDERYQVLKTMSERKQVFLDYQVDRKKVEQEEKRKKEKKAKED
FIQLLRDSKEVTPLMSWRRASLYFESEPRWEAIESERERESLLHDHIQEL
EQQEKNQLMSIKKEQMKILRQKLELDPSITVFTQWRKVRDQFENDDVFQV
LDKFDFLTVFENFIRDLEKKLDDQKRLEKEKLKKDSRKDRDNFRELLNEK
FKNGELHALTKWKIFKLNNENHQSFINLSQKSIGSTPLELFSDFKDELEI
KYENDYKKLKEILKETNFKYSPESTTLESLKSEFSKHSNYNLIQEFNFLP
YLEYLKYKEESREKNLAKKKKKRISQFKILLTETKVINKSSQWSDIQPII
ESKKEYIDLGDDQERLRIFKDYIEFLVQNALDEEEDGNEEGELVLSPKKP
SNDQSSSKKRRSYIDNLDDEDRYGTGSGSGSGSGGGSGGSSSGGGSGRDS
RDSRDRDRGSDRGDRRDDRDRGRSSHKKEKR
>DDB_G0274229 prpf8, central component of the U4U6-U5 snRNP complex contains the PRO8NT PROCN PRO C-terminal and Mov34MPNPAD-1 domains found in pre-mRNA splicing factors of the PRO8 family
MDDTNSNINQSNESQHLEEKAKKWIQLNNKKYSEKRKFGAVEIRKEDMPP
EHLRKIIKDHGDMSNRRFRDDKRVYLGALKYMPHAILKLLENIPMPWEQV
KYVKVLYHLSGAITFVNEIPFVIEPIYIAQWATMWVTMRREKRDRTHFRR
MKFPLFDDEEPPLDYSDNILDNEVEDPIQMELDENDDSEVIDWLYDSKPL
VNTKFVNGSSYRKWRLNLPIMSTLFRLASPLLSDLTDSNYFYLFDDNSFF
TSKALNMAIPGGPKFEPLFRDVDDDDEDWNEFNDINKVIIRNKIRTEYKI
AFPYLYNSRPRKVKTPTYHTPNNCYIKNDSPDLPGFYFGAALNPIPSYKT
SGNKNEQSEYGTEDDEFQLPEEIETILSKTEIEHDNLANGIQLYWAPRPF
SLRSGTTRRAEDIPLVKSWYKEHCPSEHPVKVRVSYQKLLKCHVLNKLHH
RKPKAQTKRNLFKSLKATKFFQSTEIDWVEAGLQVCRQGYNMLNLLIHRK
NLNYLHLDYNFYLKPIKTLTTKERKKSRFGNAFHLCREILRLTKLVVDVH
VKFRLGDADAFQLADAIQYLFSHLGLLTGMYKYKYRLMRQIRMCKDLKHL
IYYRFNTGAVGKGPGCGFWAPMWRVWLFFLRGIVPLLERWLGNLLARQFE
GRQTKGMAKTVTKQRVESHFDYELRAAVMHDILDMMPEGIKANKSRIILQ
HLSEAWRCWKSNIPWKVPGLPIPIENMILRYVKSKADWWTNIAHYNRERI
KRGATIDKTASKKNLGRLTRLWLKAEQERQHNYLKDGPYVSAEEAVAIYT
TTVHWLEKRRFSAIPFPQTSYKHDIKILTLALERLKEAYSVKSRLNQSQR
EELSLVEQAYDNPHDALARIKRHLLTQRTFKEVGIEFMDMYTHLVPIYDV
DPFEKITDAYLDQYLWYEADKRQLFPNWVKPSDNEPPPVLIHKWCQGINN
LDQVWETSQGECVVLLETQFSKVYEKMDLTLMNRLLRLIVDQNIADYMSG
KNNVVINYKDMNHTNSYGLIRGLQFASFIFQYYGLVLDLLVLGLERASAL
AGPPNLPNSFLTFPSVQTETAHPIRLYSRYVDRIHVLYKFTADEARKLIQ
KYMSEHPDPNNENVVGYNNKKCWPRDCRMRLMKHDVNLGRAVFWQIKNRL
PRSLTTIDWEDSFVSVYSKDNPNLLMNMAGFDIRILPKCRTPLDQLAPKD
AVWSLQNVNTKERTAQAFLRVDTESQERFENRIRMILMASGSTTFTKIVN
KWNTALIGLMTYYREAVVTTREMLDILVRCENKIQTRVKIGLNSKMPNRF
PPVVFYTPKELGGLGMLSMGHVLIPQSDLKYSKQTDTGITHFTSGMSHDE
DQLIPNLYRYIQPWEQEIKDSQRVWAEYAIKYEEAKSQNKNLTLEDLEDS
WDRGIPRINTLFQKSRHTLAYDKGWRVRTDWKQYQVLKNNPFWWTNQRHD
GKLWNLNNYRTDIIQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEE
SMKYKKLTHAQRSGLNQIPNRRFTLWWSPTINRKNVYVGFQVQLDLTGIF
MHGKIPTLKISLIQIFRAHLWQKIHESLVMDLCQVFDQELDNLEISVVNK
EAIHPRKSYKMNSSCADILLRATHKWQVSRPSLLNDNRDTYDNTTTQYWL
DVQLKWGDFDSHDIERYSRAKFLDYTTDSMSLYPSPTGCLIGLDLAYNIY
SSFGNWFLGVKPLVQKAMAKILKSNPALYVLRERIRKGLQLYSSEPTEPY
LSSQNFGELFSNKIMWFVDDSNVYRVTIHKTFEGNLTTKPINGAIFIFNP
RTGQLFLKIIHTDVWLGQKRLGQLAKWKTAEEVAALIRSLPVEEQPKQII
ATRKGMMDPLEVHLLDFPNIVIQGSELQLPFQACLKVEKFGDLILKATEP
KMVLFNIYDDWLSTIHSYTAFLRLILILRALHVNLERTKIILKPNKNVIT
QPHHIWPTLTEQEWLTVEGSLKDLILADFGKRNNVNVASLTQSEIRDIIL
GMEISAPSQQREDQIAEIEKQKTEASHLTAVTVRSTNIHGEEIITTATSP
HEQKVFSSKTDWRVRAISATNLHLRTNQIYVNSDNAKETGGFTYVFPKNI
LKKFITIADLRTQIMGYCYGISPPDNPSVKEIRCIVMPPQWGTPVHVTVP
NQLPEHEYLKDLEPLGWIHTQPTELPQLSPQDVITHSKIMSDNKSWDGEK
TVIISVSVAWPCTLTAYHLTPSGFEWGKNNKDSLNYQGYQPQFYEKVQML
LSDRFLGFYMVPDRGSWNYNFMGVKHSTNMTYGLKLDYPKNFYDESHRPA
HFQNWTQMAPSANDDEENQPENENLFE
>DDB_G0282803 rcl1, 
MLKFQGSTHFRQRIICSTLSGKAIRITNIRDEDEKPGLRDYEASFLRLVD
KITNGSKIEINSTGTQITYIPGIIIGGKGITHECGTVRGIGYFVEALICL
GPFAKAPLDITLNGITNNDIDLTIDTIRTTTLPIIRKFGIEEGLIIKIIK
RGAPPNGGGSVNFKCPIVPHLKAIQLIDEGKIRRIRGIAYATRISPQFSN
RVLDKAKGLLLEYTPDVYISSDHYKGNESGLSPGYGLTLVAETTTGCCLS
AECMSNTGIATTEQQLQKQKSTSETPEDLGERTAFALLEEIFNGGCIDSH
NQSLALLFMVLCPEDISKVRLGKITPYTIEFIRQLRDFFGVTFKIEPDQN
SKTVLFTCLGIGYKNMARSTF
>DDB_G0273355 rexo2-1, similar to H. sapiens REXO2 and S. cerevisiae REX2 a mitochondrial 3'-5' RNA exonuclease there is a second copy of this gene 
MSTTPTYNHPVINERSKRMVWVDLEMTGLDISKDVILEMAIVITDAELNV
IEKGPNLVIHRSDEVLKNMNDWCIEHHGKSGLTEDVRNSKISLEEAEKIM
LEFVRKHTDKGICPLAGNTVHEDRRFLLKEMPTFAEHLHYRIIDVSTIKE
LSRRWYPYIPSPKKVCGHRALQDIEESIEELKSYRVTVFK
>DDB_G0273741 rexo2-2, similar to H. sapiens REXO2 and S. cerevisiae REX2 a mitochondrial 3'-5' RNA exonuclease there is a second copy of this gene 
MSTTPTYNHPVINERSKRMVWVDLEMTGLDISKDVILEMAIVITDAELNV
IEKGPNLVIHRSDEVLKNMNDWCIEHHGKSGLTEDVRNSKISLEEAEKIM
LEFVRKHTDKGICPLAGNTVHEDRRFLLKEMPTFAEHLHYRIIDVSTIKE
LSRRWYPYIPSPKKVCGHRALQDIEESIEELKSYRVTVFK
>DDB_G0276159 rtc1, ortholog of RTC1 which catalyzes the conversion of 3'-phosphate to a 2'3'-cyclic phosphodiester at the end of RNA
MGKNKNYNKNQFKKSKTNNDTTVAQQQQTIEEKPDFKIDGSILEGGGQIL
RNSVALASLFNKAISIEKIRYNRDQPGLKNQHKAGIDLMSRLFKAHLTGC
SVGSCKLYYQPTQKTIQDDGVIEADTKTAGSICLMIQVSLPCLIFAPHST
KMVLGGGTNCDFAPAADYIQNVFLPIATTMGFKCEMSIDKRGFYPKGGGA
VTLTTQPLTQPLSPITIVNKGEVNRIVIKSYFTSPRISPLVAERMNNTAK
KLIKKDFKKVDVETELIDVSKFSFGDGTFIEIRAYTDQGCIFGATGNGAI
GVPAEKVAEDAANSLLKDLQDGGCMDEYLQDQLIIFMALAKGKSQIKTGP
ISLHTQTSIHITSLMTGAIFTITPLTNNTQSGEETNLITCEGISYFPSDL
NNNNNNSNSNTTTTTTTTTISTTTIDNQNSEEK
>DDB_G0293554 sf1, ortholog of the conserved splicing factor 1 binds to the intron branch point sequence (BPS) of the pre-mRNA necessary for the ATP-dependent first step of spliceosome assembly
MSPNDVEQQQLPTQNNNYNDNINSPSLNDDDEDSFFREIKEISRGRPKTR
DEIQISDRTRLSRWDTPLTNDGVSPFSSIFKTLPPGLTDEQIAALILRLR
VDEITKKITIGPIEFTERDRERSPSPPPTYDNNGKRSNTREQRIKEKLQK
ERHQLVVTAQQINPTYKPPSDYQPPNEKKTRKIYIPIKNHPEYNFIGLII
GPRGNTQKRMEKESGAKIAIRGKGSSRDGKPTKLQFQENDELHVLLTADT
VDQLDKAEVLVREFLIPVEEGKNEHKRQQLRELAEMNGTLRERPAYMGNR
SWTPVDIKCVQCGETSHPSSDCPLRSNESNQQYIESEYQKFIDEMSKSLG
FDISISPNQNDNSLQNINLNNNNNNSNGNNNGNNNGRNFNNDTVGDMDES
PPHHTQSHFQQNSPQFDQQQHQQQWNNNNNNNNNNNHNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNFSNYNNNNQSFYNNNNSPYGPPNGGSPYGPPRNT
Y
>DDB_G0293876 sf3a2, 
MSEYGKAGSGGLQSSQYDNIDRRERQKQLVLEHVDVSKDPYIISNHIGSF
ECRLCLTVHNNVGNYLAHTQGKKHQTHLARRAAKEQRENPSVSKNNYIQT
TRVIHKKTIKIGRPGYKIIKQRDSKTGQLSLLFQIDYPEIESGLQPRHRI
MSAFEQRVEQPNKDYQYLLFAAEPYETIAFKIPNKEIDRTTGPDGKFFTH
WDRNKTFTLQLYFKE
>DDB_G0270020 sf3a3, subunit of the splicing factor SF3A required for spliceosome assembly contains PRP9 domain characteristic of splicing factor 3A subunit 3 expressed in pstO cells
MSSSLLEKTRNLHENFERYELLIENEMKTEPKTTKERVLQSHRVNHYLNS
SIECSKSLINIYTDSDHSRKDELTSISGFGTDLYSSFYEKLREIKDYHRK
FPNLKEERNNEPLIFTPSISFTGNEMNGKFLDLNENYEKYINLSFNRNKS
INLDYLTYLTSYYKFQYNDINRMKSPQYKDYLESVYKYLIHFIERTQPLF
ELQSSITKSENEFIEKWNNNEFDPIENNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNSNVNSNNNQDDKLFCKACKKLFTSENVFNGHLKG
KKHIQNEEKMNTNKDNKDGDSNNNNNNNNEKWYNKLKSRKDNTMFEYKIN
RLSEYLSDQIESTKENVLKKQSRSYTEVVDGVMVGGVGDDEEEDEEVNID
DLEVDVEVSKLKIANYPVDWSGKPIPYWVYRYLELGVEYKCEICGNQSYW
GRKAYEKHFQETRHSYGMSSIGVPNTTHFHEITKIKDALELWSKIKNQTN
QQQFKSDRDEEYEDETGNVMSKKNYDLLVKQGIINPNQKKRSHY
>DDB_G0275957 sf3b1, 
MSDQQDQTMSEWDDTTLNKAKVVEATPRRNRWDETPVSKPSTGVEETPKR
RSRWDETPININSGGLSGGVTPNYNAMSNGGVTPIFNNMMDGGVTPVYNS
NNNNNSNSNGGSNNNKNILMQTPDPYQAQLQKEIDERNRPWTDEELDNIL
PSEGYEILQPPANYQPVIASKKLTASTPIGAAGTSGGFFIQEEQSRGQDF
GIIDAPDGITIKPEDKVYFEKILQEGGDNDEHLSPEEQKERRIMKLLLRI
KNGTPPMRKQALRQLTDKAREFGPAPLFNQILPLFTSTSLEDQERHLLVK
VIDRILYKLDDLVRPYVRKILSVIEPFLIDQNYYARVEAREIISNLSKAA
GLASMTSTMRPDIDSPEEDIRNTTARAFAVVASALGIPSLMPFLKAVCKS
KKSWQARHTGIKIVQQIAILMGCAILPHLKNLVVIVEHGLTDEQPKVRTI
TALAISALAEAATPYGIESFDSVLKPLWYGIRQYREKGLAAFLKAIGYII
PLMESSYASYYTKEVMTILVREFKTNEDEMKKIVLKVVKQCVATEGVESS
YVREEIIPEFFKQFWVRRMALDKRNYKLLVETTLEIANKVGGGEIIERIV
DDLKDESEAYRRMVMEAIEKIVSTLGASDISPTLEERLIDGILYAFQEQT
TDETSIMLQGFGTVVLALNTRIQPYLQQIAGTIKWRLNNKSAKVRQQAAD
LISRIAVVMMNCGEEQLLSHLGQILYEYLGEEYPEVLGSILGALKAIVNV
IGMTKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRIADRGSDFVSDR
EGMRICFELLDMLKAHKKGIRRAAVNTFGYIAKAIGPQEVLATLLNNLKV
QDRQNRVCTTVAIAIVAETCAPYTVLPGLINEYRIPELNVQNGVLKSLSF
LFEYIGEMGKDYIYAVTTLLEDALMDRDAVHRQTACSAIKHISLGVMGLG
CEDSLTHLLNYVWPNVFETSPHVINAFLEAVEGLRFALGPNTILQYTLQG
LFHPSRKVRNIYWKLYNMLYISSQDALTPCYPRTLDENDNKYQRYELDFV
I
>DDB_G0284555 sf3b2, 
MDTTLTETIQNNNNINKSINNKKKLKHQKKKEQKKKQKQQKKEENIFQQT
NNNNIKEEIKLENNDENNDENNNNNDDDKITAPIDGFSIDENDPSFELFS
KLVKHFDNPYEDPKEIERKEQEEQEEKERKEREEEEERKKKEDNEDDDDD
NNDDDDNNNDNEDEDSKKLSNKERKRQRKLHLPILKQLVDRPDVVELHDV
NSPNPGYLIAMKSYRNTIPVPAHWCQKKKYLQGKRGFVKPPFELPSFIAA
TGITKIREAILEKEKEMKSKQKQRERVQPKIRKMGIDYEVLRDAFFVHQT
KPNLSIQGDLYYEGKEFEVNLKNKKPGVLSDELKRALGMIEGYPPPWLIY
MQTYGPPPSYPNLKIPGVNSPIPEGAQYGFHPGGWGRPVLNEFGKPLYEN
VNNNNNNINNNGDQQQQQQQSHPTREYWGELLPESEDFQEEEEQQEQQGT
EEDELQQHQLEDDESIGDGISSVPSGLETPDIVNIKKSRYDQQQQQQQQQ
PRELYQVIEQQNKNSSSGGLMESAHRYNIPSVIKQQQQQQQQQNSSRVDV
IKSQRSAPVEITLNPSEVENGQEIDEELLKKKYEQATQALQKQRPKEDIS
DIIEEQNKKRKNQLQKEEKQKKFKF
>DDB_G0276137 xrn1, 
MGVPRFFRWVSERYPQILQKILDSNPPEYDNLYLDMNGIIHACSQEFANS
LIEFSEEELIRQVCNYVDRLFHTIRPTKLFYMAIDGVAPRSKINQQRQRR
FLSVHRDEKLKQKLISEGKPVPEVIFNRTAITPGTQFMYNLSESIQFYIK
KKISEDLSWREVRVIFSGPENPGEGEHKIIDYIRKNKASPDWDANQSHCL
YGLDADLILLGLITHEPNFSILREEISFKPTKRQLDFQLLHISLLREYLD
LELRNDDLEFGYDLERIIDDFILIMIFFGNDFLPHLPFLEISKGGLNSIF
ELYKSSLPSLGGYLTEGATIDLERLQHFFKFLQKFEKKQQQGIMGSTEED
LDDKKVEVAELVEDSVLEHDGLDEEAKKEFERLAMERLKSHFPVSDSEDG
EEDPDEQLVNELYNVENSYYRQYFNEFPNTLDEIKAFKEKVVLSYVEGLV
WVLNYYHNGCISWVWFYPFYYAPLAIDFNNIPDLHIDFQPGEPITPFQQL
LSVLPPQSVDLIPKSYQTLMLDLFSPIIDFYPVEFEIDTKDPHYFDGIAE
LGFIDHQRLLDATASIKKSLAQSGQKVFTDEEEARNSIKNAVIIYHDADV
DQFVKSPNSNVFKDIEHSSATTEDIVLPTFDNPLPHFRYCADQVLTGVQC
PSGFPTFKSLQFTWKYQNSVIDVWGMMSRKDSLIVVPPHQSKQYDCNNID
EFSKLKSLIGKKCYINWPYHQEGKILYFSCSNRKLFSKGTTDNSTPQKLA
FLDHVKKTKLSLLRKGINIYYDNEKDDGKQQSNGSSTGYTASKLGYEDTN
TILVHINKLVGVQTMPNGSTKKRYSDEEDVYPIELMVDYDLIATDSRFEE
IDELPFEKRFPIGKKVLVTKKQYFGTIGTVINHYDNQLQLEIKVPSVKMD
MNFGHQVAKKEVEYFPIQHVAKLVGTTVSSISQLTAGLFIFKPMVDIGLN
LKFTGRQQQVLGYCRGYDVNRGGNIFHQWEFSQEAINLITEYFNKFPLLH
QILALFSKPDSNIGVGGAKMSKNVDITPLFATKEEKAEFLQGVEEFIEKS
GIRKKRVVPCGTDSLGKEGIEKIEKYYYDQTHQTEYTIQEIHCSTTDIVE
PPSYESVISMERHHLIKGLQPKQDLNNSQNGVKSPTLSSQNYSFQSFANT
GKFHLGDRVVSILDKGNLPFGTFGTVASIQDQKVDVVFDTECFAGNSLDG
YCSEKRGICISKLRLYNLSCPPPPPKSTINKFYDQSIDPAEYWEKVQTQQ
NNNHNHGQKKIYSNGGQKLNQQIDQQTPITNAVENKELNWQQLQLINNIS
NPQQHNANNNNNNNNYNNNNNNHHHGQNHNQNKVNQHPLAINNPNSVNYP
MKRKPTYVKQNFEQQEYADLSKNYPNLEYNFYYDQNDQRKQPQQLQQPKP
QQQPQPQPQPQPKQPKQPKQPKQSKQPPQQPPQQPQEPIDPEKLRQQTRT
NKRLNLIYQNIEQNSFPGSQSSEHNNGDGSSEEQVQTNPNALLLLNNMFA
STSISSDQTNDPDGLPQGPPPHMMGHYPPGPPPMMGYPPHYHPGHSYPPP
PPHMMGNYPPGPPPPHMMGYPPHYHPGHPYPPHHPGQQEEHHHHQQQQQE
QQQHPTQQEQPNQHPKKKQPKQPKLPKQPNQNQTQPTQDGQQQQPPKQPK
QPNPNQTPKQPAQPKQTKQPAQPKQPAQPKQAAQTKQPAQPKQPAQPKQA
AQTKQPKQPKQSKQPAQQTSPTTPNPTTETNLNPTIETTSTPPTPTTAQ
>DDB_G0269922 xrn2, 
MGIPAFFRWLIDKYGGLIQETTEPREADGGRSVVDFTTPNPNGEYDNLYL
DMNGIIHPCAHPEKGPKPKSIEDMIQSIYEYLDLLFAIIRPRKLIYMAVD
GVAPRAKMNQQRTRRFRAALDSRLDKDKEAALWRERIYDGLATQQEYEQY
MEEKKNKFKFDSNCITPGTLFMDRVAESLRTYVAEKLTTDPAWKDVKIII
SDASVPGEGEHKIMDYVRHQRAQPDYDPNLKHIIYGLDADLIMLGLATHE
VNFDILREFIQPIARGVCHKCHKKGHLAIECKEEVDDSVKDFLVKNYQIL
HLHLLKEYLELETKVSTPFGFDIDRIVDDFIFLCFFVGNDFLPHLPNLEI
KDGAIDRVIKCYKELLPSFDDYLVSNGEVNFPRLSQIFVALTKGEEESFQ
RKIQKDAQILKKRQNLSNIVSRPNSLNLNDKTSLSHKEAADKFLAEILKP
VEETTQDEEKEEDSRPTKKSKNSSASSTTTTTVKSNKKAAELIKNKLIGG
GGGKEKDKEETEQDDEQSASSKKSKKRGLKVIELDDDQQQQLNIVAAENE
KKLLNKKSKQQKAAQIQEEADEKEHQENDKPSLFVNSYDDSSLPTEKEIF
FDNSRNIKYSEEGWRDRYYQSCFQVENEDDIKQICKSYVEGLVWVLKYYF
KGCSSWGWYYPYHYSPYITDISKFFDQFEYPQYEMGEPFKPFNQLMSVLP
PASCQFVPKPYQKLMGISVDGEPVEESPIIEFYPRAFRIDRGPTEPLYKG
VCLLPFINSIKLLKTISKTEPLLTEEEVDRNTLGHDLMFCHKDSNINSAF
KEGSVTQLPENSSKIYGTIEDVSALAKKKMPPLMDAVAFKYNNSSVPNGY
SFNYSTLKGSIIPKKSITLFKPRTQNAAINRMVNNSLGGSNQYKDNQQFN
GGFGNVGSGGYNNKQIGYNRFNNQNYNNNRYNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNYNNYNNYNNNNNYKNNNYNNNGNYNGNNSNNYNNNNNYN
NSNYNNYNNSYNNGNNYNNNNNNGNGYNSNYNNNYNNNYNNGNNNGNNYN
NNYNNNYNNGNNNGFNNNNNNNYNNNNYGGYDNNNGFNNNNNNNNNNNNN
NSYNYDFNNLNDPSLIDINNYGGGGNNTDLGTLVASNNNNNNINRNQQQQ
QPQQQQQQQQKNKFNNNSNTKNQPPAKVPMKYNPFAKSKK


# Dictyostelium fasciculatum, SH3

>DFA_12499 g1038, 
MNAIRAQLDELFGKDRNLAPSERAKRQPHFSDDEICKFYICGLCPNELFV
TDPCTKLHDEDCLKLYQDCKDKEQYGYERAWAREMDQIILDNDKKVKKNK
ERLILDAAKLEAEEGLNPSDAAKEFAKIMAEIEERIQALLKKSEELGEEG
QITEAQELMAQAEKLKEEKLELEKAEELKDHNKKMSVCDICGALLFVGDK
EKRSQSHLEGKKHVGFARLRSHMEEHYQSKKSNYQENRGNFNNHHQNNNN
NNNNNYNNNNNNNNGHNNNNNYNNNNNNRDNYQRRNYNNNNNNNYNNNGG
GAYRNNNDRGDRDTRSGGGYSRNDHRGGDYNNRNNNNNNNNGNYHKRERE
YKDRDQPSDRERDNYSRR
>DFA_11037 g10619, 
MSSSPPVTGNFINNMVIDTTHAAPFNNSGGLPNASSTSTSTSTNNTSYLY
HKQLHMPTGIEFCIKGNVTSKDTVNLIVAKTTVIQIYSIKYEKNEQTEHQ
QQQKQHNSEESDMSFLDSSDDDDDNQKDNSNNNNNNNNNNNNMTDQQKPW
LELNYEKNLFGVIESLNCVRFPNEDRDCIILTFKDAKLSILSFNQNTQQL
DIHSMHYYERNEFKSGRETFRSPPILKMDYQQRCAVMLLYDRHLVVLPFR
HTMSSILDEEDEYEEEHMTKDSNNNNNNNNVNGKSTESSSSSFTSKGNLK
DSYIISLKSIGVENVKDLTFLHGYYEPTLLILHEPSQTWTARIAVKKLTS
CLTAVTLNLSQKQQSIIWSFDHMPYNCEKLLSVPEPLGGSLVITPNIMFY
VSQSSRYALATNSYALIDTTSPTGEPFAIPIDNSSTNLVFTMDCAVYSFL
EKDRLLVSLKNGDLLIFHLISDGRSVQRINITKAGKSVLSSNICVLSSTL
LFLGSRLGDSLLLEYTEKVIDVDTNNVTENLSNPYKKQKTSEVLFGEDDN
PSTSIAEEEDDEADAIFNRSRYQTKSYQLTIRDHITNIGPISDMITGISY
ESNKGENEENEVDHTSSRNGKSSGNAVNLASIADRDYELVTCSGHGKNGA
ITILQKNVRPDMIISFELSGVKQAWTLYYDPEQSSSKNKKRHLEIGQEQE
NEEEEEDINWHSYLVLSQEKNTLIFHTAQELKEVATLSNPTVLAANLFNN
KRLVFIHPTSIKVMNGHSSVTQEIKQSPIKYAYVCDPYILLHHQDGNISL
YKGNPKDLTISKLDIPFNSSKTTSIMTCSLFIDLNPISNWWTFKKSNNQA
GDIYLLILDSNYTLNIYLASDPTTPIFTTNHFNKELDILLNYNNNQQQQQ
QQNIQTNNLNFKEISIHYLSDSIWSVPYLVAINEYDQIYLYRGFKNEKNE
ISFKKIVNDVYQELPLEQPMPTTTTSNNNNVKEKKKSSSSSSSSSTATSS
STPSSTTIINFNVEMNRRRIIPFSNIGNKRGIFVSGVSTPIWIFSEKNFP
RIHPMKQQQQTTSSSSSSSSSSKRPITTFTTFHNINCKHGFIYFDHTGML
CICRLPDGTNYENEWPIRKLAIRMTCHKISYHPVQKCYVLVLSYPQAPQS
DEDEQEEQERELLKKPLVLEEKYQLKLIDPANNWNIIDSFSLAEKETVLC
SKIIYLRHADESDIIPKLKPFVIVGTAYTHGEDTVCKGRILIFEIVSHRN
QFNTATSKSTTTTTTTTTATTNTEENNNQEQQQDKEEQDEEGKPKKDTES
ADEKPKVEEEEEEEEKEQDESPDQKIETEELQPQLQKRLNLLYEKDQKGP
VTSIAGLNGLLIMSIGPKMIVNNFSSGSLIGLAFYDTQIFIVSLNTVKNY
ILVGDMFKSISFFKLKQKKNIILLGKDYEEVSTYSSDFIVDEKKLSMVLS
DANRNIRMFSFDPSDPESRAGQMLLAKSSFHIGELNNKFVRIPMKNTNYD
NNSSSSSIIVNDKHLLFYGTLGGGINLLMPINKRFHEILHALETKLMHRG
QTAGLNPRGFRYGHHVNNTLGHLHNQYVVDGDLLTKFQSLSPDDAKQLAT
SIGSTTPIILDLLNQLHQSYNWVFKIFFFVNITIIVYYQFRLSDAMDLLT
QLQDKLDHLFLVFGTCIGVLQRDAPPSSFSEVMNNQPPQLTQEQLTNADN
WNSQTKHMALQVIETTKLIESIIESLPGFQRTENEQYQRLKALNHESKLL
QQQLQDKENENVIMLQQVKEAIRMLSDETNSRSSTNNNKDHDKMDL
>DFA_11745 g11321, 
MGVPRFFRWASERYPQIIQNLVDSNPPEYDNLYLDMNGIIHACSQEMTTK
LIRFSEEELIRLVCNYIDKLFHIIRPTKLLYMAIDGVAPRSKLNQQRQRR
FLSVFREEKEKKELIKEGKELPEVIFSRNAITPGTEFMSNLSECLQFFIK
KKISEDLSWREIEIIFSGPENPGEGEHKIIDYIRKYKASPDWDPNQSHCL
YGLDADLILLALVTHEPHFSILREEIAFRPQANKQLDFQLLHISLLREYM
ELELKCELDFGYSLERIIDDFVLVMIFFGNDFLPHLPFCEISTGGLNSVL
ELYKNSLNELGGYLTDEAEIDLDRLAVFLAKIAVFERKQNLTVGEAEENE
VVEDMLLENDVQDDEEKKEIERRAFERLKNHFGEITFEDDDVEDINDYWI
NGYYRSKFPDFPEENRSAIVDYKRHLVLKYVEGLSWVLNYYHNGCISWRW
YYPYYYAPLAVDMRDLSSLEIEFESNGPVTPFQQLMSVLPPQSAHLLPAP
YQELMTSAASPIIDFYPTEFEVDTSDSHYFDGIAVIGFPDLGRLLEATAA
EDSWDLTDKERSRNALRNAVIIYHDAEQVVSEPSPNLRLFPSLEHSSAKT
EDFILPFYEGLKPFRLCEGVLLGSHSPSGFPTFAVDVDFTWEYKNAVVNI
WGMKSRKESIIVHPPIPQLNKDKTAKQMTLASIKHWIGRRCYVNWPYNTE
ALIVGFSDSNQKISMLENVSSTNGTVRLTTQDYLAVQKVSYLDQLKKIPM
DYLTKGIDVSEVTSNTILVHVRKIAGIDVEFGGRTVKRYTEKEYQLPIQL
MVEYDKVRADSRYLETEEIPFATRFPIGKQVLYTNPDHFGAIGKVLGHAD
ESTETLDLELKVGQSGPDLHFGHRVAKEETDEYFPIQHVCKATGLTNQQL
SLLTGGLFIDKPMTDIGLNMKFTGRNQQLMGYCRGTTLGDRNGGTYKKWE
FSNDAIQLIMDYLKTFPIIQKILVLVSQPSDDKSSSSGGGIRAIDISSLI
SEKTDRVALVKSIEEYFDKTGIRRKRYVPCDSLSLSKKSIKRIEDHYGRL
AAAAILIPMRTRTQSDKVIEPASYESVISYEKQHVIQKAHEQQLQKERKY
GGGSGTSSPGTSSPSKPKQHERQFRLGDRVITMLDKGNVPFGLYGTVVSI
QDQKVDVVLDRECFSGNNLEGFCSEKRGLFISKWRLYNLSTPDANYRRAT
KSEKEGGYRLGSFPEYDSNEYWNNLKMTNDQDHGVVKIKVNHQANTNMVQ
KVQQQTQNLNWQQQQEVYRQNQRKTYHTRQDQFVKTNWETEEDHATYRKT
FPNVQTQQLNWQQLEQQQKSAAKQQQKGEKAGKGGKKGPKEQPQPPQQTS
GQTPELLQKIFDSSKIVPSTNGNNTAAGESSSSVQQPSQPQQPQQPPQPL
MGLIYNSLESQKDGPADHQATPPPPPYGYPPGQMHHPMAPPPFGFHPMAP
PPYGFPPMQPGQMHPMPYHPHQQHQQQQQQQGQQPRQPRQPNPRYQKKDK
PQNNNNNNNNNDTNNNNNHQQPKSPHHQKQPKSQSPQQTNTNTNTNNTNV
HTLKDVKKSSPHPKKDKVWIAKPKTSPTNSPPSTEASSPVAPSQPASTNN
NASTSESNN
>DFA_11830 g11393, 
MGIPAFFRWLVDKYGNVITPTKEPRDSDGSRLKCDFSELNVNGEFDNLYL
DMNGIIHPCAHPEKGPKPRNTQDMMDSIVEYLDLLFAIIRPRKLIYMAID
GVAPRAKMNQQRARRFRAALDSRITKEQAARDLIERLNNGSLSQEDYDAI
QKDGAEKYHFDSNCITPGTEFMALVALTLRQYVAEKISTDPAWKDVKVII
SDASVPGEGEHKIMEYIRHQRSQPDYNPNLKHVMYGLDADLIMLALSTHE
VNFDILREFIAPPKGNRFGTPAPTPSKDVDEDEVKDFLVKDYQLLNLAIL
REYLDTELKCNPPFEYNVERIIDDFIFICFFVGNDFLPHLPSLQINEGAI
DRLMRIYKELLPTFEGYLTDNGEIDLDRLRKVFVRLSREEEDILLRRKKK
EERFAQGRNNRFQQTARTDITGTASSQQLNTQVSERHKQAAASILSDIFQ
PVAADTEENNPKKKVKTDHLSNKDAAQQLRENLAKATSKGTKDQLDATNR
EAAALLQQKMSKYNEKKRVVKVIDIQQTPEAAETKDDNTKKRKKKETEDK
SAAEDQDDSLEKKLFIHKMKDVNIGSEGWRTRYYEHHFETEQPQNELVRA
ICQSYVDGLAWVLRYYFHGCCSWGWYYPYHYAPFILDLAENGSMIVEPQF
ELGAPFRPFQQLMSVLPKASGQFVPKPYRTMMGISIDGEDRNDVILHFYP
NEFLIDVAPGQPTWKGVCQLPFIDENELLSALKPLDNSLTEDEAFRNSHG
TDLIISHNSTSIGSTTDLPRNLEKPDQILGTISPVIPGVEKRLPPLLSAK
VFTYKNPELVNESTCGILPGAILQKVNLEQYRTRGIQNSSANRMINHELG
SNQYKNRNQQNGGNFNSNFNNNNFNNNNNTSNQNYNNNGNNRNYNNNQNN
YNNNNQNYNNQNNNQNYNNQNYNNNNQNYNNNNWNQNQNNMGGNNWNQNQ
NYNNNNYNNNQNYNNNMGGNNWNNQNNNMNNWNNNNNNMGYNNNNMGGYI
NNNNNYNNNNQQQQQGIVNMSVEQKQTVLNNMMLMMKQYNEGANNMTPQE
YNQMSTMMNMLLQQQQQDLQLVQNQNQFNVGRQNNNNDNHHNKKFNNNNN
NQGGYNNQGGNKNYNSNQGGNKNYNNNNNNNNYNNNNNQQQQGGNKMKYN
PFAKSKK
>DFA_01263 g1264, 
MDRQDISSFIGGTDLAAAPIPTVIFNLPKEHELRFEVEHGETALIKLIEG
NAEYFGTELLLNREYKLTGCKGAVFTWNSCKLEVSQSTKAYIANETPMMQ
YARVHKIMDDIRISCLSNRESGPKVIIVGPTDVGKSSISKILLGYSTRLG
YAPTFVDLDPGQGSITIPGAVCASLVDKPVDIEDGLTNSLPFVQYYGHTS
LDANPTLFKALVSSLATSIEKRMETNEQARVSGVIINTCGWIDGLGYEIL
IDSIDIFKANLIVVMDNDKLYSELNKKYTGAIKVIKLPKSGGVYLRSPIF
RKKTRMSKIREYFYGISGDLCPHFTILDFKDIVVLKTGGGPAAPSSALPI
GAQSVIDPLQLQEVTPSTDMIHSILAVSYTKSKQSILKSNIAGFLYVTEV
NLETKKMTVLAPCPGLIPSKFLLMGTLKWLE
>DFA_01671 g1656, 
MSQKEKGGKKPYASFNNNNNNNNNNNNNGGKLNNSGGYNNKQSSPSNNNN
KSSPSNTNNINNNNNNNNNNEDLEVKKMKDRSLYMALCLVGYQVSVTLKN
GVTYEGLLSSATTTSGAGWGIVIKMARKKEVPPPAIITTPPTPLMVIESK
DFLCLTATGVVFDNLSTYQQGSGRGSASFQSDTDISGHDGVVRERELTPW
MSDGSEHESLEASALNKQNASWDQFSTNEKLFGVTSSYDEDLYTTSLDRQ
SDSYRNRLRDAERIAQEIEGKTSSNLHMQEERGQVKGSDYDEEERYSSVV
RQPDAKAGAKQQAGGNLSSSAGVYVPPNKRNQQVQQQTPQQTQSAPVTPL
TKSSSSTSLKDENNTKPAVATAGAAAAVGTSTTSTTTSSTPTKDEKLTQQ
QQIFKESESNSANSLKTSGNGINSDNSPVTKLRFPVRERTNSIDHNDLVS
SPRDGQSPRTLANYTKVRAAIVSEKMRNSEPRSPLCSPLVSDPVGLSALS
LHSSKPTFTENTIKEFNEFKLIKSEVDRKAQMEQLKSFSRDYNISRSRPS
SPLIGPNSPRLANITALSLSPTNLDNKDDSKTEDVKKEATAATTTTATTT
TATTTTSKLKLNPNAKAFTPGSLSANAPVFTPKGLTLKPAAIPASAPHDF
SGMPGGDIGRSTSNNTDSTTPINELYYESMKKRQQNPENPESVSSYWSDV
PSYRQQYGGGEDEQYGSPAGGYSMRPPIIPMGVVPIPPYYPSPPPMVAPP
QIKSMKMPYNGQPRSYPQHQGVPVASNQPLGPPPYAVFQPQFPPPFAVPA
MYNPPPPQHGVPKRYYPHQNSYQMQPHMMIPPQNNSQSPSPSHQSPQIPS
PTSPTHSRIITSQPAFIPAAYQNYGVPPRYQNDPNQGYPPN
>DFA_01687 g1674, 
MTNIDNNGITTTNGSSSNGTTTTSNTKIDKKKQKLQKKKEQKKRQKQKKY
QDKQQQQNGSHSNSSNNVEIDGDEPPAPVDNMMIEEDPDFVIDESDPTFE
LYNKLLKHFDNPTHQDDDQDVQFTDQEENQDTDQQENDEVEEEEEEEEEE
EEEGKGKSKKLSNRERKKQQRLNLPILKQLVDRPDIVELHDTNSPNPAFL
INLKSCRNSVPVPIHWSQKRKYLQGKRGFVKPPFELPEFIAATGITKIRD
ALLERESQKKTKTKQRERLQPKMRTMNIDYQILRDAFFVHQTKPKLTGQG
DLYYEGKEFEVSIKKNKPGQLSTDLKNALGMLEGYPPPWLIHMQNNGLPP
SYPSLKIPGVNAPIPEGAQYGFHPGGWGNPPMDPSMFAQQHHDKTVRGSL
INQEERERWGQLVPEEEYEDDEEEGDEDDEQEGDDHEDGMPPPPPPLSAQ
QVQDGTMSIPSGGSETPDVVDIRKQQYNNNNNGGGNMLPQFQQQKQLYQV
VEQSSRQLGQGIMESNYKYNLPTNIKMNTPSSSSSSTASASGARKVDLIK
GHKSAPVEVTFAPNEVEDVELDEELLKKKYEQAVSGDKSSQRKEDYGDDD
HKKRKAKSQDDKQKKFKF
>DFA_01698 g1684, 
MGEEDKQSYGNNWGDDDEDDFFRQITEIQSNYDRGRPRTREEISADNRTR
KNKWDVDKNPAVSLPGIPKTIPPGLTDDQLSSLLIRVRIDEITKKLVTGP
IEYDTKEDRSRSPSPVYDNTGKRTNTREQRTRDKLAKERHNLVTNAQQIN
PNFKPPSDYQPIHKKKTMKIYIPVKDHPEYNFIGLIIGPRGNTQKKMEKE
SGAKIAIRGKGSLQDGKVSKPQYAENDDELHVLLTADTQDQLEKAAVLVR
PYLVPVEEGKNEHKRQQLRELAEMNGTLRERPAFIGGKGWSAVDIKCVHC
GEISHPSSDCPLKTNPNANMHLIEAEYLKLLSEIKDIIGLDDNYQYKNQN
INNNNNINNNNNNMNGYNNNNNNNFNNYGHFEPQQQQQQQQQHYNNGNLN
MNNNAYQSPPYGDDQQQQQQQWNNNNNNDMMNHQQQQQLPHHHQQQQQQW
GQKPPQQQSPYGPPGSSPYGPPPTNSSPYGPQSGWQ
>DFA_12550 g1756, 
MIASSRTGNLLNLFSKPLKGVSVSSHTSSISHHRHSTRQFSSLSLSSSSS
TLSYCGNRSNIRLLPSQTVSFYNHHHRYMTSNNNNNQPIVHNPPKRDMSR
DNRLIWVDLEMTGLDITKDRIMEIACIVTDDNLQVIEAGPDLCVYIDDAA
LDGMGKWCKEHHGDSGLTQRCRESKISIQEAEKIMVEFLAKHVHKGMCPL
AGNTVHEDKKFLLKEMPLFAEYLHYRIVDVSTIKELARRWYPNVMEKAPV
KRYLHRSLADIEDSIEEMKFYQKHIFIPKDE
>DFA_02161 g2110, 
MTSNGITSPSALSNGGKATTTTTATAAAVASASATNIALNGSTGKMPIHN
ISSPPISSTSSTAPLQSNGSSNQQQYYGVTEPISLASPTSIDTKQSTELE
TTLRGFNLFEPPEESRLREEVLGKLDAIVKQWAIKVSVLKGFTEQMASEV
IAKIFTYGSYRLGVHASGSDIDTLCVTPKHIMRADFFGTLADVLCVHPEI
TEFTPVKDAFVPVIKMIFCGIPIDLIFARLSIPAIPEDLNDLIDENYLKN
VDDKCIVSLNGCRVTDQILRLVPNVTTFRMALRCIKLWAQRRGVYSNVLG
LLGGVSWALLTARICQLYPNAAPSTIINRFFKIYDGWKWPSPILLCQIQD
GGQLAAKVWNQKRDKSHLMPILTPAYPSMNSTYNVSRSTLSLLKSEFSRG
AEITKKIDSGERKWTDLVEKGDFFTRYRFYLQIDVSAPEEDTHRKWEGWI
ESKLRILISNLEQTPNMKFAIPYAKSFANKASAVNGGICTCFFMGLQFNF
STAIGADKNVDLTGAVTSFTNLIKDWPGKLPTIEMKIHYIKKKNLPVFVK
DEGPEHPPKQKTAAKKRNVSGNLVNNNNNQNNTAAGESTSTTTTTTPTGT
ATPPPSSTDAKKKVKTDHPPSQTTTPTAASSTSSPPSPLATDHVAVVALP
TTTETAAVSNQPNISPQPILPISVSDNNNNISIDGDNVNMNEIKDSNIDE
QLQSPPSEVPTIVAASTTTPTNTTPSTKKADNSSPTEVSELDFISSSSAP
KPDVPKGPQPKKAAISLIRG
>DFA_02244 g2194, 
MGDKTTTTSEWDETPSTKTTAAVAATPRRNRWDETPQKLATSTIEQTPKR
RSRWDETPVTISGGMGGSATPQIMSGGIGATPRFDVSSTPNVLMHAGMMT
PDVHQLRAEKELDERNKPWTDEDLNAALPSEGYEILMPPSNYQPIMTPAR
KLMATPAAGVGGGFFMQEENRSQDYGVSETMTQGGLPIKPEDKQYFDKLL
KVSDEDEEMLSPEELKERKIMKLLLRIKNGTPPMRKAALRQLTDKAKEFG
PAALFNQILPLFTSQSLEDQERHLLVKVIDRILYKLDDLVRPFVRKILSV
IEPYLIDQNYYARVEAREIISNLSKAAGLASMTATMRPDIDSPEEDIRNT
TARAFAVVASALGIPALLPFLTAVCRSKKSWQARHTGIKIVQQIAILMGC
AILPHLKGLVEIVEHGLTDEQPKVRTITALAIAALAEAATPYGIESFDSV
LKPLWYGIQHYREKGLAAFFKAIGYIIPLMDASYASYYTKEVMGILIREF
KTNEDEMKKIVLKVVKQCVGTEGVEAQYIRDEVLPEFFKCFWIRRMALDR
RNHKQLVDTTVELANKVGGAEIISRIVDDLKDESEAYRKMVMEAIEKIIS
TLGASDINPRLEEQLIDGILYAFQEQTTDETAIMLQGFGTIVLALGVRVK
PYLTQIAGTIKWRLNNKAAKVRQQAADLISRIAVVVQMCEEEQLLGHLGQ
ILYEYLGEEYPEVLGSILGALKAIVNVIGMTKMTPPIKDLLPRLTPILKN
RHEKVQENCIDLVGRIADRGADFVLEREWMRICFELLDLLKAHKKGIRRA
AVNTFGYIAKAIGPQDVLTTLLNNLKVQDRQNRVCTTIAIAIVAETSAPY
TVLPGLMNEYRIPELNVQNGVLKSLSFLFEYIGEMGKDYIYAVTPLLEDA
LMDRDPVHRQTACSAVKHMSLGVQGLGCEDALVHLLNLVWPNILETSPHV
INAFLEAVEGLRIALGPAVILQYTLQGLFHPARRVRDIYWKVFNMLYVSS
QDSMIPAYPKTIDDGLNTYQRYELEYIL
>DFA_00019 g22, 
MSEYGKAGSGGMQSSQYDNIDRRERLKKLAMETIDISKDPYVISNHLGSY
ECRLCLTQHNNIGNYLAHTQGKKHQTNLARRAARDQKDNPNNHFNKSSSA
MSHRPRIIPKKTIKIGRPGYKIIKQRDPDTGQLSLLFQIDYPEIEQGLQP
RHRFMSSFEQHVDHVNKDYQYILFAAEPYETIAFKIPNKDIDRTTGPDGK
FFTHWDKNKLSFTLQLYFKESSNKDQQQQTQQQQPPPTPTTRINRNDMYR
CRLVWNSNNKLIIPFIIFQEKQKEKT
>DFA_02618 g2559, 
MIRRNARLRQEYLYRKNLEGKEKDVYEKKRKIKQALNEGKPIPSELIEFE
FNSRKEMSIEGEEDYKRLNIDDEYARAGVLDPKVFVTTSRDPSARLTQFA
KELKMLFPNSQKMNRGAHVIKELVDACRANDVTDLVIAHEHSGEPNGLVV
CHLPYGPTAYFEIVNCIMIHDIQDAPPASLAYPHLIFNNFTTPLGSRTEN
ILKYLFPVPAQDSKRVLTFSNNNDYISFRHHIYEKDGHKNVLLKEVGPRF
ELKLYKIQLGTIDQPEADVEWVYRPYMNSTKNRTFL
>DFA_02670 g2613, 
MSSSLIERTRNLHESIERYELLIVGEQANEPKTVKESIIQSHRVNHYLES
SIECAKELGKIYKDEDQTRKNEISGITGTGNTVYSNFYENLREIKEYHRK
YPNLPVENLNTTLYYTPQISFTGNESYGRFLDLNEIYNQYVNVPKVNRID
YVKYLTTFTSFSYDDINRLGIQKYKLYIESLYEYLISFLKKTQPLFDLQK
TLSDMDKEFEEKWSNQEFTSKNDGVVVAVENNSENNNNGNGIGGDDDDSN
GKEPKDNDKNGEEETTKEEEKKKNDTKTTTIVSLDCKACKKSFTSQGVFN
SHLKGKRHIMLQEILDKNSETSKSSSGMLPFKPIVQKEFYISKFGDMLSD
QIEDSKENTLKKQSRTLKEIEEDLYADETVLDDDEMDEEPLKLRIANYPV
DWSGKPIPYWVYKLNELGIEYKCEICGNQSYWGRKAYEKHFTESRHAYGM
SCIGVPNTVHFNHITKIKDAIELYKKIKDQNATAAFNADREEEYEDENGD
VMNKKTYEMMVKQGLIKKRKH
>DFA_02697 g2639, 
MRPNIKGGVWRNVEDEILKVSVMKYGLNQWARIASLLTRKSPAQCKARWF
EWLDPSIKKSEWTKEEEEKLLHLAKIFPSQWKTIGPLVGRTAAQSLEHYN
RLLDAVQQEGGGIGGGEGDGGNNEDVRRLRSGEIEPLPETKPAKPDPIDM
DEDEKETLSEAKARLSNTHGKKEKRKFREKQLEEARRLAFLQKKRELKAA
GIILKEKQKKKDTKRFDYSQEIAFHKKPLPGFYDTTEESQVDPNKDRQFI
NARMDKMDASKTSEDTERANKLAHIKKKKREAMALPDLIKQVNERNDVDM
TVKRGKMVLPTPQLTDDDLEEIAEFEKHNNKIAASGSSSATSALVGGFKV
PQTPANSVGGSTSSTITARTPLREDNLKAEAKALLAMTTAQTPLKGGANP
AFNPADLSSVTPSMTTQRTPNPIRTPNTLKQELLAQSTPASSGFASTPLS
TSNKQQQRAERQSLLGQLNNLPKPVNEYEVSLPDDEPTIEEMDEDQDGVV
LDESERGIRQDQEFRHKQQTKMKNRSTVLKKSLPRAHSTINNKKDNKDNN
NKDRSSSSNNRFVVTEKITDDLENAELLLIQEMNDIILNVNRSFPMIIGD
NQQQPQQQQQQQQQQQQQQQQQQQSIIEEVDENYETFTNKEMDQAIILLK
KEMESMKQENYNQQQENVDGDKLLKEEFVNNWEKVNEKYVFVSNEIGYME
REKVTDDQYVKMLNEEYTHIINSMKTMSKKTAVIEKKMTSDHQPFVQRLA
NSAKSISQLHDDLVQASIELQCFRDLASTEKQSLENRHKHLENLVYDQCE
RENNLQTRYSKLILKKNQLLSSN
>DFA_03304 g3216, 
MPKYYCEYCDKYLTHDSPSVRRSHIIGKVHQQAVRLYYQQFEADYHKSIT
EQRIKEMIKPGTVPLPQGPPMFPQAPPHGMMMGPSPYGMGGMGMVGPPGM
GRGGMPMPPHNMMMAQGPPPPYGGQPDFNQPPPPFPFIPKQSFNPFPPQ
>DFA_03692 g3576, 
MGIPAFYRWLVDKYPKSIQYFQQQQQVEEQQEQDEVSSSSTTTTPLKIDP
RCNNVKFNNLYIDMNGVIHNSTHAKGEPSVESIMASKPNTDDTIKENIFK
RLNDIIVTTNPSDLVYIALDGVPPRAKATEQRRRRFRSAKDIREILSKGP
KQPNPRHNNNNNHSSSSPSTSSPSTSSPPITTDESSDPSTPSTPAAATTP
AVAQPNYLELLDKVFDSNSISPATEFMNKVNHWIGEYISTVLAKSHPHLA
VVLSDTTVPGEGEHKIMDFIRGNHKNWSESTSHIFYGMDADLIFLGLSVH
LQRFFVLRDFQSLSYGCSICKSEYHMSYECKSALAMKKLHASNKEGDIDW
QSNGINTTKISVRNIPSQASEADIREIFSYYGEIVDIKFEKAPTKKPSLT
AFIQFGSVDAVKDIACRGAYFYVNNIKLTIAQYYKDEKVEVESPLIEEQA
ENAVVEEDDGVYYNNSIFCTRLELDVTEKDVVDFFKPTGKVESVQFLTTP
RRKAHPFKFCIINFGNLDDVKKSLTMDGKYLKGNKVTLKKSRPPAQKEPV
VPEEPKPKVVISEEEKAEKEAEKEKRRIEKEEKANQFITIAGHGMNLDSS
YFYLGLAEWDFERANELFIKFGRALSKQEDEYLTVERKFEFVNLDNLRQY
IKYYLTFGLENESQLDINNCINDITVMCMLMGNDFLPHLPAVSIKDGSIE
LILGWYKDWIHNSIKSTGKVNYLTVDTNIIYSNFSELLNVLGNWESVIYP
DKLERMGKKDLARLTHILKVSKEREDKEKEEAEANGRMASEFEEERNDIL
TSQLANLQEGELNYYKLKLQASTSEVENVKQWVCQSYVRGLSWVLKYYTV
GCPDWKWSYNFHYAPLAADLAAYCKEMVVGKGFKDTSHIEFKLGAPLQPL
AHLLSVLPVYSGKFLPAPLADLMKSKQLSTFYRERYRIDLNGEEVSWKGV
VLVNFIDVSALESLANPIAEQLSKNDPHIAQLNKLGNDIYYLQGDAQSSP
DSYKIFLDQQKKEEEESSHTNQSINNSSTASSGLNEKDFKWATRYLITLS
PQDSDLYNSLD
>DFA_04168 g4009, 
MVQDPKTFRSGIIIEDWAALARRLARKYGEGRETTETDYLYTQHRVTVDV
NGQVIKSEAKIDKTAFDQSTSLIYDVRVSTSIERQLGQLPLVPPPESSSQ
RTKFRHTFIDVDNPQWKIDMTLVRTNTGEETYEVELEIFNTYIIDALEKN
TLHNLISKFINETKNIITMIQPGRLSFPDIQMTKLEDPRFVDVLKQKVLG
YIPDCNPNRTREFPGAMPINFGKKHFPTIQRDMYYVSEKTDGIRYMILIY
KGVMYMIDRKFDFFKIDGNDELCKVLHDDTLLDGEMIRHLESKEPMYFIF
DILARENTKFGDKLFQERMQHIGKVVGDYRQSVGSGELGKTPFILIAKSF
FEKKHISKIFSSIKTNKNGERIFSDQKRNHQTDGLILTPNNAYKAYADQS
LFKWKYLDLWTIDFKVAQNSDRKWFLHCAGPNNTDIPCKELMLSTEDFAL
LSTDYKRSRDQGCFIAEFSFEFSKGIWKYHLVRPDKKRANYITVFVDTME
SICEGITKEELEYRFLCAGGHDNWDVEVEKMRHHIASTLTNKLKQQQQQQ
HQQQKIQQPQHHSQHHGQHQQHQQHQQQDDIFGGGSSSNHNYDPHGQR
>DFA_04210 g4045, 
MGVGGLADYISTYYPSVVRFQQQQQHGVGVGGGRPRYDHLAAGSSYMSVG
QLRNKLGGRHSNNTRGAETTHLFMDMNSIIHTIFRRNPNTDTSKIYKQIN
MRIKQTVDEHFPVKTLFLTTDGPGPRAKIPLQRKRRSKSKEDGISSSLIT
PGTMFMSGLKDSLANYFKHSRSVSSAIISASDRYGEGEFKIFEYINSKTW
TDQDSVIVFSDDSDVILCSMLSSAPNIIVKGTSSTKCYHIADLKQQLIAS
APLINPKQLIEDFVFLNLFRGSDYYPRMDGFNFVRSWTAYLEEKSKKGLY
NPKTRSINKELLQKIFNIGEGAVGGGGDSVNSLRQLSWNTSLKNYIAMTW
SHLGFKFGKNNIISNNNKNNKEGTTSTTTTTTEPSIVEEIESKLLRPTFS
KEGNGQYYMTLDGIKHGPFKVDAKYENVDGWADLSPIVSRAILTEPDNIF
LNYYQPQLSAEKYQILKSKRTSFAMEIEDQENASPPDVGQYMQCVIWLME
LLKGKCTNFHHRYLPKYSPSINHFSSLSKLNNKELDRFALPLTPLECNIA
LTHQKTIHNVHPIFHSIINHSNHFSLIDYIAEQAWNDEESVNNLLKQINE
TDTSLLNEKEKRLMTFSPTVIYTKSGNQIYYQEEKLANETNKIPHIYSER
KPYIEDIIPPVEEEELPDPLNQQSSAFEPSTAEPFKTFNSFKESQDKLSN
LIKSNANHRKTTMLNNFNKNNNNQNNQNINNDTFNINININGKSLIGSIL
ESKSTNIKPIPTSFSSSSPIINNYQLVFNLLKKIK
>DFA_12844 g5464, 
MCKKELHHGSSSNPTVFNKEDIPCTSTSNTSPIHSPTLSTTTTSNNNNNN
VISPTSPDQRLSLSFITNNNNQNNNQNNNNQNNNNQKNNNNTQNNNNNQK
NNNNNQNNICRNVSTNCINFKVIDQLSTPPCIPSKNNSTTTTTTTTTDAT
TIINDNTDIISSSSSTTTSTLTPTFQSYRQVDQNFSINQLPTLVIKNILD
YVHGGKFPGYHAVVAKSVCKLWWDLTLSTIKSFILVEMWMECRDTEIANW
CNQVSRFLDRAYGHSSEEMKFKRRCCVDFIQLKMGVTSPMIEPVLLRIKD
YFGVVDLDLSFNRIDDNGAEQIATLLMGQLKGLNLARNRINDRGACAIAK
ELANNTTLVKLNLSGNFFGRPMIGRLFRIIQDSNDTLRDLDISGALVAEA
MECLEPTPKFSNGPSLTKLTRLNIASTASGRHIDFLFNCRQIANLLVHLN
ISDNNIQPSACKIIADALIRKECRLEILIIDDNSIGDIGLYELCKMIPHN
RSLRILSLGNNNISFSGVSHLCHALSNPGVKLQELDLSNNSLKSISIPYL
SFLFGAHSRNRHLHTIKLRYCQLEDEGADMMSRSLFNNTYIKSIDINGNN
IQSKGCDSLAKLIQQNRVIETLCLGNNSIDNVGAAILANAIKQNTTLKML
DLENNNINYFGAFPLLEALKVNTTLRDLNLYIVNIPTRNLRFFLNGTIG
>DFA_06069 g5858, 
MSDQWAEATSADGKKFYYHKVTRVSVWEKPDELKTPQELAAGSSSSSSSS
SSSSNGAVSSSSGAASSVPVSLPPNWKEYVAENGKKYYHNAITNETKWDL
PTADNLHNNNNNNNNNNHNNHHHHHQHHNNNNNNNNSNGDEAPSTTTPTP
TPTTYEPNSKEASIKMFKELLQSHDVASSWSFERAQRVIINDERYQVLKT
MSERKSAYQEYMVDRKKYEYEEKKKQDKKNREALIKLLKESGEVTSSMTW
RRASLYFDGDPKWMAVESEREREDLFRMVVIDLEKKEKEDKDLAKRDLMK
QIKAKFEVNLTITSRTQWRKVKEEYENDALISTCDKYEVLQVYESYIREL
EKKEDEAQRSEKEAAKKEARIHRDSYREFLNEKYNEGEIHAYTRWKEFYK
KYQSHPIVVQLAGQVVGSTPLELFTDFIEELESRYEKDFKRLKTMTQDVN
FLFSPQQTTLDDFKQSISTHDKFNSISALNIVPFFEYLKEREEKKQKDSI
KKRQKAILNFKALLEDTRTISKHSKWEEIKPTICKIPHYTDLDDEEEKAK
IFQEYLDFLSQEESDEEGIIKGDDDINARKEFSSKKRYSREGSDNITDDR
KRKESKH
>DFA_06329 g6090, 
MFSKKYKDVFEQEEGDSKSNGNKKQDNYDDYEFDEDELMKDSSDDDDDDD
DDSSDDDDDHHQDKDGVHIQSTENVEHLEIKLRENQYNFDLHIEYIDALK
KAKLLDRLREARFAAQRLFPLPLSVWVSWLSDEQQLSSPLQEQEKIDLFE
KAINDYLSINVWVQYCKFIENQVISNLGGDIKSGEDERLKRVRDMYERAV
IACSDHMVDSFKLWNTYRTFEQQVLAMIPTEATEDIKTKQLARIRSIYQR
QLSCPQMNLEQTYQDYEQWEQSQVNSSSSSSSSAAATNIQTRYQLALKVI
EDRKDYEKAVVDAKTTGEGGSTLEKWQEYIGFEKKDQSKKLNRIAILYER
ALQENYFVFDLWKQYLGFLEHDFKAPSATIFSVLERASRNVYWSGDIWSI
YMSRLEKYSDKDDMILKVDQVFERALVAGLSGPTEYQHIFSTRFDILWRH
QKKEGGAGAPLLDEEKVNMFEQHFQKEYEVLVSLGMDVSESLMFRAKFEA
YQLDNSTLADQTFQLLYAGAPHLYHLVDEYIRFKITKQKDIDGAREIYKK
AVKTIAETSRIWQDWLNFERVYGTLQTSDHATHVYQDTVQRYQAKQQKEF
EKQKLQQAQQRKALEDKKRKKEEKQEGGKALGAGDGRGSLKKKKIEKTTI
YISGLPFSAHSNDLVKMINERVGDLKEVHLVSDKNGKSKGIAFAEFNTSD
AAQKCIDTLHGDITFNEKHPINVTYSKKEFKQQSEQEKNTQLHFEQEQKR
LAQIEINFENSEGKTVFINNLSSNVTKEKLQSFIESNGATVSDVRVIVKA
RPFAYVDLPTPEQVQNALKLNNKYFLGNYMRVALSKPPPGSAPREHKPNP
NKTEIVANPFDSKPGDEEMTTSTTSTTTAVPTRKPVLFIPRGLKKAPATS
NK
>DFA_07065 g6797, 
MDSADEDDYYDDEDDDDDYVTSLNSQQYNNNSNNQQNQQGGDTYNYYDDQ
TDLVIDEDEEDLKNLQKVLEDEDSNDDDDSDNVVVEKSKSNNNQDFKITI
KKKPTTTTTTTTTSTTSTTSTTNNNNNIDKSIVLHSNNNNNGVLNNIPSL
GISFLENMEVLETSGMDKEEALKTLQLNQEYQKQLRLYLRNIDASIILNQ
QLLSKARASLSISVNPKTENVGNSKRAGVAPYFQDSEGAVPNDNPDTQFI
KATYNNMPTYFKSKRWTKNELSTLSKGVREKNMQILLFRLSQRTHSKDEY
DREKKKIENLTLSDLEENLDGLEWESIVHEYLPGRTPMECELRWRNAEHP
LINKQPFTKEEDKKLLELSKKYGSHSWSDVAQELGSNRPALHCCQRHQRS
LNTKFMKREWTKEEDEILLREYTKYRTFGDKSWQQIAEALEGRTGQQCLH
RWQKTLDPAIRKGRWTAEEDELLTKAVESYGKGNWILIKNHVPGRTDMQC
RERWCNVVDPALIKDPWTEEEDKILKDLTAKYGVGKWAIIAKELGRRTDN
QCWRRWKQINSKTPFLKEYREVLSKKKEIIVSNFVGREKERPAFDISDFI
SEEKLKEMSSSEALQQLSRQQTPKSKGLIPSKASKKRKSKRKQDSDDQED
NGSTLEEQQQEEEDEEEEMTEEQVRGLPTIQPPTMETLIERLMDQSKQLE
QQLSQKNQNNNSRMDTSDDQVIEEMIDNNNNNNNSTPPTNTLLPFTPTLT
HTTIPSPPSSTPKKPPRSSVARPRKSQKTTHVNNTTTATTECENDQTSTT
STTSTTSTTSTTTTPRTPKARTPKPPKAPKPPKPPKEAKPKASRSRKSTS
STEIPQLPNLVQVLPIATREQLDENININNYIQQNEQ
>DFA_07485 g7217, 
MKPMKSKKDEEIDGKVDIKFTPNEIKNKARRVILWNKLRQQTNKEKSERR
RDRLKEAKKLGDQAPAKLLPRTIERLRKGDETIVQDEDEEVSEDVNLDEF
ASYFDGKEPKVCITTNTRPQGRHITPFVKMIEEILPNCEYFPRKDFKLKD
IVKFCANRDYTDLLVVNEDNGVVHTMMMVHLPYGPTVQFRVTNITMPDKI
ENCGKMTSHKPELIINNFTTRLGLTVGRMFASMFPQDPNFKGRRVVTLHN
QRDFIFFRHHRYEFASNEKAYLQELGPRFTLKLMYLQKGTFDATGGEYIH
LHKADMDVDRKTFVL
>DFA_07649 g7386, 
MEYSDNNIYHPTFIGQPHLVNNATGPAMAQLYQPMLTATAPQPIYNYYGQ
SHMPQQHHQHQIQPQQQIQQMPQGMMQNTSYVMPSVPSPSMFMNPTTTNN
NNNNNNTTSSIYTSNSASTYALAPSSMIDNSHQQQQQQPIPIHQNIQHQQ
QNIPILQQQQQQMPQFGYYVQQQPLISPIPSPLFYSHPQQIVNNNPTQQQ
LSNSNNNNNTAMEIASTPTTMIQNTSPNSSSLSIGNNNNNNNNVTSSPNN
NNNNSGNTGTSQIMTKNRILINPLPLHTLSNLNSNSTSSSGGASSPRSAS
TTPKQKPYSPRNTKAKPSPQQQLHSHQLPLSPSAIKAGISSLPTYPAGGQ
TSPRQYNRTPRMVDGSLVADSNAFNSPRISSAATSPLSSSPITMNSILTS
AQFNLINSSGEITDKMATTMTISDPLTPQQQQQMFYQQQQPNLPLTPKQH
QQILSESTTSNICPAATYLSNSGQIAPSLPLQQQQQQQQPTIINNNNGSF
LGYQPISVSDNTSNSSLDEYCPMTSSPVQLNADDTNMVGQLAHHLQNTMG
GAVSTAMSTGSVPLQMQTLPILTPCGRCGNSVTTNDQAIACYSCAHLFHH
VCVVSHSGQQWQCTYCNQIQTDPQVHLMQQLQFNWNDQMSSSPLHITSSL
PPPPNHMIMPQQQLQDTTMQQLISPVVSQTPTTMVTSQQAIVLLPKDHDH
HSVNDNNSDDDEQDDDDDDEDEEEEEKMDKTSDEDEEDEDDDDEDDDEEE
EVMTTNSPASTPRGSSKGHWSKEEDELLKNLVDIHGTKKWKYIASLLTLR
NGRQCRERWSNQLDPTIKRDAWTLNEDRIILEAHAKHGNKWAEISKLLPG
RTNCAIKNHWNSTMKRKITKNQYDLTLINVDSSTIEAIKNESASKKKVTP
LIVAVSNSSNNNNNTIKSPRPSTRQPNTPRQSSDQQQQQQQIPLTSQTTT
TTTTMIIPKIQSFDQLQQFQQQQQQQQQTNMNDHDGDLHLPTLVSINTDD
QHFSTSTTSTTNNNMTSSSSPFSQQPPSSPPKPCYICETISFIRPKSSDN
KVHSLTMEHCAHFNVQYPHNYSTEKVYGLCNQHYVCYKRNTKTTTVDGNS
PRLIIEDPVQRELREAGLWPDLADIDRMKMETKTDTSKINLLNLYTVLHQ
TKNLPIEKSYLALYEAFNIKAKTKKNQVGAGGVPEKVDDINKKLVKNNIK
FKIKNLLLTYPHLQFYGGVKEFHLGRLQKVPEVLLVKNNHHLKMRFQESD
>DFA_08024 g7749, 
MTSRKNTSIDHSNQITTAHSISSLVSSSSSSSFDSSTSTPSSSFIMASPL
GAPTMTMSTLMTTTTSTTSPPSTVIAAPALTRTTSRTGSINNMMMMMIPP
ESPISTASTLSSSTDSAFSDISSASTVNGGNGGPLKKSKGKWTLEEDDIL
RQAVAKHNQKNWKKIAEHFPNRTDVQCHHRYQKVLHPNLVKGSWSKEEDD
KVRELVEKYGARKWSEIAQHLNGRMGKQCRERWHNHLNPAIKRDGWSEEE
DRIIKEQHVIHGNKWAEIAKSLPGRTDNAIKNHWNSSMKRSKKPSNFKRA
PRKRKVITKKEEGDEEEEDYGEEEFGEDGEGGEEQGEEEKSLNLNTTPQK
TKLAVDVPSLQNFITNGYIISPKPRVSLTTTLPASPRPPLTPVQAISNLV
TPIKPFSNSCKKKSKTTNDNNNNNNNISPLKMNYDYINPEIFPQNFESEL
SPIRSNTSLLATTSFIPPPTTPVGGLGIIATTPNSANTTASNGSSSQHLF
DHSNFENSLSSPNKLCLLSPYRSNLSDLLHNNNNHINNNNNHINGNSLNP
FSNILSPSPKYKTTSSTPTTPGSTIQYLNNVNNHNNNNTTTMVHSPPNHS
VNNNSSGNNCTGTNKTLGIQLSDKGIDKTSLQNINSKLKGITTPPKINNA
NNNNMMIDNNNENDTITTYSTPSKPSNNNNNNNFYFVPFTPFKDNNVSHI
STPLGSTRSSTAKLNNHIKQSEMENFITEDNFDPSTKKTSCISTTTTTTK
LNTTGLGNFEGCSSVALKMLNDTSKRSIFDKARKLLMTTENQQQNNNNQN
NTNNNNNNKNNGDDFVPNISFLTPSSSNFTPTTPLKQQQYFLPTPYKPPV
NMNMNNHEINSINNNNIENNNNNNNNGNNKPNNQTFLDHHGCYVGIH
>DFA_08117 g7838, 
MSSTTTSNNSSSENNNQLDDSKLQEKAKKWLQINSKRYSEKRKFGYVDPP
KEDMPPEHLRKIIKDHGDMSNRKFRHDKRVYLGALKYVPHAILKLLENMP
MPWEEVRNVKVLYHITGAITFVNEIPLVIEPVYLAQWGSMWVTMKREKRD
RKHFKRIKFPLFDDEEPPLDYADNILDVEVEYAVQMELDPEDDKAVYDWF
YDNKPLINTKFVNGPSYKKWRLDLPIMSTLYRLASPLLSDLTDQNYFYLF
DDKSFLTAKALNMAIPGGPKFEPLFRDMGDDDEDWNEFNDISKIIIRHKI
RTEYKIAFPYLYNNRPRQVSIPYYHHPPNCFTKTTNPEAVGFQYDPILYP
IPSYKIDRSASIYGDEDDDFVLPEDVNPILLKSAQVNTENTLDGINLYWA
PKPFNQRSGLTRRAEDVPLVKAWYQERCPSQHPVKVRVSYQKLLKCYVLN
KLHHRPPKSLNKKYLFKALKATKFFQTTEIDWVEAGLQLCRQGYNMLNLL
IHRKSLTYLHLDYNFYLKPIKTLTTKERKKSRFGNAFHLCREILRMTKLV
VDTHVKYRLGAAEAFQLADGLQYLFSHIGLLTGMFRYKYRLMRQIRMCKD
LKHLIYYRFNTGAVSKGPGCGFWAPTWRVWIFFLRGIVPLLERWLGNLLA
RQFEGRQYNTTAKTVTKQRVESHYDIELRAAVMHDILDMMPEGIKANKSR
VILQHLSEAWRCWKANIPWKVPGLPVPIENMILRYVKNKADWWTNVAHYN
RERIKRGATVDKTVCKKNLGRLTRLYLKAEQERQHNYLKDGPYVSAEEGV
AIYTTVVHWLEKRRFSAIPFPQTSYKHDIKILTLALERLKEAYSVKSRLN
QSQREELVLVEQAYENPHEALARIKRHLLTQRTFKEVGIEFMDYYTHLVP
VYSIDPFEKITDAYLDQYLWYEGEKRQLFPNWVKPSDNEPPPVLVHKWCQ
GINNLDGIWETANGECVVAMQTTLSKVYEKIDLTLLNRLLRLIVDQNLAD
YMSGKNNVVIAFKDMNHTNNFGLIRGLQFASFIVQFYGLVLDLLILGLNR
ASEIAGPPQLPNPFLTYKDVETETKHPIRLYQRYVDKLYVVFKFSSDETR
DLIQKYMSEHPDPNNENVVGYNNKKCWPRDCRMRLMKHDVNLGRAVFWNI
KNRLPRSLTTIEWEDSFVSVYSRDNPNLLFSMNGFEVRILPKCRSPNDQI
IPKDSVWALQNINTRERTAQAFLRVDKESMDRYENRVRMILMASGSTTFT
KIVNKWNTSLIGLMTYYREAVVVTREMLDILVRCENKIQTRIKIGLNSKM
PNRFPPVVFYTPKELGGLGMLSMGHVLIPQSDLRYSKQTDSGITHFTSGM
SHDEDQLIPNLYRYIQPWEQEIKDSQRVWAEYALKYEEAKTQNKNLSIED
LEDSWDRGIPRISTLFQKNRHTLAYDKGWRVRTDWKQFQVLKSNPFWWTN
QRHDGKLWNLNNYRTDIIQALGGVEGILEHTLFKGTYFPTWEGLFWEKAS
GFEESMKFKKLTHAQRSGLNQIPNRRFTLWWSPTINRKNVYVGFQVQLDL
TGIFMHGKIPTLKISLIQIFRAHLWQKIHESVVMDLCQVFDQELDNLEIA
VVNKEAIHPRKSYKMNSSCADILLRAAHKWQVSRPSVLQDTRDTYDGSTT
QYWLDVQLKWGDFDSHDIERYSRAKFLDYTTDSMSYYPSPTGCLVGIDLA
YNIYSSFGNWFPGVKPLVQKAMDKIMKSNPALYVLRERIRKGLQLYSSEP
TEPYLSSQNYGELFSNKIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIF
IFNPRTGQLFLKIIHTDVWLGQKRLGQLAKWKTAEEVAALIRSLPVEEQP
KQIVVTRKGMLDPLEVHLLDFPNIVIQGSELQLPFQACLKIEKFGDLILK
ATEPKMVLFNIYDDWLNSIPSYTAFSRLILILRALHVNNERAKIILKPDK
NTITQPHHIWPTLTDQEWIKVEVALKDLILADFGKKNNVNVASLTQSEIR
DIILGMEIAAPSQQREDQIAEIEKQKKESSQMTQMTVRTTNVHGEEMIST
TTSPHEQKVFSSKTDWRVRAISATNLHLRTNQIYVNCDTAKESSITYVIP
KNILKKFITIGDLRTQIMGYMYGVSPPDNPQVKEIRCIVMVPQWGTPVFV
NVPNQLPEHDHLKDLEPLGWIHTQPTELPQLSPQDAIMHGKLLADNKSWD
AEKTIIAAVSVSWPCTLTPYRLTPSGYEWARANKDSQNFSGFQPSHYEKV
QTLLSDRFLGFYMVPDRGSWNYNFMGVKHSANMTYGLKLDYPKNFYDELH
RPSHFQNWTQFDQKSSTNQDSEEVTNNADNENLFD
>DFA_09027 g8714, 
MNAGIAIQQHIPPSQQLQPAQQVTNQYQYQQQLQQQQQQQHQTQQHHIIP
PHSPIYHSAIPQYLHYQPQPQLQQYSHQYHHQLQQQQPQSIYQTQQQPVH
SPYLTAVLSTSNNNTNASNNNLSNSSQMATNNSREYGESTTTTSVIPPSV
SSCMTPTMPSQVMSPLIVGTPTSGSVAKRKLEDDGDFSLLKVQPLLVSSP
LLTPVTQSPGLTSVQLAFQNTTLSNPPTPLSMSPSLSPSMAPMSPSKKSK
NSRSTSKSKWSEGEGSGRWPKVGTIVKGPWKDEEDAKLIELVNKNGPKEW
STIASKIPGRIGKQCRERWFNHLSPDVRKTNWTPEEDRLIIESHQELGNK
WTAISKLLDGRPANAIKNHWNSTLVKRIGADIRNHPPSTTRDSKSGDDDD
DDDDEDIDTQSPALSPISLYPQDNALGKKSQLAHHTMASSTDSSSSSSNY
NIPPFVLSDSISNSVPIQTHAQSSSSLLHQQPLTHQQPLQLPQQQQPQQQ
QQQQTGTIVAPQIIRLQNSSTTPSSSTSSPPPPTQSNNNNNNNNNNSSQT
KKPLDLNFPQQQHEHHSVPQQQQTQSKAGYYQGAGGAGTTGGDITNYPPS
HDPTSTTSNYWEMTPSGEMPNHQTLFTADFHVFENPSEFLFFGDSDHQHL
QQQPQQQQQQNQHQQQNQQNNMPLTKTDLNPPQHQGSQPYDLNNLFNNNG
DISL
>DFA_09245 g8922, 
MKGLEPVTIDTDEIREVWAHNLEEEMALIRELVDDYNYIAMDTEFPGIVT
RPVGSFRTPSDYHYQTLRLNVDLLKIIQLGLTFSDSDGNLASNTCTWQFN
FKFNLNEDMYAQDSIDLLSRSGIEFKKNEENGIDVLDFGDLLMSSGIVLN
EKIKWISFHSGYDFGYLIKVLTCTALPQEEPEFFDLVRTYFPCIYDIKYL
MKSCKNLKGGLSELAEDLDIKRIGPQHQAGSDSLLTCTTFFKLRKMYFEN
QIDDSKYQGILYGLTSSFSQDNSQQNNNNSNSSSSSGTTSTSTSSTTSTS
TSSSTSSPSSSSLPNNNSYTASASSLNGHVITS
>DFA_09249 g8927, 
MEDQEGGTNNANDDNNNNNNSNVEEEDTNRRERDEDQQINNNNNSDDGGG
GGGEDDGDDNDDDDDDDDDDDDDDDEDVNIVLDTESVEAGRTSTKNSSGS
IKPSFYKSAPVTNLAGVKYQISKQATASQHQPNRPQKSIYDLDLGGFEDR
PWTKPGADMSDYFNYNFTEETWKLYCERQIQLRAEQANLGKIKSYESTNK
MINGGNDQKIDLPPEFMPQEMNKQQQQGGRGGGGMPTDKRGAVPQQGGRG
MPGGWQANMQPPGGGPPGNFYPGGSGGYPPPYGGGVEGPVPTMPPSDERR
RERERDREGTSSGSGTRERERERDSGSGSGSRSDRDRERERDGSGSGSSR
SDRSERDGSSRSDRSERSSGSDRDRDRERSRRGDDSEYKRKSSEDPNDDR
LKRRR
>DFA_09405 g9080, 
MAKTKRNYKPHLQTSKIKPTPENKDDESNGSGTSEVVSDDTIREETPEYT
IDGSILEGGGQIIRNCLALASLFNKPIRINKIRNGRDQPGLKAQHRSGVD
LLVRLFRAHASGCKVGSTDLYYHPRRLAHEIKDTSIEADTGTAGSITLLL
QISLPCLVFFGSSTKLVLGGGTNVAFSPMIDYIAQVFAPAAKLMGVDMDI
TIDKRGYYPRGCGQTTVITRPLTTEPLKPIEILDRGQVTKIIITSYFTST
RINPSVADRMCQHARKLLKKEFKVEIEERSVDVAKESYGDCAYIFIMAET
STGCRFGASAIGEIKVPAEKVAEDATIQLINDLNGGGCVDEFLQDQLIIF
MALAKGTSKIKTGLISLHTETSIHFTSLMTGAKFQVIPDPDGKKDVNIII
CDGIGYLNGQNQNQNINNCTTSTSTTTTTTTTTNNTTTTTDS
>DFA_09630 g9281, 
MEDNQDGHDVSSSSEQVVAPETTTTTEEQQQQQDNNNNTETTTTTTPTPP
LTNTNTDTTTVVVEETSAMDTTTTTAEPPLNSSSEEHTDNTTTTVTNGSS
VGAGSNGDVVEEHQKDEQKEEQKDEQSNQQEQSIQQEQKEEQSNQPKQKP
TITILKSPPVPLSSIASVMPAIGKRLNVAIDQLEARITSDKYDTEAWTLL
LNEVQSQPINIARDIYERFLAVFPTAGRYWKLYVEQEMAAKNNEQVEKIF
VRALRSVRNVELWRTYIQYIRSGQQNDREEVIKAFELALEYIGMDIASTP
VWIEYISFSREDRAAATNPQDEGHRMNSLRKLYQRAVENPMHDLDALWKE
YEQFEMSMSKQLAKTMLAEHLSKFQHARNVYRERKALLEGILRNMLAKPP
RASDKEAHQVRLWRRLIAYEKTNPQRFDAVQLRNRVTATYNQCLLCLYFY
PDIWHEAACYQVDVGSVDAACQFYERGLTAIPNSLFLSFSHADVLESSKK
VDKAKEIYEKLITATAPSTPPLVWIQYMRFSRRHERIEGPRKVFKRAKSS
PDCTYHVYIALGFIEYYINQDTKTARDIFEIGLKKFGTDITFVNFYVDFL
SNLNEENNTRVLFEKILSNVIPQEKSEAFWRKYLDFEYRQNQDLATVIKL
EKRVAGLSPAFEKHSLLQHLNRYKFLNLWPCHPNEIEIMSKNLLRDDGED
EDDMDEDGGDDQDTSSSSYSSNKYRNARGGGGGKWDHRGGGGGAGGGDEK
NEREDKPTSQTKIPLSTLKASRPDTSAMILYRNEMGKISARGGGGGIGGS
GEISPPIQSINVPPNNMPPGIMGGGMGGGVMGGMVNGIPDFLMPIAQFYP
PPQQFNGPWVDVDQLMMLIKESQFPPQLLQMMAMAGINIGMNPNMGGGNM
GGNMGGGNMGGMNQNINMGNINNNNNNNMPPNQSFNKQQQQQQQQNSQPT
TPTTTDRSNQSPPTSGMSGGQQQQPPQQQLLGKRKIDEPDNSNNGNQLSD
QQPPNDQPIKPNVIPAPVPSSSQPLPSSSVVQPTSNITSTTTPLNSAQPI
SSTISSTKLPDNDIYRKRLASKLSKKI
>DFA_09793 g9438, 
MSLKEKVVVQQQQQEKEEEEKEEEKETQEETKDENGSTIPKPPPLPPVET
VKSSLSSYPSSNRVIPDDALRIIQFNIQADIYTHPQRYHYCPSYALYRPY
RQYIIPEYILEHNGDIVCLQEVEVEFDRLRKVLIESGYNHTAVLAKETDR
QHEQCITFYQTSRIQVIEEHLVNYNTIEKHPELISKEQIASLTNNNVHNT
NMYNQLLHTLHHNRHNILLLECKKTNQKFIVVNVHLYWGASSNDTNYYLQ
ILQMNMLLIMVQNILTRHKLGSWTTIDDNFETNTPIIISGDFNNGPANYT
YRYLAKGNLNVNTNQGLVNYQHPFKFKSAYNLHPNGELKYTCITRDFKGC
VDQIFVNDKIKVESLLEVEKYYGECLPTITEASDHILIASTITFLKNKE
>DFA_00932 g946, 
MNEEEDICRVCRNGSTPDNQLSYPCKCSGSIKFIHQDCLLEWIKHSKSSS
CELCGYPFRFTPIYSDNTPDILPFKELSVEVLKRSFKFLKRFARISFSFM
CFLVMIPALTCITFHLFFGMSTKKMLPYTIFNSFLIGVTLYFFIIATSFL
SYIFLTFLNGKLIELEIEEPTTTEQEQQVQDDDEEDEEEDDDDDDDTEED
ILYDQDHIFDNGLEQQMNQQPAIPAHIPPPPPPAPQQQQQGEVRHIFRIP
GVIEILGVQQQDIPLNPEAVLENNNIVVNEHDDGDDIETLIGLRGPISDV
ILRTASFAIYNFIFLLIFLYLPFQIGKQAILVISQIDTGIGGFKLSQLTE
GIINLFIGYSSLALVSLYILSECIKHKIAYKISRLLYSFIKISMIFFIEL
GIIPFLFGVALDLLTLPLFGGNLESRMNSFTNNKIQYILTRLAFGLFSII
GISSSSRVLHQIFRPEVIWFLKDSADPDFSLVKFLIKAKLHHIFFNISMA
FLTYVIIGFLIIFLPLKVLSFVPDLLPMDFGDIFNKLGTDICLIVSTSYF
PRFHPQFTFNSFIKTTFTFFVTKLGLDDYMLIRKPTTTNTTNNTTEPVQQ
PEYNYQPPDQQPEQQQQPEQPQPQQQQQQSQPEYIKPNNYGYKVIIFMVF
CWFELFCASLLALSIPVSIGRYLFSLVQFTYHNDTVSLFVGLAVLWLVTK
GISTIFSGRTSVDHFFSKIPNVLKLVLMIAIFTVALPLLIGLVFELVVII
PLIANYDESYYIFVIDLFNIWGIGVLLLNFWYQWITSRAPMNNIRNRRPP
EDELIPHQNGEQDQQGDGDRWMDRFNQLKRNGFMGIDLWFSLQKIVFPIV
YFLLKLLTVPYFISKGVVPFFGGSPILENITFIYGYPVFVFLLASEILYF
KLKKSVIHFHNIIRDDRYLIGKHLHNLEQQRQQHH
>DFA_09953 g9594, 
MLKVQGSKQFRQRIVCSLLSGRAVKITNIRDEDERPGLADYEASFLRLID
KMTNGTRIEINNTGTMVTFHPGVLTGGKLQHDCPTSRAIGYFVEAVVCIA
PFSKAPVDIAFTGLTNNDIDLTIDTIRTTTLPIIRKFLGGEEGDTSSLSI
KIVKRGAPPSGGGLVYFKCPIVQQLKPLQMVDEGKIRRIRGISYATRVSP
AIPNRVLDSAKGILLQFTPDVYISSDIYKGAEAGSSPGYGLTLVAETTTG
CCLSAECMASEGEIPEDLGQRTANLLLEEILNGGCIDSNNQSLALLFMIL
CPEDVSKVRLGRLTPYTIEYIRHLKEFFGVTFKIEADDETKTIIFTCLGI
GYKNLARRTF


# Dictyostelium lacteum

>DLA_01330 g1210, 
MNMNMPPVNDDDEEDADSKRMKSKGNPTTTTTTTSTISTNGKNTSNNNNN
NNNNNNNNNNINDSNNGPNGGLNGLAQLIESTQELHSSQSMDDFSDISST
TSTISFVAPSKPKNKGKWTKEEDDLLVKAVLENNQKNWKKIATNFTNRTD
VQCHHRYQKVLHPNLVKGAWTKEEDDKVRELVAKHGAKKWSEIAAHLNGR
MGKQCRERWHNHLNPNIKKDAWTEEEDRIIREQHAIHGNKWAEIAKLLPG
RTDNAIKNHWNSSMKRERSDASSTSSSSSHSSPVSSPTIIQQKLKLKKAA
EKLWKKKPAPNSNNIDSNNNNNNNNNINNNNNNTTTTTTTSTTKKLTKKQ
QLQLQQQQQQQQLQQQQQQQGSLHHHIDVDTLHGYLVNGLPITNVNGKRP
LGSTMNASTGNIMTSTDQYPYEYSIENMISPVKLFTNSPNDHLKKKQKLD
QTPTKQQLSSTQNTVFEGYTSEIYELGFYSPIGKSDKNTSSFLNSSELSP
LKSPSKIINNNANQLQQQQQQQTQNGTPCKHFLDSNVCFPYSGGGGTPNG
HYININSPVKSFQSPYKNFNNFNLIQQQQIQHLQNQQFLSPFKQQQQQQQ
QQLLQQQQQQQQQQITTTSNSLPPRFNGENNKSSKSIIGIHLSASGIDKT
SLSTINAKMNGSSTTTAVGSGKSSTEHHQSNIYSNNAITSANNTSTNNTI
PYSPISQNDGSFQYQFPLTPGTTKSLSRFSSSHSISPQKQFQNHSDSFEE
DDVDVQQPQPQPQQDKQQEQVVQQSPMDPSMIALKLIKDNSSKSIFSKAK
RILNNSVSSSPATSVPGKFSTPISSSNSVNIQNNHHQQQQQQQQQQQLQL
QNGYLVPSSAGGNSNGMTAVNQSFGSPMNGYGNSYNTHQQQQQQQLLNNT
TRSNDTYLLSPATIQPQSPTATTTQCNSSISLNSPNLPASLVLTTPTTTT
TNTNINNREPIVFQLNAHSYKSQNISTTPSTPTTTTTTNTTTNTTSSPTT
TANNKNAFSFKESDHGILT
>DLA_02388 g2168, 
MEKIIIFPPIPVVHVPITLFNSNKQAIDSKYILECNWFIDDEIVQSFNDN
VDHHIGTQLKRTSSTSLLSLVTSIDQPKTLTYIPTLKDANKTLKVTCKFK
SKILNLIPDTSIFQYEHKVLSSKSNSQREIYYLDNSLNHPNNNTINNNNF
SSSKSNLGASLQNYRVIQYNILADGYVSKFIFPYCEPYALFYQYYRKYLI
GKQILQYNPDIIGLQEVEDSYVDLFKEMEECGYVRSPPFSNCTGLPITPG
AAQEGCVIFYRSSRFQVISHLLIRYQTINAQTCSVPNSILTLEQYQLLLQ
EPIFKPILEKVMPFTDHHTKHVLLLLQDKQTQRIFVAASIHNYWGSISKM
EFNYQFQCLQIVILSMILENFLRSNQLPLDTGVVLCGDFNAGPESESYKF
LSKGFFSDTAKITIPFQSPFIFKSIYSQLPYGEPRFTTYTKSFQGNIDQI
FINNSFQTHSILDISDRSLYNEFGYLPSIVLGSDHILLLSDIELKK
>DLA_02701 g2436, 
MSFEQTFQEEDGDICRVCRNGPTPDNQLSYPCKCSGSIKYIHKQCLLEWI
QHSKSSSCELCGHPFRFTPIYSENAPEFIPLSELVFEAMIRLKWYIKRFA
RIIYIIFCWLFVVPVVTCWIFHFYFGKQWFLSAYDRKADFTVNSFIYDFF
IGTMLFFWILFATVVSYVILDFIHHKHSEIDIQNEMMNYDIQQIQQQMAA
NNNQMHQHQQQQPVQPLFQNNNESDSETESDDSDQELIPNNVNEAYNQLP
QQQQQQQQPQIQQPQQPQARVIFRIPAVFDILRGEDQPIQQQLQQLQQQQ
QQPQPQQQAQVNDININNNIDDDGNDDIEHFIGLSGPLSNIITNCIILVL
FNAAFILVFLYLPFLLGQFVQELTLTKLEMSIFLKGSLDIFIGYAVFSIT
SLLFLSFLISKNIWLKFTVLLYSFIKVVIITVVELGFLPILIGMYIDFSS
LSLFGSSISSRFDYFMSNKLPFLITRWGFGIFFMLNFTYLLSTLHQIFRK
GVLWFVRDPDDPDFDLVKDMIKSSFQKHLFKISFSVFAYIFASTLLVYLP
SKALSLIPNLLPINIDFGQATNKSTSDVIFIYAISYFPRIDARVTIKNVT
KFWVKEASNLLKLDGYLLPQPAVPSNQATNNNEANSQAAAVVVKPDNFGL
RISTFIFMGWLSLFLIISTYLSLPVIIGRYLLGPVSGNDIYSIILGLITI
WIAGKSIYILVSNGSSINLLQWSIILLKIIFISICCLILIPILTGILLDL
IFFIPLTTPYDETLHFDYNEKFKILFQYWCSGALILQFWYRCVTAANYNP
NNIRNNRPEDLERPRDKWIDRFEVLKRNGFANINVKYTLTKVVFPIGHYL
LTLFTVPYCISKFVIPYFGGTLTLESIAFRYGFPVYCFILIAEKLFMQIK
SWILIFNNAIRDDRYLIGKHLHNLDESKY
>DLA_02844 g2562, 
MYNFISTVQKPTAVSHSVTGRFTGPHDNNLIISKCTKIEIYKMGTDGLKP
MLDVNIYGQISSLKLFSVNGYDQDLLFLSTERYRYCVLAYDQQRKEIITK
LSGISEESTGRPSEPGQICIIDPQSKMIALHIYEGLLKVIPLSTNIFNSS
SVSGSGNITTQEAFNLRLEELQIIDLVFLDKCDRPTLAVLYKDTRHSRHI
STYEIKIDKDHSPGPWSHNNVEIGSNLLIAPPFGGVLVVGEQVITYLNGK
FPVSVQIPFTTISCYEMVDKDGSRYLLGDQSGQLYLLLIKLDNQGNVIEM
HIELLGETSTASSISYLDNGVVYIGSSQGDSQVIKLKTEKDTQTDSYLEI
LDTFENIGPIQDFCVVDLEKQGQGQIITCSGIFKDGTLRIVRNGIGIAEQ
ASIELQGIKGIWSLYDFNQVGSSSHQNADRYLVVSFLTSTKILQFDGEEI
EEKEYIGFDLSNQTIYCGNIGDHVVIQITRNGVYLIDGKAQQLLDQWKPS
TASGQINLTSRNSNQILLASGNQLFYLEIQQKKIKQISTVEMPFEISCLD
LSSFEGQEQSQLCAVGLWTDISVRLLRLPQLEEVCKEILGGEIIPRSVVL
LTMEQQHYLFCSLGDGHLFNFSLNINNHTLHDRKKLTLGTQPIILQKFQK
NQSMNIFASSDRPTVIYSKSKRIFYSIVNLKEVSHVCSFSSKVFPNCLAI
ANQSSLTIGTIDQIQKLHIKTVPLNGEMARRITYSEESSVYAIATLRYSL
DDQSSSSTTTTTTTSSNNNNNNNNTQKKTNSTGYGIPEFQLKLLNDQTFE
QTSSYQFQPDEYVWALTTCKFSSDHNTYIVVGTSYREKESGPLKSSQGRI
IVFSVHESRLVLCEEQPTVEPVYYLLPYQGKLLAAVGKRIQVGSWKFNSQ
EENGKLQLSESVYKGHTMIVQLAARGDFILVGDAMKSMSLLSVTADGKFN
VIGRNPQPIWLKSIAIIDDDHFLGAETSNNFVVIKKNSESTNEQERQLLD
SVGHFHVGEGTNWLKHGSLVTLPEQDQQQRKIPTILYVTINGSIGVIASI
TKEEFDFFSKLQEGLNKVIKGIGGFSHSDWRSFANDHHIMPANNFIDGDL
IEMYLDLDHDKMLKAIQGMNMSTDEVYKKIDTLMQHIR
>DLA_03103 g2802, 
MTYQFNELPEQFNNEAEDNFFKEMKEIGRGRSRTRDEISKIKRTKLTYFD
EKVNVSPFSSIPRVIPPGLNDQQISALILRIRIEEITKKLLSGVFEITDR
DRERSPSPPPIYDNNTGKRTNTREQRIKEKVMKERHQLILAAQQISTSYK
PPSDYQPPVEKKTCKIYIPIKDHPEYNFIGLIIGPRGNTQKKLEKESGAK
IAIRGKGSSREGKSTKPQYQENDELHVLLTADTQEQLDKASILVREFLVP
VEEGKNEHKRQQLRELAEMNGTLRERPAYIARSWERADIKCVHCGESSHP
SSDCPLRNNNDQQMLSIIEEEYKKCISEVKEILGYDFELNLNDNNNSNNS
SNNNMANGYDEFREALDKDEQLQQQQYQQQYYNNNSNNNYMQNNNYNQQT
NNNFNQYQNWNQNNNNDGNFNNNSPYGPSNPSHQQSNIYNQNFNTSPYGP
SR
>DLA_03551 g3204, 
MSEVENEQDVVMNTDNSNLQDESNEQNTDKQESNIDENNTNDGDENNNED
EDDEEDDDDDDDDDDEDDVVLVLNSESVEAGGTARSFKTGQNGKSGFYRA
PTSMTPGLNKYNIVKQAGGTTFSSNRMQKSIYEVDLDNFEEKPWLKPGAD
LSDYFNYNFTEETWKAYCERQNQMRLELTNQGKIKGYESKSVDSKSDLPP
ELLGVEQQQQQQQQQQQQQHQMQGHIPKRVPPYLLKKGPPDQRQNFNPHD
QDSQDRSHHHHQGGGGGGRGGNQYANNPNVSGNGSGSGGGGGGGGGGGGY
QGRNSVGGQDYRKPYQNNDEDGDRRNSSNRGGSGSDYHRDSRSSTDTRER
ERDDRRERERDRDPRDSRGSDERDRRGGDDRRERERDRDPRDSRSSSDYK
RKLEDADDDRSKRRR
>DLA_00419 g377, 
MLHSYCPISGQYIMTSQNPNNMNSSQNSNPMQGGREYGESTTTISIVPPS
ALTPTLNSQPIQQIQQIQQQQQVISSPMMGSAKRKFEESDSQISYSDYSS
MLKNPVLVSSPLLVTPNTQSPGLQSVQMAFQAASLSAPSTPLTMSPSLHP
TSPMSPSKKSKNSRNSGKSKWGPTTLHLSQSSIPEELQMPSPNSSTSSLK
ANIIKGPWKEEEDAKLVELVNKNGPKEWSSIAAKIPGRIGKQCRERWFNH
LSPDVRKTNWTPEEDKIIIDAHANMGNKWTAISKLLDGRPANAIKNHWNS
TLLKRVGGESTSSPRTRKQKSDKSGDKDKVSTEDEEEDEDEDENTSPALS
PISLYSSDNSVQIQNNIVHTPTQHPNISYNNIPPFVLSSNSTTPYPMLDQ
QQIQQQLQQQQQIQLQQQQQQQQQQQQQQQKNTGANGNTVIAPKIVRLQT
PNSSPSLGSQQQTQQTQQLSKKEQKKQLQLQQQQQQQQQQQQLEQLQQQQ
QEKMHQPPIHQQIEQHLQQQVSQYQIPIQSNIYQQQEYIIQQQQLQYYQQ
QQQQQQQQQQDYIYHQNSQQTQNNGQQHQHNSHVLQGSTQEVPYFDPYNF
IQQQQQQQLQQQSLQHHLNQQQGQNQQQPTQQNIDPNSHTYAQDYNHDFL
LFDSDHQNVNMPNQLHHIKQDQTSYHQISQQHHPNILQSPTQTQQNNQQD
QKHVNSYDISGLFNLEV
>DLA_04230 g3813, 
MSGLHEENGITIQEQSGNLKNITIKDVWSHNLEEEMAKIRELVDDYNYIA
MDTEFPGIVTRPTGNYKTQSEYHYQTLRMNVDQLKIIQLGLTFSDSEGNL
AKSTCTWQFHFKFNLNEDKYAKDSIDLLSKSGIEFKKNEMNGIDALDFGE
LLMSSGIVLNDKIKWISFHSGYDFGYLLKVLTCTDLPQDEIDFFQLVKTY
FPCIYDIKYLMKSCKNLKGGLSELAEDLDIKRIGPQHQAGSDSLLTGTTF
FKLRKMFFEGQIDDSKYLGILYGFTSYLQDSNGNLMVHPVQPQQQQAPPP
QQQQQLQQQIQQQQQMQHQMHMQMQYQQQQQQQQQQQQYTMYNNNNNNII
INNQQNNYNNIYYPTSSPSSYNRGYPYYPNSPINTNSPSNSNINK
>DLA_04254 g3836, 
MNGVIHSAVKLDDSGNIRGNLIKIRDLPNEMIKSSLYYRLDQIIHSIKPN
HILYIGIDGVPPRSKSIEQRKRRFKASKEALDVINKMRSYQKSNGNGTQL
VSGNQQPQQESYFDSNSISPATEFILLVNEWVREYCVKLSKIYENLNIVY
SDSSVPGEGEHKIMDFIRSYQKSSHYQSEKESHIFYGMDADLIFLGLSTH
EKNFYVLRDALDLIQCSVCKSNQHLSYECHSAFAKRKLNFEKTTKITVKN
IPIQTTEETLRKLFGFYGTIVSLSIERANTKRSSLSALIEFSDKKIIDEI
ASRGGSWFINNDRLTIHINYPKSTKSRGSSGSTSTSNNNNNTDGEESGDE
EEDDSNGDIKDTDEDIIDNCLFISNLDILVTPFDLEAYFQEFSLVKSIDI
LPSPRYPKQRFARIVFQDQKSARNAFNIGRNSHFFGSDITIKIPIHKPPP
VTETPQEKAEKEKQQQLEKERRMLEKEMKVATFLKMADTDTSRDSAYFYL
GNSEWDLEKAYKLYLDYDKAPMEIPSKPIKFTFDYVNLSNFREYFRYYLV
CGLSMEHQLMIDENRCINDFTLMAMLLGNDFLPHLPALHITSGSIELILS
WYRDWLHESIKNKEVRYVTTQDSNNINYQNFYALLSVLSDWESHIYPDKL
EKQVKREFKLKNPSLSPPSSPNSSNTKSKVVTLKDLNIIDKEKPKKNSNS
NNNNNSGNGNNGEVLEGFEFTYEKDCYYKIKLEQQFIEQKDGLIKSMCES
YLEGLVWVLRYYTHGCQSWEWYYPFHYAPLARDLQQYISEKMSGGQLNTE
FQFNMGAPLNPLVHLTSVLPIYSSKFLPEPLRSLMVPPSPVSKYYKEDFK
IDLNGEDVPWKGVVLLDFIDFQLLRKLAEPIIESNLSDVEKHRNHVGNNQ
LIVNSQVLEMPDISVLITSKEFHDEVMSPRSSALTVRDLNWTTRRVFTLS
PQTMVAYQEIPQTVAVRAPLHQNATVQMTELQTQFLEWRKTLGINQSLVV
LDSIDSKSCNALNLDSKQSIKSLLRVFSNEISTEKDIVIVSQDGDPQMIV
NIGFSKPTKVQGIKFISTFEQKYQPKQIKVYINTNLDFSNAASEKPIQII
DIKNSSDLAIVSTPITFDSLKFKTIQSLSLFIESNFGGNNITKLEKIVLL
>DLA_04891 g4424, 
MSARLLRMKKDVDTDDYSPSEYKLRRSPNDIKCKSKRMELVGKLMAAKKI
AREQSRKLRKKERDNLGDQAPPKQVPRTIESMRRADETVVDKDDGEFDEE
INNDEFASYFNGKPPKTCITTNQQSGHEAKNLARMFSKIFPNSGYFNRRT
YNLKEIIEFCNNREYTDLVVINETKGKVDELIISHLPNGPTATFRLTSLE
FPHEIAGSGKITSHTPELIVNNFTTRLGHTIGRMFASLFPQQPEFHGRRV
VTLHNQRDFIFFRQHRYAFESLSKANLHELGPRFTLKLLSLQHGTFNTSS
GEYIHIHKHDMDVDRKKFVL
>DLA_00493 g446, 
MKKVNSFENQNNYEFSPPPKIHIDNLVSSGGSGSLSPPSPRSLTSSSESN
SGGRLRRASSPLAFLQSLSPKNKKKNKKKTEFDPLSLMGDINTTNSDSRS
SSIDSGYYKPLLNKILNINNLPHRILIKIFNYLIIDKFDTVKLEIRKSKQ
KPQQLVSLDNNNNNNNNNSKSSSSIKGFGKSKKKSISKDEEDEEDKQNDN
IDLESICLVCKLWGLEIAPQVFHYFIVKSPKDLKSLIGLVTQGLQEGGRK
FQFYYISMIIDKSSTYQKFMNILKHRMPDKLPDKFVKATTKPILSNLFSK
SLFAQFFENSTSTRYFRFYQKWMSKDNFSAIGMALRSNTSICHLSFRNNN
LEDVIVEDVIKALYDNQTITYLDLCGNKLGYQTAHELAMVLTKNRHLETI
DLFYNNINTDGGSALFKALRINSTLKNLYLRWNHIKTPAAIDLAETIKLN
NTLQSIQLDRIEDAGGGCLFEALCENSSIVEINLSDCAFQQKSSVAISKV
LSSKISKLSVLNLKSNQLGLLIRPLAISLGSCQHLTKLNLADNRISDDTG
YLLGESLGENKSLTSLSLSMNGLSNHFSESLSLALRINQTLLALDISANK
ITFEGAKMIAESLQSNSSLKLLNLNQNSLSPQFGPIIAETLKLNQTLTHL
EMAYTGLRNEGSLPISKVLALPTLHIKKLNLSENSISDQVGIEFANALAT
NQFLQDLDLSYNTLSSKSKEIFEQSLLTNLSIINLTYSSVPLKWKFQI
>DLA_05215 g4709, 
MENVEKDVLSLRQIIDNRQRVLSQRYEPTESNENDNIEDYDEDDEFDDSE
VDDELIIDGDDDNNIESTISTQKPMNFTRSLQQQQSNPIHQLDLDIDDED
DQDYEDDEEDLIDDIDSRPLKKHKNLPDSSRTISIPPLTTTTTTTTTTSN
NNNVNTNYNFDKVPFSEQDFRNYQQILKKNSIQQEPPQLIELNTLDDILP
DDILSDLPNHSTTDKQYAKDALQLNREYQQLLKTFQVQIDEAIKRNSQLI
KKITTQQKLSYSNMYAGITQKKNERKAGVSYFSYEIQVDGVTQTFYPAEN
LDSEMIKKNFGTMPLFFKCRKWSKGDINLLHKGVQDRNKAKQMFRISESN
LTRAEYSARMDELNNIAPKEFENYPLTFDDFAMICMDSFAQRQPDEIKLR
WDNFENPSINNGNFSKAEDKELLRLALKYEGRQWHEVARELNDKFAPWPP
VPEPTRDNPHPARPPRPPIRTPISCLIRYQRSLNPTLMKREWTREEDETL
KMAFAIHGDKNWQTIAEYLSARTGQQCLHRWQKTLNPNIKRGKWSAKEDE
LLRNAVEIYGYGNWVMVKKHVPGRTDMQCRERWCNVIDPQLNKTPFTPEE
DRKLKELIEQHGVGKWATIASALGTRTDNQCWRRWKQVHNKSEDLVKYQE
KISKKKQVVVGNFVGREKERSSLSVDDILEVQESLKTNTTPTSESPNLNS
NNNNNNA
>DLA_00535 g483, similar to H. sapiens REXO2 and S. cerevisiae REX2 a mitochondrial 3'-5' RNA exonuclease there is a second copy of this gene 
MLKFIFGNYRTTSIFRNYRHYCTNNNKILDMSDNRAHRLVWVDLEMTGLD
LNKDVIMEMAVIVTDENLNVIESGPNLVVKVDEDKLQSMNKWCTEHHGQS
GLTQRCRDSKITTQEAEKIMVEFIQKHTDKGLAPLCGNSVHEDKKFLNKE
MPLFSDWLHYRIVDVSTIKELSRRWYPNELKKAPKKQMLHRALDDIIESI
EELKYYRTNVFK
>DLA_05449 g4922, 
MYNFNYQDDDQGNITMIPTPTKVNSKPRLSSAKTTIVSAGLGNEIREVWC
HNLEQEMALIRELVDLYPYIAIDTEFPGFVTKPIEAMRMKPDYNYQTLRI
NVDALKIIQFGITFSDNTGKLPQPTCTWQFNFKFSLKEDMFSHYAIELLT
NCGIEFSKIEKDGIDVSDFSELLISSGIVLNDKVKWICFHGGYDFGYLLK
VLTCTDLPKKESEFFDLLKIYFPCIYDVKYLMKSCKNLKGGLSGLAEDLN
VLRIGPQHQAGSDSLLTVSIFFKLREEFFENEIDDFKYKGVLYGYNFETH
HDEPFNP
>DLA_05543 g5005, 
MIRRNARLRQEYLYRKSLEGKEKDIYEKKRKIKKALDEGKPIPTDLVDFE
FQVRDEMKLNGDGEVKPPSIDDEYARCGIVDPKVFITTSREPSSRLTQFA
KELRMIFPNSQKMNRGLHILKELVDACRANDVTDLIIAHEHRGEPTGIII
SHLPYGPTAYFEIKNCVMIHDIQDSTPPSLAFPHLIFDNFTTPLGERTMN
VLKYLFPVPKDDSKRVITFANNEDFISFRHHIYEKDTYKNVILKEVGPRF
ELKLYKIQLGTVDQEEADVEWVYKPYMNSTKNRLFL
>DLA_05599 g5059, 
MAISNINELPESILILLLNYVNSVSSSSYWLVNYSLVCKLWSTQILPEAW
TDLLIATSQPSEPLIEYNEKGSLSQIASHGGASRAYKFNRLIMRIGKVTE
YLDQLKSVTTVDLRNNPSTNSVIDRLCDSLKTNKTIKSLNLYNNRLMQKG
GVSIARALEKNTTLTHIDLGLNLLGANGGNAIADALKKNQTLIHLDLSSN
QLGFRGVGPIIEALKINKSVKYLILHSNQLRDESTLLLADILRQNSGFIE
LGLNDNEIGSKGGIALARMLKTSKTHTHLDFGKNELGEDGGVAMADVIKF
NKLITQVRLNWNKLGVKAIKAISEALKQNTSVNWVDLSFNNLTDEGLTIL
SDCLKVNKAIRYLDLSRVATSAPGHKALAESIKVNQYITYLDLTNCKISN
DGGVAIAQSLQSNKSIRTLILNQNLISSDTIQEFSKTLSVNTTLHQFSLV
QNSLDISGLESLFQVLSTKNSTLGVLDLSSNLLGEEGGKTLAKYLSSFKL
SEISMANNQLQSSGATAVLANLSQTIQTLDISNNAISADSATQLSKTLTN
STTLLKLNISQNKLGDDNVPALVQSLQSNKSLIHIQISSNQFSQSSNNQL
LNSIRSNKSIFFYDLVEEGSN
>DLA_05656 g5107, 
MNNQKKKPQQGGNQQQQKQQQQNLSGSGGSGHSKLQIQQQPQLSASQSQM
KDRSTFIYMSLIGYYVNVTIKNGVVYEGVLHSVQPTVGGGIGIVLKMARK
KETGTITTQPTPTIMIEAKDFVSLVATGVQLEQHSRINGSGSPHHHHMMK
GDGTINTDTDISGFDGNLRERELQPWSSETHYDESLEGDSGRENGSDKKW
DQFATNEKLFGVKTTFDEQLYTTHLDKASEFYKSNIHLAEKKANEIENDK
SLNMHLLEERGHIQGNDYDEEERYSSVVRKGNPNPTTTSPTARVPLIPNT
DKYIPPRERQRLLQQPSPTTTTPATNPTTTSPTTSNKPEQKSTTPQKQQP
EQQQQTPTKESTTPSKEKDGGDQPQQPTTPNTMISGGTGTANVSKLILNR
DKQPNDEPDHGVLGSPRDGLSPRFVEYMKVRQGLTDSKTKNLSGSDTPKS
PLIQNRELLNSLSLEVVSGVNPDVVNDFNNFKLGKMNQSMDRQTNFENLK
TFQRDYNIKSKSRPSSPSVSSPRVLPPPSHFSLSGSSKDDQKSDDASSTT
TTTVATTTEQTTVQEASNQDTKKDIESKPSKSETTSTSKDTTTAQSSSAT
ATTSGDKSTSATTPITSTSATPTNSSTGSLSKFKLNPNAKSFTPTVGGTK
APFKGSTDDLNKSTSNLQTTDTQTPINDVYYESMKKRQAIQEPSDSVPPY
WVDPYGRQYEDDPYYHMRGIPSTMVTMPINAIPFYPNMPPGKGPTIITTK
PLPYQPTPPRTYANGQNAFLSYVHQPQPPPGYQYVPQGIPVFNTSPPPPL
IGGKRYFHPPPNSTQYSIPLIPNQPPPPQQPGTSPNRILTPPTIYPQPYG
TITTTRYQPPHEVSPQHGYHPGNFQ
>DLA_05769 g5197, 
MSNLFHVFQKQVQQSTGVEHCVKANLTSANDINLIVSKTNILQIYTIRYE
KIEKPENHSNGDGGGSEDKQKIETRPCLDLVLEKSLFGNIESLNVIRFPD
EQRDAIILTFRDAKISVLEYNVDLMDLEIRSMHYYERDEYKFGRQHFKHP
PLCKVDHQQRCAVILLYDHSMVVLPFKQAISILDDDEDTTMNINDDDISS
IMQYAQQQQQTQNPYYQQKSYSSSLLDPTDFCFLHGYYEPTLLILHEPTQ
TWTSRISAKKLTSVLSAVSLNLSAKQTPTIWSIDKMPYNCESLLPVPEPL
GGSLVVAPNILFYVNQSSRYGLAVNEYAQTDTGDQFPFPLDNTLNLVFTL
ERSTYVFLESDRFICSLKGGELLIFHLISDGRSVQRIHVSKAGGSVLSSC
ICVLSSNLIFLGSRLGDSLLLLYTETTVSDSGEEHENFSNPYKKQKTSEL
FDLFDEDNDTVQMQKQQQLQKQQEEEEDDEDDIFKEKKSQIKTYQLGICD
HITNLGPITDMVIGNSYDMYQQQKEQEEDQDDYNPPSKSSDSATTPHNLD
LVTCSGYGKNGSIVQLQKNVRPDLISSIPILDVTNSWTLYYESEILQKTQ
HNITGKKRTIDSISTESESPNTEESSNSDSKQSKDSSNDDSNSYHQFLYL
SLSDSTLIYEISQDLKEIGKFNQSTLAMGNIFGKSRIIQVTVNSVKLISG
ASTVTQELSFPTLKIRQCYIVDPFILIHFQNGSISIYQGNEQVHQLMEFP
FLKDRLNITASSLFIDHHNLYFKSTSSSSILDTTNSSQLSTKVNLILIDS
QGIIEIYKLESKELLYQYNNFFNEADILHWNENINDYENTISQYLNLNKT
NKLTNGQHQPNINNNNNGELKHSKITELSVHFFNQMEWSNPYIIGINQLG
DIIIYRGFKTPKDNILFRKFNHGIITRPLENSGGNGNDGKRIIEFSNIGG
KRGLFITGKSPLWLFCEKNYLRVHCMNNEGAINIFTPFHNENCSNGFIYF
TESSALRICQLPMDMNFENQFPIRKYMVKNTCHKISYDQVSKCYCLILSY
PVETGEIPESDQRKPVIVEYKYQVKLIDRRDLSTFIDSFSLQEKETALSM
KMVQLKFTDPDGQTRLKPFLAVGTAFTYGEDTQCKGRILIFEIITHIGQQ
RLNLLYEKEQKGPVTALSSTNGYLLMTIGPKLIVNNFMSGSLIGLAFYDA
QLYIVSISTIKNLIIIGDMFKSIYFLKWKDGKQLVLLSKDYQSLNVFTSD
YIINQKTLSLLVADLDKNILMFNFDPKDPNSRQGKMMLCKADFHIASNIQ
KFIRLPLRSTSNSTTSTTSNGNGNGNNKNHNSMIIPDQQMVFGGTLDGGL
VTLIPMNEQQFNLLSHLQTKLYHIPHGCGLNPKSYRSFKSYQQHYSPSIQ
QPQKFILDGDLIHHYLTLNNNDKHLLAIQINSTPDEIISILNQINYSSST
F
>DLA_06010 g5401, 
MLKYQGCTHFRQRIICATLSGRPIKITNIRDEDEKPGLRDFEASFLRLID
KVTNGSKIEINTTGTALTYIPGIIMGGKSLTHECGQSRGISYFVEGLLCL
GPFAKAAIDITLTGITNNDLDLTIDTLRTTTLPIIRKFGLEEGLSIKVLK
RGAPPNGGGMVNFKCPIVPQLRAVQLVDEGKIRRIRGIAYATRVSPQFSN
RVLDTAKGLLLEFTPDVYISSDHYRGTESGLSPGYGLTLVAETTTGCCLS
AECMGSSSGIDSGESPEDLGKKTALALLEEILNGGCVDSHNQSLALLFMV
LCPEDISKIRLGKITEYTMEYLRHLRDFFGVTFKIEPDQDSKTVIFTCLG
IGFKNMARSTF
>DLA_00627 g569, 
MVFIGLRLKTALIEALTVILSSILIFLPIKFRNFNALILVPVFVTIIACN
LSSQSSMISGGIVVISTATSSLVIYAFLKIFQERIWVSFIVGFFFAFFLQ
CTVLRGGRWFNGLACKKILLDLVIVYYFSPYPDSEKETDILETFICSLFM
LFSIVIVSIIFPVMATKLFHFNLMRTLRTSRDLFRAIGYSVESKLYKDPN
QQTSSKIPKNHSFTFPLNDLTGHDDEDGEDDIKVQHTSIEQQLQQKNDMT
TLEEKTNESQEKDIVSSSNTMKSSEKNIEKIVLPLSSLVTKKEVTFQLPL
GGEKEPKSPNVSGDIKLKKLKKSKSVEILSKSIPIPTDKEIKELQFRLTD
EVNRLTLVLKECKEERWNSTLVESYKAILNLVEMSLKHLMSLRISIESGF
SQNASRELVSPMEPFLDSLIEEVYLQIGLMIDVLKGKLHLSETPTSSPNS
ANAVSGADNPKRKFSRKEQKTVIERNILESSFEETDELIVKLQEFYKQLV
GEYQRSGLPGLHESEISRLHFFIFGIIEYARQQKVIYQLVLQIKARIRHE
SIRYQVIRYGLVYVLTALPIHWYKVTLFIISKFSKKSDVDKETPNLNQQQ
TVRNPDSEVPKKDHILKRIFKFIVNYIYVLCFKNGKWKFPLQIAIAYTSS
VIVFWYINGETKGELVIKGVWTCATAILVMSPSVGASLLKGFNRVIGTMG
GGGVGFLVSWLCSVIPKGGKEVVILAFTFVWITIISIIQQNPSFSYSGAV
SGLTFVLVVYGQYLYGFDYWYALFRSFHITMGVVWVIIICLTVFPYFSFQ
YTRIKMVNTTIQMSRTFVNIIRLGLKIETLNQSQEIMMDIDYTDRDRRAK
EIRKSLTDQRMILDQIKLSLNDIKSELILMPNKSNAYRKVYKDLSYSYTR
LVAAEASFRSSFSDPLLQAMSPINQKIQGIFNELDALAKDLNIFTTLTIS
KSQRSQLTVDHEKQLTDSVKALGDSFQEVRVDLLKRRILSTLHPEMIQFG
SGMYXEKMSEFGKAGSGGIQSSQYENIDRRERTNRVAMESVDVSKDPYIM
TNHLGSYECKLCLTTHNNIGNYLAHTQGRKHQTNLARRAAKEQKDNPNSK
IVPTAAKRLVTRNVVKIGRPGYKIIKQRDRDTGQLSLLFQIDYPEIEPGL
QPRTRFMSAFEQKVEVPNKEYQYLLFAADPYETIAFKIPNKEIDRSTGPD
GKFFTHWDRNKLTFTVQLYFKESTLKSTTSTTNTSTTNM
>DLA_06760 g6073, 
MELNTEIYNSMLSVYPTANGNFIYSNMLPTGQTAYATSIYPSNNNTGTGL
CNTTPTTPQYYINQAYSISPTLTSVKVSSPISSPYTSPLPISTTNNNNNN
NTTNNQCINSNINNNVSQSSCSTNIEISSSTNSIYTSPPMNSANCSPMKT
NNNKRDRDQMSLSVQSPSSQTNSPRKSITSSPTLKPQSPPTSASTTPTFN
SLPVGLPSNGVVQQQQQPPPPQHQMYPQQQNYEIHHNFQMVQQQQQQQQQ
QPQQFILHPSEKSQISSVIPQQQLSYDDMFLSNICPIAYQIIDVPQQPPQ
LQTTPPSFYLYQAPIIASQTPILAGHQLDTHDQHVDKKLKLDVSPSPSVA
YYSTQQPGTPTMISTPTSHIHVEKACGSCLTLVNKFMVDPQSIIHCPSCD
TIYHRNCLMFNSNTHWYCNVCSQYQYLYIQQQQQQMVPPTPTLPIQITQS
TLSSSSGMPPPLPTQLSNQQLNRSDSDDDTDQQDSDQVAANCDDDDDSEE
ESDDDVSDDDSDSSSEKKSKISHSSNSTLTSSTSTVSKKKKSSKKSHKSM
NGDVKAKGHWTKEEDEKLKNLVDVHGTKRWKYIASLLCLRNGRQCRERWS
NQLDPTIKRDAWTLEEDRIILEAHSKFGNKWAEISKLIPGRTNCAIKNHW
NSTMKRKLSKKQYDDILLPNSNNSINMSPNNSMEIPIVCTTKTDFANIAV
VTNLNATTTSPTIQTTPYDNILHHPYSLSLMDENSSSSLMDYQQAPQTIP
VQLDFANSNNNNNQHHLNDFVHNNSNNNSNNNSNNNNNNNNNNNNNNNNN
NNNNNNNNNNNSSNVFKSTTNFFLEQKQYFENLLSQQQSSPLTSAVTTTT
TNLNIPTINNNNNINNNNNTPATTTTTITTINNSPSLSSPRGKSKHSHQI
LNANSNCWICESITFLPPKGSDFKSNKQHPLSREHCQYFEIASPPLDPTM
EKKYYICHAHYNSFRRRSNSGKLDSSNNSSGSPLITSSNGGSSYFDPQTV
EDQVIQQMRSQKQWNDLESILKMKNENTKSDTGKIDLVQLYQTLIDTQST
PLEECFLKLYQVFNIKAKSKKKKSDLSSSAASTNSDKDSTDEPNKKLIKN
NIKFKIKNLLVTFPHLKYYGSEKEFQLQRLQKVPEILLVIDNPYLKKKFL
E
>DLA_06993 g6279, 
MGIDSLNSFLAGQFPKFKCTTTNFEQTDHAYVDLNNLCYMNSGKVKLNYN
QFFFKLIPRLRLFSTVLAPKKTLFLSLDGPGPRSKMLEQRKRRWKRSDKS
NFANSINQMNVDDEEDDDNDRFEDDIDSDRLLREEELEDFESESSDEINS
SPTTDNNCNGEDKMNVDRQPIVNILEDKEPNTFILRHGEEKFVSNNLTPG
TEFMGALKDFIEKFIVKEFTLKRKILDIYFSPADRAGEGEWKIFQHLNAQ
NYDPNDRILIYSNDTDLIISSLLSKKNIVIVSRCMGKKYEIIDIPALRKM
IIESVRDFEQKDPDRIIDDFVFMAGMCGSDYLPKFTFFSSEKYWLSYKKL
KVDEYIYDSNSQQINIENWKSIMSHSLFLPNWFHKPTTPNTNTNTINTEN
NNNNNKDMRPEHIPEAVYSLHLNNYINEKFLKASRIVQASLFPGNTDYTS
TIYFNHNEMENCLDLVVGHQVVDSEPMDYSNPFGRAKSEKNIMKNLKSRF
TDCSHPFWLENGSKILPTLNDRLQEVFKMNPHELVVKKFKVVENPNEKVN
NYLYALVWQMKYFMGNCNDFNYYYPYFTSIEMDEIKNFIFIPNYSPTKIA
QPLKPLYFGFVVLDSSYPEFFPEAYHPIYKTLPHYGKAKDANNELLEEDG
VRVITNLLDKYTEENKSNFTKYQISQLQFQPTIKFSFVRNFILLQIETPK
SYIDISVPEFGPPRYMLMRNINFNSTLSEHPFYTNEEINRFSYSKANLFR
DRPRLNNSNINIRITTNTKTNFNSNTNIKFTTNPINTSNTKININTSSTK
IITNTNNNINSNINSNINSNINSNINSNINSSNTITNTNINNNINSQLTQ
KEIEHILVLAKKRDSIQKSINNGTAQPYSTKKLNSTNDKIQKLINMRNVN
TDVQALKKMIESNNEKKEGWDSKNIKYELDKKNLAMELMKQNTIITPIGF
KAILDRSDAALAGKPHPKDDETSTLKRYPSLRAKQQTEFNLILSKKNKIL
SQLNSVDNQSLSTLSDDDKNMLEKLEIELFDLGCICPSVFKVSTNTQNND
QKVTVDSAKPNTMNVDNAKPKPQKKEKKLPKIKLTTTTTTSLETKSVIVD
SNTTPTKTCATSPKDNNQKPRKSRAKKQEFHKRTLESTERDINDSKKIKI
SK
>DLA_07314 g6563, 
MESDKVELNSNSNGTNVNKKKLKFQKKKEQKKKQKLKKILESKTSTNGNH
IHSNGNGSVEKKYESDDIGFKLDENDPTFDFYSKLLNHFDGNDQEQSQQQ
QQSTDTIDKSEKDNEKSNEIENTDNKTSENGDDSKPTEKDKEKKLSNKER
KRQQRANLPVLKQLVDRPDVVELHDVNSPNPGLLIGLKSTRNTISVPAHW
CQKRKYLQGKRGYVKPPFELPSFIAATGISKIRDAILQKEQKKSSKQKQR
ERLQPKMRSMDIDYEVLRDAFFIHQTKPKLSVIGDLYYEGKEFEVSIKNK
KPGQLSQELKRALGMQDNSPPPWLIYMQSFGPPPSYPNLKVPGVNAPIPE
GAQYGTHIGGWGKPLLNEFGKPLFEAYTSQQTNVVTMNDNGEEIVREYWG
ELIPEEEVEEEEEEEDQQQDDQENQQDGNSLEEMDQSNDGQSSVPSGLET
PDLIDIRKHRMNMDPNGPKQLYQVLEQQNVNNNNNSGFMESTHRYQIPQV
IRNSTSQSSARGGGATGNRVDIIKSHRTGPVDINFNPSELENLNEISEDL
IKKKYEQAVAATQDNYRNKQSKDDYSDLIEEQSKKRKQQLQKEQDKLKKF
KF
>DLA_07524 g6756, 
MQNYHNQMNNVHENQQYYYLQQQQQQQQQQQQQQQHQQEDDIKKEIDSVG
ASYPPIILKPQAILQNIVDNPQQQPQHKSPELDNKKQTENLKNENPPQPT
NPTVKSSSSSPIINSNSSNNNTTNTNNNNNNNNNSNPTTTTTTTTTTTTT
TTSTTNNNNSIFEIPDILLTPGTRSPYIQSLSNQIISIILGKTHRDEPTS
IIFTNISSVCKLWRQVSVERIQSYIYHLPADKQITNFFHNISHNLYPKLT
NLQFKVTTPTSSTFDVSNFVKLLMSNNKIIVNLELSQNGIGNKAAHCIGA
CLLENQTITSLNLSFNSIGNEGAEEISKALQVNKTLISLDLSQNCIGLKG
SKALGTALQSTIVLQTINLSKNRFGTKGIDSISEAIGKNQSLHSVDFSKN
DLCEKSAKIIGEAIRKHPFLQTLNFCDTKLSAEGVKYIAEGIQGSQTVSY
LDLSRNEFGYKGLKPIASALAQSHSITYLDLCGDIIGDKGALMLAEAIQT
NNTITNLSLAFNSIGYPGAHAIGRAISVNTSLQNLNLSINAEIGPNGAYS
ISEGLCFNKKIHTVNFCTTGFGPQGGRYLGDALRFNNTLTDLQLRGNEIS
DEGCKAISDGLKQNTSVTEINLSGNGIGNEGARQLMEALWFNHSLTSIQL
THNNINPSGVQYMKEVLQQSHLVNSDSYFHPPNSTTTVSCLYVTRSNNTI
CRIVI
>DLA_08046 g7211, 
MNSVSTPPIIASTTAPISTNGEISKPQYYGVTEPISLSFPTSVDLKFSQD
LENTLKSFGLFESQEESKKREEVLGKLNQIVLDWAKKVSLKRGFTEQMAA
EVVAKIFTFGSYRLGVHGNGSDIDTLCVGPKHIMRSDFFDDLSEILRVHP
EISEFTAVKDAYVPVMKMVFLGIPIDLLFARLSLASIPEDLNDLIDESYL
KNLDDKSILSLNGCRVADQILKLVPNIPNFRMALRCIKLWAKRRAIYSNV
LGLLGGISYALLTARICQLYPNAAPSTLIHRFFKVYEGWKWPAPVLLNHI
QEGGIFAAKVWNQKKDKGHLMPIITPAYPCMNSTYNVSRSTLYLLKNEFI
RGAEVTRKIEKNEANWSLLFEKSDFFTRFRFYLQIDAISANDEEHIKWEG
WIESKLRFLILNLEQTPNMKNAFPYPKCFENKVSQTVPTGFKCSSFFMGL
AFNFTGENKSVDLTKAVTEFTAMIKATDTKTPTMDMKIHYIKKKSLPVFV
KDESPPEEPRTGNAKKRNIKDISVAAAAAAASTATTTNTPPTLTSPITTP
ATNNSLEAINKKLKSDLGEPIVSPSSTAAISSTITSISPSSSSSSIPIPP
FSVSTPSPISRSTSSSDLNLQPISNITTATITTTTTTVDTQMSDANTSID
NISADITSNQDQVQMTKKINTLEVNELDFISGNSVTKEPKPSMKKPGISL
IRG
>DLA_08247 g7376, 
MGVPRFFRWISERYPQILQKVLESNPPEYDNLYLDMNGIIHACSQETSKL
LSFSEDELIRQVCNYVDLLFHTIRPKKLFYMAIDGVAPRSKMNQQRQRRY
LAAFNEEKTKRELLNAGKPIPEVEFKRNCITPGTPFMHNLSEALQFYIQK
KISEDLSWREVQIIFSGPENPGEGEHKIIDYIRKNKASPDWDSNQTHCIY
GLDADLILLSLVTHEPNFSILREEISFKKESSKKPKKEKPVDFQLLHIPI
LREYLDLELRTDNLSFGYDLERIIDDFTLIMIFFGNDFLPHIPMLEISQG
GLNSVLELYRNSLEDLGGYLTNGAEIDLDRLQGFMVKLKNFEQSQKVLPD
SKEQEENEEMIEDMLLEHDSLEEGEKKRLEQLAMDRLKQHFSDIHVSEDS
TTDRDSPSMYENNYYRVHFQDFPETYEEIKKFKQDLVLNYVEGLSWVLNY
YHNGCISWNWHYNYYYAPMAGDFVNVPQLLIQFDYGAPVTPFQQLLSVLP
PQSSELLPECYRYLMTSQSSPIVDLYPVTFEIDAQDPHYFDGIAMIGFIN
HQRLIDATFDQSQFAYTDKEKQRNTLKNAVIIYHDDDIKKHMVPSPNNKI
FSDLAESTATTEDIILPIHDSGLKPFRYCDDVLTGVHGPPGFPTLMTQKF
QWTMRAGVIDVWGMRTKKDSFIITLENPHIRQHLNREISSTEQLKQLAQQ
FMSKKCYVNWPYHTEAKIIGFSTIDQMINSEGQISQFSASQKLMFVDEMK
DMKSKYLQHGIDLYEPGKKVDYNYCPGIMVHVNKLVGVDSLPGGGTKKRY
SDAESVYPIELMVDYSSLKSDPRFEEIVNVPFESRYPVGKKVIYTKRDKY
FGCVGTVTMAYDNQLKLDLNVPSQPIDLEFGHSVAQMDERYFAINEVAKL
LEMPISHVNLLTGGLYIAKPSSDIGLNLKFAGRQQQVQGYCRGIVVARTP
EGHVQRKWEFSKLAVDLMQSYLKEFPIVYKILQYYTTNKAEVPTQNGYSR
TMVDITPLFSNNEEKTATLLKIQEFLDRSEIRKKRIVSCDTMSLSKDLIQ
KIQEHYLAISEKCEMTVQSIHTIADNVNDPLSYESIISYERQQNHQHPGN
THGQNSPGNKSPKFSSSTPNNNHSNNSKKPSNKFRIGDRVLTSLEKGNVP
FGRLATVVAVNDTKVDIVFDQECFSANSLEGFCAEKRGLSISTLRLYNLS
NPFSLYQKSNHHQRGYKQKSVDPHEFWEKLNSSKDGKLPQDNESIPTHKL
HTPEQVEERLLNTSNQEEEQLSWQQLEMLNGISGSQSNNNNNNNNNNKNQ
QSNKNSTYSTNYVANTLRTSSGQQKNVNYLMKRAPQYTQNNFSSEKEYTK
QNPKLPTNFYYDQNGNPVQPPPQKQKKQKSNKVNVEKQHRPVENVDPKKQ
KQLDFIFQNIKADQNNNSNGGSSVGTSTTPSTSSSVESNPPQTAQAALLL
RDIFSSTQPEHSIPPPQYAYPHYPYPHPPPPPPHFQQPYPYPPQFPFPHH
LHTSNNNHNNNNNHKLNLNNQNNQRNQNNQKPSNHQIKNFNLKMHKLKLN
NHQNQKHQNQRMVQPKRNNLQKIESINQKFLHQNHKIHHQLNNNNLNSSA
PSYHHLINNKKKK
>DLA_08495 g7593, 
MEQQQNQTQQQQQQQQNKIENQIYELKKQQELRFEVEFDVKGTIKLIEGS
AEYFGTELSLGKTYKVTSSKGAIFTWSGCKIEVSGNVVSYIGQETPMLLY
AGIHKIIDDKRTEILDKPSESGPRVIIVGPTDAGKSSLSKMLMGYSCREG
YQPVFVDLDPGQGSITLPGTICASLIDKPIDIEEGLSNSVPFVLYYGHTS
LDVNPSLFKAMIASLASNVERRLETSEIARASGFIVNTCGWIDGLGYQIL
LDSIQTLKANIIVVMDNEKLYSDLANQFSSGGVVVKKLPKSGGVFLRSPV
FRKKTRMSKIREYFYGINGDLCPHNVVIEFKDVVIYRTGGGPQAPMSALP
IGTQSQIDPLQLSEVSPNPDIIHSILAISYTKQPQNILKSNVAGFLYVTE
VNMETKKITALAPCSGPIPSKYLLLGTLKWLE
>DLA_08534 g7631, central component of the U4U6-U5 snRNP complex contains the PRO8NT PROCN PRO C-terminal and Mov34MPNPAD-1 domains found in pre-mRNA splicing factors of the PRO8 family
MTDVTPMSDDKLLEKSKKWIQLNNKRYSEKRKFGFVDAQKEDLPAELLRK
IIKDHGDMSNRKFKQDKRVYLGALKYMPHAVLKLLENMPMPWEQVRNVKV
LYHISGAITFVNEIPLVIEPVYVAQWGSMWVTMKREKRDRRHFKRIKFPL
FDDEEPPLDYQDNIFGCEVEDSIQMDLDPEDDQAVIDWFYDSRPLMRDQR
YVNGPSYKRWRLDLPIMSTLFRIASPLLSDLTDPNHFYLFDDQSFMTAKA
LNMAIPGGPKFEPLFKDTNMDLDEDWNEFNDINKLIIRHKIRTEYKIAFP
YLYNNRPRQVHTPHYHSPNSCYIKSNDPDLPGFYFDPLLEPIPSYKTAGN
YTSNANSEIGEDDDEFTLPDHVEPLLNGYELDSLNTPSGIRLYWANKPFN
TRSGRTRRAEDIPLVKTWYQEHCPPKHPVKVRVSYQKLLKCHVLNQLHHR
PPKSVNKKNLFKALKQTKFFQTTEIDWVEAGLQICRQGYNMLNLLIHRKN
LNYLHLDYNFYLKPIKTLTTKERKKSRFGNAFHLCREILRLTKLVVDTHV
KFRLGSAEAFQLADGLQYLFSHLGLLTGMYKYKYRLMRQIRMCKDLKHLI
YYRFNTGAVGKGPGCGFWAPMWRVWIFFLRGIVPLLERWLGNLLARQFEG
RQQTVAKTITKQRVESDYNIELRAAVMHDILDRMPEGVRANKSKVILQHL
SESWRCWKANIPWKVPGLPVPIENMILRYVKAKADWWTNVSHYQRERIKR
GATIDKTACKKNLGRLTRLWLKAEQERQHNYLKDGPYISGEEGVAIYTTT
VHWLEKRRFSAIPFPQTSYKHDIKILTLALERLKEAYSVKSRLNQSQRQE
LALIEQAYDNPHDALATIKRHLLTLRTFKEVKIEFMDLYSHLVPVYDVDP
LEKLTDAYLDQYLWYEADKRQLFPNWVKPSDNEPPPVLIHKWCQGINNLD
EIWETANGECLVLMETQFSKVYEKIDITLLNRLLHLIVDQNIADYMSGKN
NVVINFKDMNHQNGYGLIRGLQFASFIFQYYGLILDLLVLGLNRASEIAG
PPNLPNTFLTYKDVETETRHPIRLYQRYVDRIHVLYKFTQEEARELIQKY
MSEHPDPNNENVVGYNNKKCWPRDCRMRLMKHDVNLGRAVFWNVKNRLPR
SMTTIEWEDSFVSVYSKDNPNLLMAMCGFDIRILPKCRTTTDQIVPNDAV
WSLQNVNTRERTAQAFLRVDQDSQERFENRIRMILMASGSTTFTKIVNKW
NTSLIGLMTYFREAVVSTKEMLDLLVRSENKIQTRVKIGLNSKMPNRFPP
VVFYTPKELGGLGMLSMGHVLIPQSDLKYSRQTDTGITHFTSGMSHDEDQ
LIPNLYRYIQPWEQEIKDSQRVWAEYALKYEEAKAQNKNLTLEDLEDSWD
RGIPRINTLFQKSRHTLAYDKGWRLRTDWKQYQVLKLNPFWWTNQRHDGK
LWNLNNYRSDMIQALGGVEGILEHTLFKGTYFVTWEGLFWEKASGFEESM
KYKKLTHAQRSGLNQIPNRRFTLWWSPTINRKNVYIGFQVQLDLTGIFMH
GKIPTLKISLIQIFRAHLWQKIHESIVMDLCQVFDQELDTLEISVVNKEA
IHPRKSYKMNSSCADILLRATHKWQVSKPSLLNENRDTFEGAITQYWLDI
QLKWGDFDSHDIERYSRAKYLDYTSDSMSLYPSPTGCLIGLDLAYNIYSA
FGNWFLGVKPLVQKAMAKIIKSNPALYVLRERIRKGLQLYSSEPTEPHLS
SQNYGELFSNKTIWFIDDSNVYRVTVHQTFEGNLTTKPINGGIFIFNPRT
GQLFLKIIHTSVWEGQKRLAQLAKWKTAEEVAALIRSLPVEEQPKQVIAT
RKGLMDPLEVHLLDFPNIVIQGSELQLPFQECLKMEKFGDLILKATEPKM
LLFNLFDDWLNTINSFTAFSRLILILRAMHVNMERTKIILKPDRNTVTQP
HHIWPTLSPDEWVKVEVSLKDLILADFGKRNNVNVASLTQSEVRDIILGM
EISAPSQQREDQIAEIEKQKKEASHLTSQTIKTTNIHGETMISTVTSPHE
QKVFSSKTDWRVRAISATNLHLRTNQIYVNSDFAKETGFTYVFPKNILKK
FITVADLRTQIMGYCYGVTPPDNPQVREIRCIVMPPQWGTPVFVNVPNQL
PEHDYLKDLEPLGWIHTQPTELQQLSPQDVITQSKIMSDHKSWDPEKSIV
ISVSVSWPVTLTAYRLTPQGYEWGKSNKDSLNYHGYQPQYAEKVQILLSD
RFLGYYMVPDRGSWNYNFMGVKHSASMTYGLKLDYPKNFYDDCHRPSHFQ
NWSLVSDTSSTTTSTTTDSTENQGPDSENLFN
>DLA_00904 g824, 
MGKNKHNNQKIKQSYKKHKEEPTTQSNGGDASKNEKLTWDQVKPDFEIDG
SLMEGGGQILRNTVSLATLYQKSIKIEKIRYNRDQPGLKMQHRTGIELLS
QLYKADTIGCTHQSTQLYYKPTKEHVDQVEIDADTKTAGSIGLLIQQTLP
CLLYSQHETKMVLGGGTNVDFSPHADYIVEVFQPIFTKHFLEGTNAQMDM
SIEKRGYYPRGGGCVKLNIKPTQQALKPITLLDKGNVILIQVKAYTSGRV
TPLVGQRMTQQARKSLKKEFKKVDIECEEIDCTNRSFGDGCFIFIKAITD
TGCIFGGSSIGSIGVPAETVAQNAVDSLVKDLSDGGCVDEYLQDQLIIFM
ALASGQSKIKTGPISLHTNTSIHFTSLITGCSFQIEKVPKEQEQPGEDTF
IITCNGVGFIKSSNNQQTNINENNNVTTTSTTTTTTTTTSTN
>DLA_09295 g8329, 
MPKYYCDYCDKYLTHDSPSVRKSHTTGKQHNMAVQLYYQQFEADFHQEMH
EKNLKELASGKMPIIPQFFPPGLLPVPYFLGPEGAATPPGLFPPPNPQQH
QQQFQQMQMQQLQQQQQNQEQHHMQPQQQGMYPPQMQQHIQQQQQQQQHH
QHHQQQGYQQHHMQQHMQNYQD
>DLA_09379 g8410, 
MEVDKNQVSEWDDSELKKKPSTVSATPRRNRWDETPVSSSGATGGFKGTV
ADTPNNKRKSRWDETPLNVTSQTQATPMYNLGGATPKYDGSQVAMTPNYS
GLVKQTPMIGGQMMLDPQQLQIQRDIEERNKPWTDEELNSLLPSDGYEIL
TPPAGYVPLVTPARKMALASQTPVSGFFIQDENRKQDYGIDTQGPVDGLS
MKPEDKVYFEKILGSGDGEGEENLSPEEIKERRIMKLLLRIKNGTPPMRK
QALRHLTDRAKEFGPSALFNQILPLFTSQSLEDQERHLLVKVIDRILYKL
DDLVRPYVRKILSVIEPYLIDQNYYARIEAREIISNLSKAAGLACMTAKM
RPDIDSPEEDIRNTTARAFAVVASALGIPSLMPFLKAVCKSKKSWAARHT
GIKIVQQIAILMGCAILPHLKSLVEIIGHGLEDKEPKVKTITALAIAALA
EAATPYGIESFDPVLKPLWYGIRLYKEKGLAAFLKAIGFIIPLMDEGHAS
YYTEQVMLTLINEFKTSEDEMKKIVLKVVKQCVSTNGVKPQYVREKIVPE
FFKHFWVRRMALDKRNYKLLVETTMELANSVGGGEIVALIVDDLKDESEA
YRKMVMEAIDKIISTLGAADIGPRLEDQLIDGILYAFQEQSDESSVMLNG
FGTVVLAMGTRIKPHLTTITTCIKWRLNNKSAKVRQQAADLISHIAVVMH
ACGEEQLMSHLGLILYEYLGEEYPEVLGSILGALKSIVNVIGMTKMTPPI
KDLLPRLTPILKNRHEKVQENCIDLVGRIADRGADFVLEREWMRICFELL
DMLKAHKKGIRRAAVNTFGYIAKAIGPHDVLATLLNNLKVQDRQNRVCTT
VAIAIVAETCAPYTVLPGLMNEYRIPELNVQNGVLKSLSFLFEYIGEMGK
DYIYAVTPLLEDALMDRDAVHRQTACSTVKHMSLGVVGLGCEDSLVHLLN
FVWPNIFETSPHVINAFLEAIEGLRVALGPTIILQYTLQGLFHPSRKVRN
IYWKVFNMLYISSQSSLIPSYPKVLNEGPNTYQRYELEYII
>DLA_09523 g8542, 
MSMKRNVDSIYNNNNQNGNTKNIKRTKPQEEDDPLDSILGEIKTTQQGKY
NNSSRSTFSLENVISKIMEMTNNGSNCEKLEVEGRVGLIQPGMNGGINFK
PGMIQDDWERLREYLASRLSDKQLIKETDYIYDNHRVTYSEDQKKCIRKE
AKTSKITYDQSSSLIYDFRISLCWEESSPPPLEVPTDWKSKREKMRYTFR
DRDWKIDLTRTMVYDQFSQIIENPYEVEIELYPQSIKSCIGNRNLPVMMG
NFIQEVRNLIAIIQPPGAMTFPDVLMEKVTVPKEIDQLRDFVFAYLPEAN
KFKYEMFPGSMPINFGKKHIYNVQSNEYYVSEKTDGIRYMLLILASGSYF
IDRKFEFYQIQNYSVLDETFGNGTLLDGEMVRHLQNRKPVFQIFDILGID
NQSVCQLPLSERLKIIGAKVIQPLRQVLPPNTEVPFTLLGKVFLPKHKIA
DLFARIRDHHSGERIFSDDDKRNHFTDGVIFTPNTAYMPYTVQNLYKWKY
LDKWTIDFKVTERNRVWYLCCVGSGNVEVECREVNFSQEDLDRLKKEFLR
ARDISCIIAECSFQPKYGTWKFHQVRPDKRKGNYISIVMDTMESIAENLS
TEELKYRIPLKPDQDQWDEEFQKLRSTMLLNISKNQQRK
>DLA_09626 g8633, 
MGIPTFFRWLIDKYGNLLSETIEPREADGSRSRVDFTLPNPNGEYDNLYL
DMNGIIHPCAHPENGPQPTCLQDIVDSIYEYLDLVFAIIRPRKLVYMAVD
GVAPRAKMNQQRSRRFRAALDSRLSREKEEREWRLRINSGNATEEEYEDF
KKQKSLKFKFDSNCITPGTEFMNHVALSLRSYVDEKISTDPAWKDIKVII
SDASVPGEGEHKIMDFVRHQRAQEGYNPNLKHVIYGLDADLIMLGLATHE
VRFDVLREFIPKMKCHKCHQSGHFSVNCRTIPEDIEEPSEKEFLTKNYQI
LHLHLLKEYLELELKVNTPNFKFNLDRLIDDFIFLCFFVGNDFLPHLPNM
EIRGGALDRVSKVYKQLLPTFEDYIVDKGDVNMERLSSIFQELSKSEIEL
IKRNANRERQFLQRKQSMARISEPPKSLDLNQAVTFEHRQAANNLLAEIF
QPVTDIKDGTEERPAKQLKSNKQAALEVKQEIKAGGKKSTIMISNKEAAD
IIKNQLLQKQQSIVEKESKKGSKKRARDSDEEEETTKGSEVPVNTAAFKE
KDDETRSIFFDNSRNIRYEEEGWRARYYEAYFDIKDEESDKIKDICKSYI
EGLIWVLRYYFRGCCSWGWYYPYHYAPFIADLAKYCDEIEYPTYSLGQPF
KPFSQLMSVLPTASSQFVPKPFQTLMGITSDGKETLDGDSPIIQYYPLEF
KIDRAPHQPEYKGVCHLPFIDETLLLPTLEKYESLLTQEEVDRNSLGHDI
MFFSKDDQVSQQYLKLKESPKVVHFTIENSEILGFVCENSELIQKHMPQL
NKSMAYKYNHPALPKGYTFKYTTLKGAIIPAKTITQLKSQSMNQNSAADR
LVNGAQHSSQYKNNQGYNKQHGFQNNQNNQYNNNNNYNNNNNNNNVNYNN
QQQQQYNNNQQQYNNYNNYNNQQQYNNQQQQQQQQYNNAYSNGYNNNQQF
NNYQGYDMYQNYNYSGNDMQNYNNNQQFNNYNNTGNNFAAYNQQKFDAYS
NNQNNYLNNQNYQQYLQQQQYNGYLPNNSNNFNQQNQQNQQNQQQQQQQY
NNYNGNNNNKNHSNNGKPQQQKNITQNQNNAKQPMKYNPFGGSNRK
>DLA_09996 g8974, 
MSDWAEATTEDGKKFYYHKKTRKSLWDKPEEMIQYENALNSAKKSATPPW
ASSNSSVANNNQVSSGSVNNNNSSSNSSNSNNNGPIWKEYITKDGKKYYH
NLVTGHTSWDAPDFYQPAILPNNVQQKQQQQQQIQQSSGTNMNQSSGSTS
GREIFIELLKENEIGTTWTSDRAFRLVATDERYQALKTMTERKLVFSEYI
AEKKRTEMEEKKKKEKKNRDDYLALLKETPEINPLTTWRHASLILDGNPR
FEALDSERDREDLYKLYLDDMEKQEKDETLEKKTENMKLLKQKFEQNPSI
TFSTQWRKVRDEYETDPLYTSLDNFDFLSVFEAHIRELEKKQDDLKRLER
EKQKRESRKDRDAFRQLLADKYQSQELHALTRWKEFHSKIQNLPEYEKLS
QQTSGSSPLDLFVDFKEDLEKKYEKDYKKLKDIVRALDFNYVPQQTTIES
FKEAILKHEKISTVSPQNFLPFLEYLRYKEESKEKSLAKKKKKQQKHFQQ
LLSDQRNINAESTWQQVKQQIQNEKYFEELADEDERERLFNQHLEYVKKY
LEENPPSTATNSNGTIELEDGEEGELIEGESNDLKKKRPHNFDYNNNNNS
NRMDYDQFYDRNYMVYDIDDRPFKKDKRR
>DLA_10273 g9222, 
MVSFFIVPETLVVNVPTYICYQQTNSTKTVIPESAIKSIQWFVNDKEVQH
YPSVSDDHKEPKKESLFQKIFKLGKEESGSEIKKNWFVPDSSHVGGTLKV
RVVLKIKELHEKSEHVVEVLSNGTFTTHPTRQMYIYNSESDENDFGFTTY
NIMADCYTHPGRYKTPEYALFRPYRKHLMAKYINFYKSDIVCLQEFEPEF
TEVIKEMDEEGLTSTPTIIRDGSRYQPPDQCISFYRRSRFHLIQQHIIDY
NIITSSGLISKEQIEKLKSNPVTNHFLEGVLKTNHHNRFSFLHLTDLKTS
KPLIVINVHLYWGAPTEDWNYKLQLMQFYIITLILDDYTAKHSPNTPIPI
ILCGDLNNEPHQKVNKFITQGVYNENHEKDLYTFKHSYKFSSVYANHPMG
ETSFTIATKSYQKCIDYIYITKSNITVKSWLEVGNHYSETLPSVTEPSDH
ILLKANLNLNSEKLNTTEKKIDS
>DLA_10308 g9253, subunit of the splicing factor SF3A required for spliceosome assembly contains PRP9 domain characteristic of splicing factor 3A subunit 3 expressed in pstO cells
MSSTLLERTRNLHENIERYELLIENEMNIPPPNFKETILQNHRVNHYLES
SIDCAKELQKIYNDDDESRKNELESMSGKDIFGSFYGKLKEIKEYHRKYP
DLKDQRNNSSLYFNATVPFLGNEHFGKYLDLNEIYDVYLNLPFLQNRIDY
TSYLSKFYEFQYANIVRMKYPVYREYLEKLYQYLISFLERTQPLFDLNQN
LERYEKEFNEKWDKHEYDPKMEAEGDRKDDDEDGGDTSSPLYCKACRKMF
SSESVFKGHLGGKKHKLNQTKMTSDRDSHSYLNLKQRKPTTFLEFKISKL
GELLSDQTHATKEMVLRKQSRSATEINVEEEPVEDEEINIDDMDTVDEPT
KLKIANYPVDFSGKPIPYWVYKLNELGVEYKCEICGNQSYWGRKTYEKHF
QESRHSYGMSCIGVPNTIHFHEITKIKDALELWAKIKKQNNEKTFKSDRD
EEYEDENGEVMSKIAYEMLVKQGIIRKRKNM
>DLA_10395 g9327, 
MNTHNTNYKWKFSNPLKTNWFLKLPNEIIQLIFSQFFNRLELPNNTQFLR
LLLTCQHWNSIATEMYTQFKLDRVKKLPKVGDFQYLKRYKRSLQSIQILG
GQSLHMGYINIIVDIVQINQLTHLNLFSNKMNDYCIIRLIQSLQYQSSLR
FLGLSDTGLTSHTGPYFSDLFKCNKSLREVVLSHNNLGEIGAVAMSKGLE
MNDSLQILNLSYNDIGDIGAREIGRSLQLNKSIQELDLRSNCISPNGSSF
LSEFIELNQSIHAIDLWGNSIGKDGASDIGKALASNSSIRSINLTRNSIQ
SAGIKFITAALVSKNCNLKSIDLSSNSLCSDGAKDLSEALFRNQSIQSIT
LSSNKIDHVGIKALCKALRHNQSVTYLNLAFNEISTLGSRYLRKLLKRNT
SLRCLDLSSNQLGAESVPVIESIQLDQSPLESLILSHNSINITSIQSIYH
HFNVTSQTTPIANLKQLRLEFLSPTNQDKEKAASLKLLNRIHNNLVIKLF
>DLA_10564 g9478, 
METQNENINIKVEPNTDDNTSDITTNSSNDQPIPEIKNENLDSDQQQENQ
QQEEEEEKENKSLVQTEEIKIPIVKKEEEQEDDTKKVKSPTSSSSISTTI
TGTLQPTTSQLINQPKKSIQIEILENRISIDLYDTEAWTLLLGEIQSQPI
DNCRAIYERFLGHFTTAGKYWKVYAEQEMQARNYDLVEKIFFRSLRNVRN
VELWRTYITYIRQHKSQNQREEIIKAFEFALEFVGMDISSTQIWMEYLNF
LKEEKTNNTFEEGQKMTNLRKLYQRAVENPMHDLDIIWKEYDQFENSINK
VLAKSLLQEHHQKYQHAKSVYRERKSLLEGILRNMLAKPPGSSDKEEHQV
RLWRKLLAYEKSNPQKFEQATLRNRIAATYNQCLLCLYHYPDIWYEAAVY
QAETGSWEGSNQFFEKAIQALPKSLFLHFAYADSLEGQKKIPQAKELYEK
LIASVQPVDPLVWIQYMRFARRTERIEGPRKIFKRAKASPECTYHVYIAL
SLIEYYVNQDPKMARDIFEIGLKKFPLETPYINFCIEFLSNLNEENNTRV
LFEKVLLLPNHENKTIDLFWRKYLDFEYRQNQDIQSIQKLEKRYLSSFYS
SNINSLDKSGVLQALNRYKFLNLSPCPSLEIEVISKNLQPSDGDDNSTQQ
QKDSESGQHQNLKEGKGKKQKKDKYQQQQQNESSSTSNISTTSYQNPDNP
EKPTSSTIIPVSNWKVKRPDITNMLPYRGELSKFNSNNNNNNNNNSNNNN
MIGNQNNSPTSRSPQFDIPEFLFPLLQILPASSSFNGPLVDVDFLLMTIK
DSPLPIINPNMGQQIPQQQQSQPQQQPPLTLSPTTSTSNNLLNSPTNVQM
SNISSPTVQQPMQPSSQPQQNPHKRKLEDEDQISQPQPQTQSYSFQNTKP
PTNDLYRKRQASKLSKKA
>DLA_10567 g9480, 
MDIENEDKSMNNFSDISSDETSDSDSSVQHDVEHSSNAYEIDILEIKLRE
NPYSFKEHLNYITAIKKLYMASNCKDLQLFQRLRTSRETFQSIFPLSESI
WLEWISDETSQKNNSEYLQNLYNKALNDFLSVSIHLSYCKFIEKINNNEL
EVIRNQYEKSIKICCNDIVECKKLWSSYRIFEQMVFGSLDNIDKQKNQIK
LIRNIYHGQLSNAHFGIEQTYQDYLVWEQSQEPNNINPQLDEKLQESYKR
ALNETTERLPFEESIKYEPTHGDKHEIFLKWQEYLNWEISKKQKDRIITL
FERAIRIYYNSKDVWLKYLDYILESEEKDNREMLNNLYERLLRSIYWSGE
IWSRYLRFLQKDNRNYQEISAVFEKSLVSGLQSSQDVLVVFNSFIDYCWR
ASRDLIRVKGENYQIAIQSLREQFQRFSEYFKENKDIYSLEELQYYWAKL
ELTEFHSPESFKVIMDQIFNYQTSHYKNYQQVIRHELSLNHFDKCRKLFI
KAIKSVQSIDLNRVWDDYNQFERQYGNLDQYELLLFESNKIFKQQLQQQQ
SNNNNNINNNNNNNDNEKRLLKRQERQDKKKKIKLEKVEDDKVSDEANNL
IFVSSLPYEYDESKLEQYFNNITNNIKECRVVRDKYGKSKGIAFIEFNDI
DSATRSLSLNHPIVIDNNNNNNNNNNNNNNNNNNNNNN
>DLA_10612 g9520, 
MRNNSKGGVWRNTEDEILKVAIMKYGLNQWARISSLLTRKTPAQCKARWY
EWLDPSIKKIEWNREEEEMLLHLAKIFPSQWKTIASKVGRTAAQCLDHYN
KLLDEVQQQQDGTSSERPQRHSEMDPNPETKPAKPDPIDMDEEEKETLSE
AKARLSNTQGKKEKRKFREKQLEEARRLAFLQKKRELKAAGQYLHQKKKI
ERGKFDQSHEIPFFKKPQAGFYQVPDEEIINDPNKDREFIGKRVDQLEKK
KYLEEQEKNNKLEELKKKKKEITNLPGLLMEVSKLNDVQQIKNRKKTQMF
LPLPQLTDDDLEEIAEFERVNGGQELEVQLQQQRTKRTPMQQDNIMIEAQ
NLYNLSVASTPLKGGQTPHLVNTNLQITKPVNSKDSQQTPSVGKTPNPLL
QIAQTPKRKFDSLQDREEIERDNQVQQQKNKSSLLESLRNLPKPKHEIKI
SLPDVEPEDIDMDTQSVGASSTGGASAMELDESEVHIRKQEELKHKEQFR
QRNRSNVLKKNLPRLYEPVEISSSQDLIQKMVSIEMNKIIKNDNQLYPVL
AVNGSGSKQKQKQQQQQPEHQLQYEYYTNKEMDQVNQLINEQIKSSGMNK
DMVLNVILQELDSLQENFQVVPGDNKLVDRSVVTSKQRIETLKMDYEMIV
KDLKSHQMKSQQLEKKLTVYNGGYQNRSKQLVQSIEELYSSIQKANIELN
CYQDLRTLELNSLENRIKSVQHDLYDQVETENHLQLKYSKLLQEKKEILK
NKVTKFLYQEQKNKEN
>DLA_11108 g9959, 
MEALRAQLDEFLGKDRNLLPKDRVKTESHFTDADICKYYLCGLCPNELFT
NANIHDLAPCSKLHIEGCVKQYQNSKDKEVYDYERDWVRLLENLISENDK
KIKKNKERLAANPNENIQDEDLELDRELNQRIEEMDKQIQIYLKMVEDLG
EEGKITEAQQTMEIVEDLKAKKIELQREEMIAHEKNENKKMSVCEICGAL
LFVGDKEKRSISHLEGKKHIGFERIRKVMEDYYKTKNRQPRHFGGGGGYY
NRDRNYHGGGGGGGGGGDGGNYHGGGGYHNRENRDHHGGGGGNRYEPYGG
SNRGQGDRERRTYNFEYRDDNRGNSGYRGGNGNDDNRSYRNDDHHFNNNR
GRDNNSNGSYNHDNYQRSSRDREDDRRKR


# Polysphondylium pallidum, PN500

>PPL_10728 g10191, 
MGVPSFYKWLTEKYPKITCTNYLLENIENNNNNDNNNNENGNGNNNNNHG
NKEKKKLRLNNLYIDMNGVIHNSTHAKNSTTLSPVESDDVCRMNLLKNLD
ELVGTVQPTNLLYIAMDGVPPRAKAIEQRKRRFRSAKDAKDALSKRLPSD
PVFEPFDSNCISPATEFMCKVNEWVLTYAQALVKRMPSVSIVVSDASVPG
EGEHKIIDFIRAHREHWPSDTSHVFYGMDADLIFLGLSTQLSHFFVLRDL
GAQIYCSTCKNNGHLCYECECAVAKKRMNDPERSSRLSIRNIPIQADEEY
IRSFFGRFGKVLNVRLERALTKRPSLTAYVEMDSVESARSVLAYGANYFI
NDTKLSVHYVVEKVVSPATAAGGSTTPPAAADQEDIDNVPLKAIFIPNLD
VRTQMFDVNAFFSGCGAIEDSTFIKSPKDPKIKFVVIKFVEEESAKRALA
MNGVDFYGTSLIIRKSRPMTKEPKAPLTDQQKKQKDQEKEDKKLKKLEVV
NEFLTKADPNTNIDTAYFYLGMAEWNIEKAFETYLAYGKSQHKLLSDTPT
MRDDDSFDFVNLDYLKEYFRYSVLSGLSESVASRVDCNRMINDFTMMCML
LGNDFLPHLPTMAIKEGSIELMMTWYRDWLSSFKDESSTIQYLTDGPNIN
YKPFHKLLLMLSKWESLYFPDKLEKMYKKEIFRLDNLLKTSPHLESHRRN
LDGDINYYKLKIGVTIDKVEETANDICKAFVDGLAWVLRYYTVGCPAWSW
YYPYHYAPLISDLAKFVEKQSQVYNDTKHIVFDYGAPLEPYIHLLSVLPH
YSSKLLPPKLADSIVREPAPLSRLFEERFRIDPNGEEVSWKGVVLLDFIN
TNVLKEHASPIIKDQLSAEELERNKFGKDKYIRFNSEILDINQHLDVVKQ
QSTDREQGKHKLEIDNHYQPLSQLDLLWCERKFYTLSPNTPDSYEQLPIA
HPMINPPTSTLPTEVVISDEQNQFLEWREKQSHVEGSLETLMSSVDLNKS
FTVGVNKKLSTPLDRVLSNEISKTKHIQLVSDVDQQMLIHIQFNKQTKLN
GLKLISTISKETTPKVLKVYFNQQSVDFSSLQSLKPAFSFELDELSCSLI
SDYQAFGQTKIHQANSITLFVESNHSSSDDCKSIIEKIVLI
>PPL_13621 g10431, 
MNYYCSKTFINTTTTTSRFISSYNNHNHQCSFSTININCLSSLPKQSIFE
NSNNKFNSRSSQFLKRQQQQKSIIYKNFYTSSSLSYSDTNSNYNDNNNKM
SSTTNDRANRLIWVDLEMTGLDITKDHIMEMACIITDSELNVVEVGPELI
VHIDDKDLNSMDQWCTEHHGQSGLTEKCRQSKLSIQDAEKQMVEFLRKHV
DKGQCPLAGNSVHQDKRFLLKEMPLFADMLHYRIVDVSTIKELVRRWYPS
VANGLKKRNLHRTLADIEDSIEELKYYRSTVFKQQLP
>PPL_11028 g10461, 
MSEFGKAGGGGLQSSQYDNIDRRERLKQIALETIDISKDPYIISNHLGSY
DCKLCLTVHNNIGNYLAHTQGKKHQTNLARRAARDQRENPNKTTFAPKAR
IQPKKTIKIGRPGYKIIKQRDQETGQLSLLFQIDYPEIEHGLQPRHRFMS
SFEQRVEPSNKDFQYLLFAAEPYETIAFKIPNKKIDRTTGPDGKFFTHWD
RTHLTFTLQLYFEESVNVEDIDPSQQQ
>PPL_11702 g10956, 
MATTTTAASTTTTSSASTTSPTNQITYGVTVPISFSNPTPADLKLSTELE
DTLKSFNLFETPEESGKKEEILGKLNLIVRKWVIDVSLKRGFTEQMSLEV
VAKIFTFGSYRLGVSGPSSDIDTLCVAPKHIMRSDFFDFLGEALKVHPDI
TELNMVKDAYVPVITMIFSGIAIDLIFARLSLSSISEDMNDLIDDAYLKN
LDDQSITSLNGCRVTDKILTLVPSRATFRMALRFIKLWAQRRGIYSNVLG
FLGGVSWALLTARICQLYPNAAPSTIINRFFKIYETWRWGVPGPTPVLLC
PIQDGGIFAAKVWNQKRDKSHLMPILTPAYPSMNSTYNVSKSTLSLLKDE
FARGNQIAQKLESGEANWNKLLEKSDFFSRYQFYLQIDCSAQNEEEHRKW
EGWIESKLRKLISFLEQTPKMKFAIPFPKSFENKPTPAATTTAANGEAVT
DANNSNVCTSFFMGLGFNFSNAPGADKSVDITKAVIDFTHLIKDWAGKGP
TMEMKVHYIKRRQLPAFVKAEAPPEEKPKAKKRGSANSADVAKKKNRTDQ
QQHINSPTGSSTTTTPLLNADSKTSAINKSSDSISSPPIVVATSSTTPKT
ATPISSPKALSPSQQHQQPVVNITNNNGSPVAATSSTTSTSTTIITPVPS
TTGMDTTTTTAITSNHQTNESPTDTTNNTNTESTTTLSPTPDTDNVNNNL
VPVLSSTTNNNNNNNNNNNIINEVDELDFISSSSSNNNNNSNTDKKPAIK
KIDLIRG
>PPL_00003 g11381, 
MPKERAKRQPHFSDHEICKYFLCGLCPNELFTNTNIRDLGPCTKLHDEDC
LKQYNASKDKDQYDYERDWVRLMDQIITDNDKKVKKNKERLILDAAKLAA
EEGLQDTPSELKVAITQMEERIQALLKKSEELGEEGQITEAQDMMTQAED
LKKQKAEMQIEEDARSHDKKMSVCDICGALLFVGDKEKRSMSHLEGKKHV
GYAKLRAHMEEYYKTAKRDYRLPRRDNYNNNNRDYNNNSNRDNNSSNNYR
DRDGRRSDYGGGGSGGGRYRDSRDRDSRDGRDSRGSPYSRDRRGGGDYNR
EYRDRDSRNHREERDRERDRERERDDRSYDRDYDQRDHDRRY
>PPL_02616 g2309, 
MFENNPDSPKDTQTQTPQITWVPYTLKSEDELRFEIDKEAKIKLADGTAE
YFGTELALNREYTLNNVKGAIFSWKGCKIEVTDNVKAYISNGTPMLSYAN
IHSIMDQHRMSILSQKNQQGPRVLIAGPTDVGKSTLAKILMGYSARLGYN
PAFIDLDPGQGSITLPGALCASLIDRPVDIEEGLSNTVPFVQYYGHTSLD
INPTLFKAQIQSLGISVDKRMEQSDNARVSGMIVNTCGWIEGLGYELLRE
SINLLRINIIVVIDNEKLYSDLSREFSSGGGNNSSSGMKVMKLPKSGGVY
LRSALFRKQTRMQRIREYFYGIQGDLCPHITIVDFKDVCIFRTGGGPPAP
STALPIGSTSVIDPLALQEIQPSPEMLHSVLAISYAKNSQSLLRSNVAGF
LYVSDINMETKKISFLAPCPGDLPSKFLLMGTLKWLE
>PPL_02883 g2563, 
MPKYYCEYCDKYLTHDSPSVRKSHTIGKVHQQAVTLYYKQFEAEWFKSQM
QQKGGQVPMMPPFGMQPGLLPPNMVPGQFNIPMMPPGQFPFPPPPGQPMG
GMPPHQQQPMSFNPHHPYPPPHLQQSAQQFNSNSPPSNNDQ
>PPL_02974 g2639, 
MYCNMNHLPPHQQQQQHYHQPQQMIYHPGLQMATAHNNMNREYGESTTTT
SIIPPSVTSCLTPTMSSQVVSPIIVGGVPKRKLEEEDFSLIKNQPLLVSS
PLLTPVSQSPGLTSVQIAFQNASLSNPPTPLTMSPSLSPSAAPMSPSKKS
KNSRSSGKSKWNQGSSDDLSRWQKTKSPGIVKGPWKEEEDAKLVELVQKN
GPKEWSTIAAKIPGRIGKQCRERWFNHLSPDVRKTNWTPEEDKIIIESHL
ALGNKWTAISKLLEGRPANAIKNHWNSTLIKRIGADGKSHQPSPSKDLKD
DEDDEDEDSETNSPALSPISLYPTDPSSAANHAHLTGTPVQTTEQQQYNI
PPFILSGNHQVENNQMLNTNLPNDLYRQGTIIAPQIIRLQQQTTPSSSPQ
MNSQIKKSDPSQNLQRQIPANQSPQLHHQQQTHQIQQQPIQQQSVQQQQQ
QPIQQQPIQQQQPIQQQQIQQNTQQHQQYNQQQYNQQQALHQQQTQQQHQ
QQYNQQGQQQFNQYQVSHEVPYYNQSFWGQPTAENIAAGDHLVTFPQNPF
ISDFNFEHSDFLFFDHGDHSQQNNIKNIDTNQNQSQQPQQQQQQQQNNSQ
NNYDINNLFNVEI
>PPL_12977 g2705, 
MVTLATDEIREVWAHNLEEEMAIIRDLIEDYNYIAMSEFPGIVTRPVGSY
RTSSDYHYQTLRLNVDLLKIIQLGLTFADSEGNLANHTCTWQFNFKFNLN
EDMYAQDSIDLLSRSGIEFKKNEENGIDVLDFGELLMSSGIVLNDKIKWI
SFHSGYDFGYLIKLLTCTALPVEEPDFFDLVRTYFPCIYDIKYLMKSCKN
LKGGLSELAEDLDIKRIGPQHQAGSDSLLTCTTFFKMRKMYFENQIDDSK
YQGILYGLTSSFTQDNSSSNSSSSNSSSNTTTTTTTNNSSGTGTSNSTNS
TPNSSHQSISNYSLLQNITQSISPLSSSASSTTTTTNNNTTNAMNGHVIS
S
>PPL_03919 g3458, ortholog of the conserved splicing factor 1 binds to the intron branch point sequence (BPS) of the pre-mRNA necessary for the ATP-dependent first step of spliceosome assembly
MAVEQQDNIGSDFNDDEDDFFRQINEIQNDYERGRPRNREEIKEEKRTRK
NKWEPEKTQLGLPGVPKSLPPGLTDDQLASLIIRIRIDEITKKLTTGPID
IDTKDDRSRSPTPVYDNTGKRTNTREQRAKDKISKERHNLITQAQQINPQ
FRPPADYQPPNEKKTMKIYIPVKDHPEYNFIGLIIGPRGNTQKKMEKESG
AKIAIRGKGSMKDGKSTKPQYNENDELHVLLTGDTQEQLEKAAVLVRQYL
VPVEEGKNEHKRQQLRELAEMNGTLRERPTFFGAGGKSWQPVDIKCIHCG
EVSHPSSDCPLKGQDHNMHIIEAEYLKFIEEVKDLIDLNDRVVDPYDELK
ASINNNNNGNGNVQNENNNNQYQHQQQQQQQHSSPPNHQQQWNQYSNNNN
NNNNNNQYQQQQHHSSPPYEQQQQQQQHWNQQQQQQQGGYQQQQHHHHQQ
QWNQPKQHFNQNNNSSPYGPQSSYY
>PPL_04210 g3727, 
MKSTIKGGVWKNTEDEILKVAVMKYGKNQWARISSLLVRKSPAQCKARWY
EWLDPSIKKTEWSKEEEEKLLHLAKIFPAQWKTIAPLVGRTASQCLEHYN
RLLDQVQARNDAANPDASGAAAGDDPRRLRVGEVEANPETKPAKPDPIDM
DEDEKETLSEAKARLSNTQGKKEKRKFREKQLEEARRLAHLQKKRELKAA
GIIVHDKKKAKEKRFDYSQEIPFYRKPMAGFYDTAEEQKQAPDKDKQFIN
QRLDKIDGESRAAELERANKLEELKKKKREMTNLPDAIKQINKMNDPEMT
RKRNKLVLPEPQLTDDDLQEIAEFEKQSKSYSAASGDGSGGELTATTALV
GGLMRPPTEVPQSRLNMVARTPMREDVVMMEAQNLLAMTTAQTPLKGGAN
PVLNPSDFSGVTPKPQNMASRTPLRTPNPLAQGMTPRQQKQQNNEDAMAT
KHSIANGLKNLPAPVNKFQISLPDEPTLEEIDEDGQQILDQSEQEIREQQ
ELKHKEQFRLRNRSLPLKKSLPRATTLPTQTAGLAIIGKAEEEQQQQLLQ
KEIDNLILNEMCGIIKHDDKCYPLEGGTNDDNDYEYFSEKELKEAQKLLV
TELNVVKQEQQEQLDEKELIDKFVNIWTSVRDDYVYQSNQFVERASLSTE
QKIGSLKQEYDAIVNAMKTSAKKAQLIEKKMTTELTSYQASLAKVLKQID
EISQQIEQSSIELSCFQQLRIIEQRAIESRVKFVQNQVYDQCDRENRNQM
KYSKLINERNTLLSQQQQQNGNNNNNNNNNNHPK
>PPL_04213 g3730, 
MEEEGDVCRVCRNGPTTNNPLSYPCKCNGSIKFIHQNCLLDWIKFSKSSA
CELCGHPFRFTPIYSENAPDVLPIREFILEAIIRLSGFLKRLVRVLYVVF
CYLFLVPFFTSWSFQTYFYLKLPDSIYDVNTIARDFFLGFMLFFWIIIVT
ISSYLIFDILDHKHSELDLEENLDNNNNNNNNNNVNNQDDGDDDDDATDV
EEEDDDEDIRYNQQEWLGPQHQQENQHANNHGQPLFGGVFEVFQQHQAPQ
APQQVPANGVDQVDQMDLNTLIGLSGPKLEVIAKGVCLIIYNTIFLVVFL
FIPYFIGYLSTNAVSTLFDIQLPASIISKYLLNISIGYIIASTLTMFILS
NFIKNFIYYKYSRIFYSFIKVCIIVILEVGVFPMLFGAFIDYASMELFGG
TFDTRLQGSLHHILPFIITRWGVGFFCIINISSLCKILHQIFRRKVIWFL
RDPNDPDLDVIKDLVKVPFVKHLININLSLLIYCVVTILLIYLPLKALSL
IPNLLPVDFGDPLNKVGIGADVIFFISTFYFPKFHPQLTFTNFIKYFNNI
ITRTLGIDEYILLPPAIPNQQQQQQQQQQQQQQQQDGQEQQGAEQPPQPQ
PIRNPQDFPDVKPTHYKFRIIGFLFLWWLLLFTIICCFIGMPITIGRSLA
GLASISNPNDIITFFVGVVVVWVLSKLVNLVIFHRSTINIIQWIPVAFKV
LILGFSICIFLPVLVGILFDLILFIPFVSSYDETFYIFSSDIFYSWCIGA
LILKFWYRWATAVPNEGNIRHNRIEDEEQTERDRWFDRFETLKRNGFANV
DLMFTMKKIVFPIAHFLMVLFTVPYFVSRGLVPWLGGSAILENFTFRFGY
PAFTVLLIIESLYNKAKVYLIKLHNSIRDDRYLIGKHLHNIDTTN
>PPL_04436 g3925, 
MSLVCRLWRKRTSQTIKVIDLSDIEIYNTPWVDTFFQSFTSLLVLRLKQC
TLELNQLSALLSYFQSLIEVDISFCKLHSVEHAMGIVDTGLSVFLERLEK
HPSLENLYLSFNHLDTKLTQDLLVLSRIPNHYKINLTINLMFANGFNIIC
EIMKSNQSIIKALNLNKSKIGRETNSLVAFSDVLKLNHSLTSLDLSSNQI
SDSAAKILSESLATNDTLVQLNLSFNEIKKEGSVALANALKSNRSIESLN
FSYNFLGEEGTRAFSDLIATNTTLTDLNLSANKITFFNVPQIANALAANK
TLRSLNFLRNMIDQVGAEYISQGLHYNQSLTSLNISSNKFGNLGAVLIAK
ALSSNRDTKITEINMSSNCIEDEGAASFAAVVLHNNTVTSLDLSVNWINS
DGVVEIANAFLENPNSTITSIDLSCNTICPKGARAMAEALSVDCALRHIN
FFSNNIETDGAYELSKSIIKNHTLTSLELSTNLIGNEGIKYLSQALLENN
TIVSLSLSQSLIAYEGIKYLVSLISLNHTLTFLDLSYNFIGPKGAEELSL
SLENNKTITSLDLSSNSIGDDGATAIAGIFPKNNTLQRLSLYNNKIGPKG
AKPIVENLLKNHSLYSINLLANRIDAYILKPIVKRLEHLLPAPS
>PPL_04777 g4227, 
MYNFVSTVQKPTAVYHSVTGCFTSPNERNLIISKGTKLEIFTLTPEGLSP
VLDVNIYGRISDMRILTATGDKQDRLFILTEKYKYCILAFNSESRELVTI
ATGDAEGTIGRPAEAGQIGIVDPECRMIGMHLYEGLFRVVPLEHGQPVRE
SFSMRIEQLQIVDMVFLKQCAKPTLALLFKDTRDARHIVTYSIDVVTKEL
IEGASQDSVEENSTMLVPLDNGAMLIVGEMAITYMNLKGNSQPVTISIDH
THIVAYEQIDRDRFLLADDCGSLYLLHITLDSSKQTALNMKWEPLGETSI
ASSLSYLDSGVVYVGSSSGDSQLIRLNSHIDPNTGSYISVIDQFTNLGPI
TDFCVVDVEKQGQGQLVTCSGTFQDGSLRIIRNGIGIAEQASIELPGIRG
LWSLSNNSNPSSLHRHLIVSFINSTKVLTFSGEEIEETEIAGFDSNATTL
YCGNTTENNHFIQIATSGIYLVDSSSLMRLDQYTPEKGSINLASCNGSQI
LISQGSNLTYLEISDSKLIIKKEAQLQYEISCLDISLLDGFTSSPVCAVG
LWTDISVRILQLPNLNEVCKETLGGEILPRSILFITFEGTNYLLCSLGDG
HLFNFTFDVVENLLQERKKLSLGTTPILLNSFKLKNSTNVFASSDRPTVI
YSNNKKLLYSAINMKVVSHVCSFNSEAFRDSIAIATESSLVIGTIDEIQK
LHIRNVPLGEMARRITYVEEYHSYAVITIQRNDGNNNNNDNDNFNNNNNN
GVPLTNYVKLLNEQTFETTSKYALKSFEFGWSIVTCRFKNDDALYVVVGT
AFHNEVESQQSKGRILVFRIEDNRLILLDEVALPACVYCLLPFNGRLLAG
INKRVQAFNWGVDTNKLTKAESYSGHTLSHSMVSRGHFVLVADLMKSMTL
LVEDQQGAIKELARNPLPIWLSRIEMIDDETFIGGDNSYNLIVVQKNAEA
SSEIDNELLDTVGQFHLGETINKFKHGSLVTSPDMDSPKLPTILFGTVSG
AIGVIVSISKDDYEFFEKLQKGLNRVVHGVGGLPFENWRSFSTEHMTIPS
KNFIDGDLIETFLDLRHDKMLEAIKDMNISIEDTYRRIESLMHHIR
>PPL_05118 g4515, 
MQNYHQQMNNVEQHPYYYMQQGQDQNPNGQPDHINIKKDDSISDQAASYS
AHHNGGANTLADLSDLTDKSVLLKHENILHHDKDSTAKMEPQDPSILQQL
GYQQQQQHMSNIIHSSSSPLSNNTPITTSTTTSSHGNIGQLQQSGNSPQQ
QSQQQPLVYEIPDILLAPGTRSPYIQELTDSIISIILGKAHKDEPISQVY
TNIASVCRQWHRLSVDRITNYCYHLPPDKKITNFFSNIAHNKFPNLHTLQ
FKVLTPTLFDVTSFVKMILIDNKLITTLELSQNGIGNKAATCIGTCLVNN
TTITHLNLSFNSIGNEGAEEISKALGTNKTLTHLDLSQNCIGLKGSKALS
TAIQTTKTLHILNLSKNRFGTKGIDVIADSIGKNTCLLNIDFSRNEISEK
NAKIIGDVIKNHPTLQSLNFCDTSLKSDSMKYISEGIQASQTLNSIDLSR
NEFGYKGSKSLAVALQHSNSLAFLDLCGNDIGDKGAIPIAEALADNKSLT
NLSLAFNNIGTQAAQQLGAAIKVNNSLVSLDISINAEIGPIGATSISEGL
CYNKRLTQVSFCTNGFGPHGAKSLSEALRFNNTLTKIELRGNEIGDDGCR
YICETLKTNASLTEINLSANGISNEGARAVCEALWYNRTLQQIILTHNNI
NQQGVQTMKDTLEQVFVVTFDSYIYPPNSQTILSNLYITRPNHIVCKVTI
>PPL_05455 g4817, 
MNKVGMLKFQGSSQFRQRIICSTLSSRPVKITNIRDDQERPGLTDYEVSF
LRLLDKITNGTKIDINGTGTQLTYIPGLLIGGKLQHDCPVSRGIGYFVEA
LICLAPFSKLPLDITLTGITNNDLDLTIDTIRTTTLPIVRKFGIEEGLHI
KILKRGAPPGGGGSVIFKCPIVQQLKPIQLLDEGKIRRVRGIAYATRVSP
QIPNRVLDTAKGILLKFTPDVYISADHYKGGESGQSPGYGLTLVAETTTG
CCISAECMGAAGESPESLGERTANFLLEEILNGGCIDSNNQSLALLFMVL
CPEDISKIKLGKLTPYTIDFIRNIKEFFGTVFKIETDDDSKTITFTCLGT
GFKNMARKTF
>PPL_05669 g5029, 
MIVSLFPPVPVVNVPLRLAYRETVDPNKVDSAKTVLFSSFKNFIHYNCIE
DLKIYIDDVEYVKESGVGLTADEHEQQQQHHSKTILSKISKFFKSDKSDE
ESEAAQIKSVYGVTDSNIIPLNEENALFIPRVEHANKQVTLKFTIKKKPF
ELKATVQHRHPRVWSDIKQLDLVSLENEKIEEEQTAMTNSSFRVMQFNIL
ADCYTSPANYVGCPVYSLYRNYRQWVLPEYILEHSPDVVCLQEAEVRMER
LTKKLVEAGYLHTPLCDLARYEEEQSITYFKTSRYQPIELQMVHYKNLKN
LLTPAQLDPLLKSSITAKYLDLLSSSMHHNKFSLALLQDKQTSSSILFGS
VHLHWGSPDFDINYIAQVIQLHIFMMVVGNLLDKHSLPRDTPLVICGDYN
NGPTQKAYTLMDYGQYELNGYTLSHSFKMSSAYSHRPDGEPKYTIRTNHF
TGSIDQIWMSEKLRVSKLLEIGDHYPRQLPSLTDPSDHIMMLADLYVSKH
PRVTLTAADNNQS
>PPL_05958 g5309, 
MVETDKKEISEWDEPTNKSGLGAVGATPRRNRWDETPQKMASSVVAETPK
RRSRWDETPVQQMGAQTPRIGMAGVGGITPLGGGVTPLGGMSMMTPLPGS
AGSSSVALKIEREIDDRNRPWTEEELNAQLPSDGYEILAPPAGYVPIMTP
ARKLMSTPVGVAGTSGGFFIPEDQPRVSGGAGEYGVDQTPGGLPMKPEDK
IYFEKLLNEDEEETLSPEEAKERKIMKLLLRIKNGTPPMRKQALRQLTDK
AREFGPAPLFNQILPLFTSQSLEDQERHLLVKVIDRVLYKLDDLVRPFVR
KILSVIEPYLIDQNYYARVEAREIISNLSKAAGLASMTATMRPDIDSPEE
DIRNTTARAFAVVASALGIPSLLPFLRAVCKSKKSWQARHTGIKIIQQIA
ILMGCAILPHLKSMVEIVEHGLNDDQPKVRTITALAIAALAEAATPYGIE
SFDSVLKPLWYGIRQYRDKGLAAFLKAIGYIIPLMDARYASYYTKEVMTI
LVREFKTNEDEMKKIILKVVKQCVGTEGVEAQYIRDEVVPEFFKQFWVRR
MADRRNHKQLVETTVEIANKVGGAEVIAKIVDDLKDESEPYRKMVMEAIE
KIISSLGASDINPRLEEQLIDGVLYAFQEQSTDETLIMLQGFGTIVLSLG
VRVKPYLTQIAGTIKWRLNNKSAKVRQQAADLISRIAVVMQLCGEEQLMS
HLGQILYEYLGEEYPEVLGSILGALKSIVNVIGMTKMTPPIKDLLPRLTP
ILKNRHEKVQENCIDLVGRIADRGADFVLEREWMRICFELLDMLKAHKKG
IRRATVNTFGYIAKAIGPQEVLGTLLNNLKVQDRQNRVCTTIAIAIVAET
CAPYTVLPGLMNEYRIPELNVQNGVLKSLSFLFEYIGEMGKDYIYAVTPL
LEDALMDRDPVHRQTACSAVKHMSLGVHGLGCEDALIHLLNYVWPNIFET
SPHVINAFLESVEGLRTALGPTIILQYTLQGLFHPARKVRDIYWKVYNML
YISSQDAMIPAYPRAADDGPNTYTRYELDYVI
>PPL_06218 g5553, 
MGVPRFFRWVSERYPQIIQNLVDSTAPEYDNLYLDMNGIIHACSQEISNS
LLTFSEEELIRKVCNYIDKLFHIIRPKKLLYMAIDGVAPRSKLNQQRQRR
FLSVFLEDKAKQKMISEGKEIPEVIFSRTAITPGTEFMSNLSDCLQFFIK
KKINEDMSWREVEIIFSGPENPGEGEHKIIDYIRKNKASPDWDPNQSHCL
YGLDADLILLALVTHEPHFSILREEISFRPTKRQLDFQLLHISLLREYLD
LEMRNDSLEFGYDLERIIDDFIMILIIFGNDFVPHLPFCEISKSGLNVVM
DLYKKLLPDLGDYITDGAEIDLHRMQSFFNAVAQFELKQQNVSSGLDDEE
ELDDSAAVAAAIVEGEDPEDAEARKEFERQALERIKHHFNELDFKDEELE
DDISSAWKNAYYRAHFEDFPDNYDEIPQFKRNLVHSYLEGIVWVLNYYHN
GCISWVWFYPHYYSPLACDFVDIASIEVNFEPGSPVTPFQQLMSVLPPQS
AYLLPAPYRRLMESDDSPIADFYPKEFEVDTSDPHYFDGIAIIGFPDLAR
LVEATKDEDSYDLTPRERMRNTLRHAVIIYHDAQIEQSVETPNAKLFDDL
EKSNATIEELILPEPNENSLKPFRLCDGVLVGNHSPSGFPTFQSIDFDWE
YMNGVANLWGNTSRKDSMIVNPPLPELCTLRDLKPLIGKRCYVNWPYHIE
AKIVGFSDSNQHITRDSTIDYVAFQKIQYLDQLKKLPFDYLRRGINIKDL
LENTVLVHVQKIVGVDTEVGGRRVKRYSDKEDTFPLQLMVEYDRVKADSR
YEECEELPFEKRFPIGKQVIYVNSDHYGSVGTVLHHFDNTLELELKVQKM
DTKFGHQCAREEDEYYPIQQVCKMTNLSTQQISLLTGALFIDKPMTDIGL
NMKFTGRQQQLLGYCRGITMSDKTGNSFKKWEFSSLAVEVIKKYLAAFPV
LNSILTMYADDKTGKKAIDISSLIPEKSDRVALIKKIDEFMKESKIRHQR
LVPCDTITMKREAIKKVEDFYLKQSKHEKVSVITRTSPEHVIEPASYESV
ISYERHQSQTHLQQRNNNSPSLASSNISNGGFLESNKSKIFNLGDRVVTL
LEKGNIPFGLYGTVVSVQEHKVDVVLDQECFSANSLDGFCLEKRGICISK
WRVYNLSQPTAVTNYRYRSNKPNYSIDSYEHWKKYDHQNNNKSNNNNNNN
NNNNNNNHNNNNNNSASIQQKIKEEMPPNLNWQQQQEYYKQLHRKNYYNR
QEQYTKNFTPEELVHYAKKYPHVHQSMLWQHHQQQLQIQQSKQLKNNNNN
VDKKPPGLVQQQGQGQQAKPPKQKKDKKEKKEKKNNTSGAEPAKQSDSTS
QILQKMFESQALPDPPASPPITSFFMKAAAQSQEGAPKAAEVGPAETAPQ
QPDQQQQQPPQGPPSQLLQMIQSSLQSVPQSGQPPAMPMQYPPPPHMGMP
PFPHPPPPHMGMPPFPHPPHMGMPMFPPPPFGVHPPPPMPMVNQRPNQHH
HQQQHQQQFPQLSESNKQQKPKNNNKQNKVKQPQQQVSKAQPNTNTNIHT
LNDLKQNNKPKKDKANKSQWVQKNKPTAGSSSAVPENNDNKPNNNNNNDE
NNTNNGDKDQLNWQQKLEK
>PPL_06222 g5557, 
MSEYKSSIFKSISSPTTTANQYNNNNNSNNSSSSFNSFNYGYESSMYYSD
INKLEGMNNNSNTNNNSSISSNNNDVSYGWSYNDIVNGADELKTTAYHSP
TAAPTSGTSMNSSPIYYNKDLIDKLSDDSQPPPHHSPLNLYGKSLRNSYQ
NGIGGGHSRNNSGSGSSHSRNNSSGGGSSHSRNHSRNGSIGNNSTTAAEL
QLLDHDDNNGNSHFNDDNDDNDDSSDVDQTDNDDNGYLSTLPPTTTTSSK
SYSASNLSIFVENNYSELPITKPSGSCTPGGGSSSSPSSLQVSPKLSSKT
PPTAFPSHFPPLSPSIDVPSPSSSSSPPPVPSRTHKRFHSTSNLPNNNNN
ESNSTNNSNNNSSKLNVFSNQRANLTTSANNISQSLSSSFGSISSKISQS
PLKSKLLKVITNVKNSSPVESILTNLSISTSSLSPRNSSDISIQPIASST
SEPSSNYPSLQSSTNSTVDIEYEQQQNQQQQQQQSQDGNNIKLQDSIDKI
VNEFDFDLIKIEDNSQQQQQQQQQQQQSSQQTNSNNINDVNIEKTTLTKK
PQQFLSIDEKSVIQNSRRSSLLHLSYKSSLLNQISQQQSRSERYRFSLPL
LDISPYPKLNNDFNATNNSSNNNYNNSNNNQFISYDELFDNYKNSSSSNS
NGNNIYSYPSLKNTNGSTVKQDITLQRFKPLPPRPPTTQHLQHQQPSNSS
NNNNKMSSLLQFISISLPDIITKNGESSAQSTSTAMIDDEQYRNFSQYLN
ESGDRHKIYQHIQWHLEQPVVNIEIVKLLGTHYGFTDSCRAISWMLMTGY
LPPNKDQRQSALQSKKLQYRDLVKKYYGDCKLFESDENNFERNTNKLLNN
VAVLWSNNTQSGQQEKEKFNELVQQVHIDVIRTRPDGFYDLFELKEIEQM
SERILVIWSSENKDVSYFQGLNDLICPFLIVFLDYAIEVSKVTQDSFPSY
PSLVDTLIDDEVLLSKKIGDGSLVKELIEKKRFDILSRVETDVYWCLSNL
MNSCKSYAANTGCGLPAEGMMKNLESLIKESNEELYLHFKKHGLDFSHFS
FRWMVCFLIRELSFETGIKLWDRYMCDKNNEGFSILHICFCASILSYWSN
DLLNMEFMELVTYLQRSDILPRDQLDPILRNKEREREMVKPKQNIDPEEA
VVRHTPHDIKCKARRVLLSQKLRAQRKKAKESGRKERQKERQKLGEDGPA
AQQPRTIESMRVADDTIVDEEDPEYQDEIELDEFSKYFEGKEPKTCITTN
EKPGGKAVGFAKLFPRILPNAEYFPRQHFELSEIVKFCKNRDYTDLIVVN
EDKGEVNTLMICHLPDGPTALFKVTSITMPDKIPGGGEMTNHKAELIVNN
FTTRLGHTIGRMFASLFPQEPNFRGRRVCTLHNQRDFIFFRQHRYMFESK
SKANLQELGPRFTLKLMSLQHGTFDTKSGEYIHLHKAGMDVDRKKFVL
>PPL_06397 g5715, 
MTENSNNNNSDTTTNINNSKPKSFENVMEITKQLVERLTTQNDQKIRSLH
NHKKNKEDNNSSNSTNNNSNIVDKKKNQYIDTKYITILSFSKKLIQLSLF
QYRSFVNDENLLVLFKKYGRYIGSLDLSKCNHFTVEALIDMLEYLPNLNA
LKLQFCSQLTCENLERLLQLQKERCSIKHLDISFNAMRSLIRSRLQLNPN
LSLISLMHSHISDDDGVALLDSVKGNESLFSLNLSFNSISDKTMHAIAEL
MSRDSTLRELNLATNKISDVGMLEFGAALAYNNHIQILDLSSNFIQDRGG
VAIAKSLALDSSVQKLDLSANDVGPQCGIEFGRSLLVNKTLTSLNLHRTM
IDTEGGLALCQSLATNQTLLYLDLGMNQLENVVGCAIGESLKKNRSLHTL
ILKRNQFGDQAAHAIGDALQTNHTLTSLNISGNQIGHKGAKSIAYSLPLN
KTLRDLDLSYNMIGDGGGKLIGEALGTNSSLIKLNLAANRIGSESCKSIA
QSILNSTFNPNTQQHIDQLNNSGNLTESGSTQSPPHTTQSVQCSHLQQQI
TQQLAANMRISSSHVNLRGLNNNNNNNIGNIGIGNIGNNKTVVRALFPTL
VWLILDSNRVGDEGAIALSQVIANNPPLQTISLVSNLIGESGGRAIGESL
KYNTNLLSLTLDSNRLGPDGAKFICQALKNNGTLTHLGLSGNHIQDQGGQ
YIIDALDLNSTLKSIFIANNDICDNIKETLEAIPQCTES
>PPL_06763 g6038, 
MSDVDKLQEKAKKWKQLNNKRYSDKRKFGYVEPQKEDMPPEHLRKIIKDH
GDMSSKKFRHDKRVYLGALKYMPHAVFKLLENMPMPWEQVRNVKVLYHIT
GAITFVNEIPLVIEPVYTAQWGSMWVTMKREKRDRKHFKRIKFPLFDDEE
PPLDYSENILDEEVEYSIQMDLDETEDAAVIDWFYDSKPLSNTKYVNGPS
YKKWKLDLPILSNLLRLASPLLSDLTDNNYFYLFDDKSFFTAKALNMAIP
GGPKFEPLFRDMEDDDEDWNEFNDINKIIIRHKIRTEYKVAFPYLYNNRP
RKVAIPFYHAPNICYAKSTDPDLPGFYFDPVLLHPIPSYKLDKSQPQTAY
GDEDDDFALPEQVDPFLQETELDTETTPAGIQLYWAPKPFNQRSGLTRRA
QDIPLVQTWYKEHCPPGHPVKVRVSYQKLLKCYVLNKLHHRPPKSLNKKY
LFRSLKATKFFQSTEIDWVEAGLQICRQGYNMLNLLIHRKNLNYLHLDYN
FYLKPIKTLTTKERKKSRFGNAFHLCREILRMTKLVVDTHVKYRLGAAEA
FQLADGLQYLFSHIGLLTGMFRYKYRLMRQIRMCKDLKHLIYYRFNTGPV
GKGPGCGFWAPMWRVWLFFLRGIVPLLERWIGNLLARQFEGRQNDTVKTQ
TKQRVESDHDVKLRAAVVIDILDMMPEGVKENKTRIILQHLSESWRCWKA
NIPWKVPGLPIPIENMILRFVKSKADWWTNVAQYNRERIRRGATVDKTVC
KKNLGRLTRLSLKAEQERQHNYLKDGPYVSAEEGVAIYTTTVHWLEQRRF
SSIPFPQTSYKHDIKILTLALERLKEAYSVKSRLNQSQREELVLIEQAYE
NPHEALARIKRHLLTQRTFKEVGIEFMDLYTHLTPVYDVEPFEKITDAYL
DQYLWYEADKRQLFPNWVKPSDNEPAPVLVHKWCQGVNNLDSIWDTSDGE
CVVMMETQLSKVYEKMDLTLMNRLLRLIVDQNLADYMSGKNNVVINFKDM
NHTNSYGLIRGLQFASFIFQYYGLVLDLLILGLNRAAEIAGPPNLPNPFL
TYKDVETETNHPIRLYTRNVDRIHILFKFTQDESRELIQKYMSEHPDPNN
ENVVGYNNKKCWPRDCRMRLMKHDVNLGRAVFWNIKNRLPRSLTTIEWDE
SFVSVYSRDNPNLLFSMSGFEVRILPKCRATNEQMIPKDSVWSLQNMNTR
ERTAQAYLRVDRDSMERFENRIRMILMASGSTTFTKIVNKWNTALIGLMT
YYREAVVVTREMLDMLVRCENKIQTRVKIGLNSKMPNRFPPVVFYTPKEL
GGLGMLSMGHVLIPQSDLRYSRQTDTGITHFTSGMSHDEDQLIPNLYRYI
QPWEQEIKDSQRVWAEYALKYEEAKSQNKNLALEDLEDSWDRGIPRINTL
FQKSRHTLAYDKGWRVRTDWKQYQVLKSNPFWWTNQRHDGKLWNLNNYRT
DMIQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEESMKYKKLTHAQ
RSGLNQIPNRRFTLWWSPTINRKNVYVGFQVQLDLTGIFMHGKIPTLKIS
LIQIFRAHLWQKIHESIVMDLCQVFDQELDNLEIAVVNKEAIHPRKSYKM
NSSCADILLRAAHKWQVSRPSILQDTRDTYDGSTTQYWLDVQLKWGDFDS
HDIERYSRAKFLDYTMDQMSFYPSPTGCLIGIDLAYNIYSSFGNWFPGVK
PLVQRAMDKIMKSNPALYVLRERIRKGLQLYSSEPTEPYLSSQNYGELFS
NKIIWTFEGNLATKPINGAIFIFNPRTGQLFLKIIHTDVWLGQKRLGQLA
KWKTAEEVAALIRSLPVEEQPKQIVVTRKGMLDPLEVHLLDFPNIVIQGS
ELQLPFQSCLKIEKFGDLILKATEPKMVLFNIYDDWLNTIPSYTAFSRLI
LILRALHVNNERTKIILKPDKNTITQPHHIWPTLTDQEWIKVEVALKDLI
LADFGKKNNVNVASLTQSEIRDIILGMEISAPSQQREDQIAEIDKQKQEA
SQLTAVTVRTTNIHGEEIISTATSPHEQKVFSSKTDWRVRAISATNLHMR
TNQIYVNSDFVKETGYTYVVPKNILKKFITIADLRTQIAGYIYGISPPDN
PQVKEIRCIVMVPQWGTPVFVNVPNQMPEHEYLKDLEPLGWIHTQPTELP
QLSPQECITHSKIMSDNKSWDGEKAIIISVSVSWPCTLTAYKLTPAGYEW
GKANKDSQAYQGFQPSHYEKVQMLLSDRFLGFYMIPDRGSWNYNFMGVKH
SANMTYGLKLDYPKNFYDEAHRPSHFHNWTQVSSEINEDNEAADQENLFE
>PPL_07062 g6321, 
MDYRDAKGPCLARYLCQTLYRGERYYLQIDSHMRFIKEWDEILINQLKQC
PSSKPVLTAYPMGYTLPNNLPNYTYPTLLVARSFGSQDKMLRLGSRLAAI
PKLKSPVESLFWIAGFSFSYGSMITEVHYDPHLCHLFFGEEMLMSARLWT
SGYDFYSPTHAILFHLWKRSHRPTFTELSFEEDRQKSHHRLREIMGIPSD
SNDIGRPDIEKYSLGTQRSIDQYQEYCGVNFSSQSISEKALKGGHNDSFF
LNEIIEMAIKSQQDMTDIDNNSNTFESNDQHENQDQIAQLHDETAIEMND
DPSAEHDAQQQSENDGDDENRDDNEDVGNGEEDEDDDDDDDDDEEDEDDD
DVNLVLDTDIVESGRTARSAIKPGYIKGVTSITPGSGAKYDVTKQTNTNF
QTNRPQKSIYDVSLDSFDDKPWNKPGADITDYFNYNFTEDTWKAYCERQN
QIRAEQNNLGKIKSYESKNQDNKNDILPPEFLMMTDNNNGGTNSNNNNNN
NNIASNSNRDMMAKRAPLKRENAPWQHDNMNHVPPHIMRQQGGYIPPYAG
GVPPDYSRGPGGFRGAAYNDNPNQPDDRRRDRDRERDRDQRDTGTVGRDR
GDRERDRGDRETRDRDRDPRDRDRDRDRDRETRDRERDRETRDRDQIRER
DRDDYNRSDDRKKDDRRRGSTERRSSRDTYQSRDTDYKRKLEDQEDERSK
RRR
>PPL_07415 g6630, 
MSYKRINGVDESDNRKKRRNDDFDAFDSIIGDMHKSKEATTQGKTARSTF
SSEAVIKKIIDLTKENDVMNLEVEARIGRLTSEFKSGVIQEDFNTLYSAM
RSKFGEPTKVTETDHIFDDYRIVFCEDTQKVLRKESKVDKNTFNLPTNLI
YDIRISVSIEQQLPVPIYLPEGNRPRRYKTRYTFTDKQWKIDLTQVINYT
PGVETQVSSLEVEVELLPNAIEGCTNVDNLKSLLNRFLNEAKSLISMIQP
KQTLSFPDVEMEKVSNFQEIMDLKKTLFQFMPGSNENKQDTFPGSMPINF
GKKYFSHVQANDYYVSEKTDGVRYLLLIAKDNVYLVDRKFDFYSVKFDKL
IEIYGNDTLMDGEMIRQLRTKKPIFLVFDLLSCRGVCVAGKDLSGRIEAI
RNSITGPFMHKVENQHHQTPLPFLIWGKNFFNKTQIESVFKSIKQRGEDR
QYVDHKREHNTDGIIFTPNTPYTPYTQNDLFKWKYLDKWTIDFKIMDKGQ
KGWYLTCIGNGNSDVEIRSLNFSRDDIENLQRDFKRARDPNTVIVECSFQ
PNTGKWKYHMVRADKFKANYISIVMDTMESIAEAISSEELQYRIPLKHEA
DTWDYEIQKMRAAMLQNLKNKKKAGNSSSSSSSSSSSLSSNNTSHSRPPN
SNNSSANHQHSGQRPNNNNSNSSSSNNNNNQTNSAYYQGSNFEGDPFGTD
EDIEPFQHDGQPIFEDTTYDREDDLDGEGDDE
>PPL_07750 g6931, 
MYTNPPPNVGYGMMQPAMMVPVFGHQYYNPQQALQQQQQQQQQQQPTGLS
NSTGMPQPPPQQYVTTNQLSQSTPQFSTNVHAYPTPQYIYQPQSQPIYTA
NVQQQQQQQQQQHSPNGTPITSQHPISQPIMMDMHHAYLPYNNFLVSQPP
NMQQQQQSPMMSSPTAKSPSTSPISNSQYISNATPAPATQSISQPLTQPI
SNIQQQAQPAPVAPLKPKFSLPLHNLNNQQSNTGPISPKSTISTTPKKPY
SPRSSKSKSSTLISHQLPLSPSTLKNAGLSSASTPLFTNPNQQHVTPNTT
PRSPLSGSLELGCSTADSSLGPYQSTSTFNSPRNLSSAANSPQQQQSPKS
LIIPQAISMVNDLQQQQQQHQQHQDQQQQHLQYFNNVSKICPVIYQQPTQ
QQQSHQPQPIDQSYMYQPNTSTSSLDPMDSESIPLFSQIAMPIPSGLNCV
RCMGSVDPQMLLSCIHCHVHFHRYCVFGPEQQNHLQPQQWICMHCQNIQT
DFGLIPTTTSNTATNWMNIPMPMVDQQQINNNTNLDLQQIKQEMNVSPNN
NNSSSDSEVSDSENGDDSSDEEEEDDIQPLSRQMKDNNDDDDDDEDEDEE
DDSQPTSTESSPRNQTTPREGKSKGHWTKEEDEMLRALVEKYGTKRWKYI
ASLLGLRNGRQCRERWSNQLDPGIKRDAWTLNEDKIILEAHAKFGNKWAE
ISKLLPGRTNCAIKNHWNSTMKRKISKKQYDISLLSLESPRPSHSSSPPS
QFVQSNNNNNSTIINNNNNNNNIISNSSEQAVDQYATADQHRMNHSPRVD
MLSTTSTTTTTTTSSSSSNINSNNNNNNVQLVKKVGLTLPCYICESITYL
PPKGSDSTNKQHILSQEHCNYFDLPFPVGLGEKKQYCMCHAHYNSYRRRQ
ASKCGAGINPVVPPLDFTRKEDETILAFRSRREWPDLNDILMMKNENTKS
DTMKIDLTVLYDILLETSSLPIEECYQKIYEVLNIKAKSKKSSTSSAGSS
SLLVNNNNNNNSSGNLDNDLVNKKLIKNNIKFKIKNLLVTFPHLKFYGSI
KDFQLQRLQKVPEILLVKDNEHLRMKFLES
>PPL_07881 g7065, 
MFGRRRIALVHQNGIKLLSGHSNITQEIKLKSVKMAYIVDPYVLILHKDG
TISLYQGNTGITQLLEYELPQPKDGVMSCSMFHDVKSFFSINNNSHTEQS
NNNSSSSTFNFDTDDEDDKDGDVKMNDKDQSSSSTSTSSSSTTLQQQNVY
LIILTKKSTMELYRLDTKELIISAANVSKEYDILGVASHQFTMNQQQLLA
QQTQHHNINNNTNGNGVNQQQETQPKIVEIVIHYLHNSPHSSPYLMILNE
FGDILIYKAIKYKDSMDNTKELIRFIKHTDQNLHSKQREYSYGIDPSSES
SFYIRKIVAFDNIGGHKGVFMCGKRSLWFFCEKNYLRAHPMNFKDPVTSF
TCFHNINCSYGFIYFTEKGVLRINQLSNMMNFENEWAIRKIPLRMTCHKI
SFHQEFKCYVLVISYPQAPQSDEEEEEKEKSKKPLILEEKFQVKLIDPSM
NWSIVDSFSMSEKETVLCAKIVHLKYADVDGIKLKPYLCVGTAYTHGEDT
VCKGRILVFEIISHREVQDDTGEEKKRLNLLYEKDQKGPVTALAGLNGLL
LMSIGPKLIVNNFSSGSLVGIAFYDTQIFIVSLSTVKNYILVGDMYKSVS
FFKLKDQKQLILLGKDYEEMNTFSSEFIIDQRVLSIIVSDREKNLRIFSF
DPNDPESRGGQMLLSKTIYHIGTNTNKFLRTPLRLPDGTLRNDMHLLFFG
SLDGAIQVLAPLDKKQFQFLQQLQSRLYLLPQTAGLNPREFRQKNDHQYF
TQPGHYIIDGELLTLYQFLSKDDKTLISQSLGTNINEIDRQISILNNSYS
IFS
>PPL_13379 g7066, 
MESYLFNKQLFPPTGVEHCIRAKLIDDNAVNLVIAKTSLLQVYTIRYDRI
EQQQQQQQQTNEQQSQQDTLKPWLELNLELQLFSIIESLNCVRLPGDDID
SLILSFRDAKVSIVKYNKATEKLDIRSLHYFEGNSELKGGRKTFRTPPLI
RVDYQQRCAVMLLYDRHLAVLPFPRSFSILDDEEEEEEEEAAVVADQQQQ
HDENEQQQPQDDQQQQQTSEKNKKKKQSESYVISLNSLGIENVKDFCFLH
TYYEPTLLFLHEPSQTWTSRISSKKFTNVLTAVSLNIAQRQQPVIWSIEH
LPYNCERLVPVPDPLGGAMVLTPNILFYFNQSSRYGLECNEYAQIDTGDQ
FQFPIDSSSTNLVFTLDCANFIFLGDRLLGSLKGGELLIFHLISDGRNVQ
RISITKAGASVLSSTSCVLTDNLLFLGSRLGDSLLLQYTEKIIDVDSSDN
VENLSNPYKKKKTSEVFDLFDDEERNSKTGASDADGNGQSLFDDEDDIFN
DKKNQLKSYRLNICDHITNIGPVSDLITGVSYDHASVSNDESFEQRSLEL
VACSGHGKNGALTILQYGVRPELNTSFELPGVRQSWTLYYDDPLAASQSG
SSASNAAASAASKKRQHEEEYIRCQLASLFVLIDG
>PPL_08452 g7562, 
MDDNDQDQQEQQLEQQRLEQQQLEQQQQLEQQQQLEQHIQNESNNNNNNN
NNNTMQSDNSVIEQNVDVKMTNLPTDSSDSSTNIINTDQPPTTAAITTTT
KQNDIYNPEQASLEGNNNNNNNNSSNNNSPQLNAINKSSPVSTTLSPSLN
AVSSPTSTSHTQTTPTTSTGNASTSMLSPSTPTSTTTTTTTTISSVMPAI
GKRLNVQIDNLEARITNDKYDTEAWTLLLNEVQSQPITIARDIYERFLAV
FPTAGRYWKLYVEQEMSSKNYDMVEKIFLRSLRNVRNVELWKCYITYIRQ
IKGDSNKEEVIKAFELAIEYIGLDISSTPIWLEYLAFLKEEKTATSAEEG
SKKNAIRKLYQRAIENPMHDLDQLWKEYEQFEQASGNKNLAKNLLAEHSS
RYQHAKTVYRERKALLEGILRNMLAKPPRATDKEEHQVRLWRKLIVYEKS
NPQRFEQAQLRQRITATYNQCLLCLYHYPDIWFEAATYQADSGNHELATN
FYERAIQAIPNNLFLHFSFADFLEINKKVAQAKEVYERLVTPSTLAELSH
NPLVWIQYMRFARRTERIEGPRKIFKRAKSHPECTYHVYIALGFIEYYVN
QDTKTAREIFELGLKKFSHEIPYVHFYVDFLTNLNEDNNTRVLFEKILSI
IPSDKSEIFWRKYLDFEYRQNQDINTIVKLEKRFQQLSPSNEKMSIMQVL
NRYKFLNLWPCHPNEIEIINKNLIEEDQDIAIEEQSAENQQQQHHHHHHG
KKKDKHDRGKGGASGSGGGGDGGSNDSSTKDGKSYYNDKPSTSTKIPVST
WKTTRPDTTMMITYRLNEMGKISTPSGGGLGNGTGNDSSNSAGGSGGVGG
GNQPNMMRNDPRNNNNGANNNNQWSNNDNNLIPDFIKPLLRILPAPNSFR
GPWIDVDQLMMLINDTPIPNNSPITMGLGGGIGGGVGGIGGMSGLDSPPN
VMMKPSGGNTGGGNVINKNINKSMNQPHKRKMESNNNNNNNPDNNSNDND
DSQHQPTTQTHQPHQPAIVNKPPEHDIYRKRQASKLSKRS
>PPL_08524 g7627, 
MTKDDNGALVETGVNKADERKKAKHQKKKEQKKRQKQKKISQLQDQSNNN
TSNSSSNNNNNNNNRKKQENGNGIHKDNIENNANGDDHVDEDMGEQEEEF
LIDESDPTFELYNKLLKHFDNPTGYSEEDEQQKEQDKAEQEEEQQKEIAI
KEEPKDIDDNGESDEEENDNDKPSKMSNRERKRQQKLNLPILKQLVDRPD
IVELHDTNSPNPSFLISMKSTRNSVSVPTHWCQKRKYLQGKRGYVKQPFE
LPEFIAATGITKIREALLEKSAQQKTKTKQRERLQPKMRTMNIDYHVLRD
AFFIHQTKPKLCIQGELYYEGKEFEVSIKKTKPGVLSEDLRRALGMADNY
PPPWLIHMQTHGPPPSYPNLKVQGVNAPIPEGAQYGFHAGGWGKPPADLQ
QQYANANSHTNAIIDSLTAPVEKEHWGELLAEEEYEEEQQEDEEDVDQQE
DEEPEESDISEGISSVPSGLETPDTIDIKKGRQQQQDAGQPRQLYQVLDQ
TSRTIGSGIMESNYKYNVPSTIKTSTTTTTPGRGSNKVDIIKSQRSAPVD
ITFNPSELEDMNELDEDLLKKKYEQAVAAEKGPQKPKEDLSNVADDHKKR
KMQSSKDDKQKKFKF
>PPL_09354 g8389, subunit of the splicing factor SF3A required for spliceosome assembly contains PRP9 domain characteristic of splicing factor 3A subunit 3 expressed in pstO cells
MSSTLLERTRELHESIERYELMIVAEQSEEPKTQKDSVIQSHCVNHYLEQ
SIKCANDLKKIYQDEDGQRKADLSAISGQGPAIFSNFYDKLRELKDYHRK
YPTLEIERIGSVLNYTPTLSFSGNEAYGRFLDLNEMFELYLNLPFVQKNI
DYITYLSLFSKFNYNDISRFKNAKYKQYLDKLYQYLASFMERSQPMFDMK
SMNESNEKEFEDKWNNKEFDPSADNNNNNKSDSNGHNNNNNNNSNNNDNK
NEETTADESMDTKETTTAATATATATTDDTSSPLYCKACKKLFASENVYN
GHLKGKKHIKLEELLQKSQSENGGLVIDMVAFNHKSRKPTSLLEYQISKL
GELLDDQVQETKESVIKKQSRSIKEIEDDMNTIENEIDDIEIDDEPIKLR
IANYPVDWSGKPIPYWVYKFHELGVEYKCEICGNQSYWGRKAYEKHFQEP
RHSYGMSCIGIPNTLHFHHITKIKDAMELNKKIKEINASVSFKSDKDEEY
EDENGEVMNKKTYEMLARQGLIKKRKAN
>PPL_12522 g8876, 
MLTDHFNILVFNSTRIYGILVTQNDSFMLSSGSNNSSGNGVAPTSSSSAA
VANSTPTTSSNNTYQISQHNTPAKTMIFHNNNNNQNQNNNNNNNHSGSLN
GSGSVGNGGYQMIFSPALSSSSSSGSLIMSDTASTINENYTQDEDTEDDY
DYDEYYDDDDLSSNLSSSTTITNNNGGGSSGKGRLNSSSPQPKEKNHSRG
KWTPEEDEILRKAVSDNNHKNWKKIAEQLPGRTDVQCHHRYQKVLHPSLI
KGAWTKEEDDKVRELVAKFGAKKWSEIALHLKGRMGKQCRERWHNHLNPN
IKRDAWTTEEDKIIKEMHDRYGNKWAEIAKHLPGRTDNAIKNHWNSSMKR
VTTKKETTTQKKSTGTNGSSRKRKTDSHDNNNNSNNNNNNNNNSNNNNNN
NDSNNNNNHNNNNNFEMSTHLVSPFKEANLNLSLDVPSLQQYITGINSSP
KQKESPLRINNLTQDHHFQNIITPIKPFSNPGNSKKKARIDFSPSKQTDI
FNSDLPLPDLLLFSPIQKVHDRSILETSELSPLKSPFHNTFFDTPYKNAS
LYDHYFDSPSKFQPFSPFKSNPSHSLPSFNVLSPSQPNNNYNNSFYHKNP
STPNAILPSSPYHSLSLMSPSKQYQALPQPTTTTTTATSSIISAPTSQYS
SLSGKFDSSHRKIIGIQLSDKGIDKNSLNTINSKLKGIDTSATSTPSSIS
SSDQHNTSGNNLFVPTTPFKDPLPLYHNSPSAQFLSNNSSAFSTPGSSRD
RSRMSQKYQQPLFFDQDTNPSPYKKSQQQQQQSYNSDTSSNHDNINNSVQ
NNNHNNNNINESVNKNMNIMYNEQSDCSSANTTPSDCSFEALKLLKDNSK
HSIFTRAKQILERSNDQTLKISNISISDIQVPSSPSTSVSNFIYILTPSL
MRKSSMTENNNNNNNKPHQQHSTTTTTTNTNNNNNHNNSNNNNNHNNNNG
NAQPNSLATAPQQQQPNTNNFNYNQSLHTSIIST
>PPL_12567 g8917, 
MCDWAEAVAADGKKFYYHKITRVSVWEKPEELKNYEANFQQYTAGGGAGA
SSTSASSNQHHRHQHQYHHPSSASQQLPPNWKEYTTPEGKKYYHNELTKE
TKWELPTAITNVIPSSSTSSSSFPPISTSQPENNNNNNNNSNSSSTSNLN
SSSNNNNLKESGDGNNNNSSSSSSSISSNIGNKEMDKDSANKIFKELLND
NDVGSTWSFERAQKIIINDDRYQVLKTMSERKMVFQEYLVDRKKFELEEK
RKREKRNREEFVKLLKESPEVTLTMSWRRAQLYFDGDPKWDAVESEKERE
DLFRSYMVDLEHTEKDEREQAKRDQIRQLRHKFESDPTINLKSQWRKVKD
EYEADPLVVAMDRFDVLTTYENYIKDLEKKEEEIQRKDRERLKRDARKYR
LLFREFLNEKYQNGELHAATKWKSFYKKYNGLSVFENLSTQTTGSTPLEL
FTDFQEEMEDNYDKDFKKIKDIIKDLNYQYKPKTTLESLKEDLSKHEKYN
SILPANLPPFLLYLEEREEKKLREIEKRRREAISNFKVLLEETSSISKHS
TWSEVRPLISGASDFDRLEDEQEREKIFNQYLEYLSNEESDEEGIIKSDG
DDNGSRRESFSTKKRLSSAIDDSDRKKKKERSSSHY
>PPL_09416 g8993, 
MSHQQKDNKNKKFGGGGGGGSGGGGGYNNSPQQKHQSPKQHHHHGGGGGG
HHHNNSPQQQSHSNNNNNSSSGQLDDSTKRMKERTVFMSMNLIGHHVAVQ
LKNGDCYEGILTSTNTSQVGWGCALKFARKREVSPPSIITTAPIPQLVID
AKDFLGLTATGIVFDNISQSAFGKEAQFGFHTDTDISGHDGVIRERELTP
WVSEDGGENLESVKINPANANWDQFATNEKLFGVKTSYNEDLYTTTLNRD
SDHYRTRIKDAERLAAEIESKQSNNIHLMEERGLIRAADYDEEERYSSVI
RNGTSSSSTTSPPNNNDKSKNMMPTSNSNVYIPPSKRGSVTGAAGTSSPS
IPPLSTQKVSPTTTTAAASTTSPTTKQSTTTAATTPVAAAAASAKPTASV
EEKSTTTTTTSTTTSTAAASTTSPSNNSPSHQQQQQHFKESTSGSTGSLL
KDSPVTKLRLRDRAGSIDHNDSLLNSPRDGQSPRTLQSYNKIRAAIVAEK
LRNSAEPRSPLFSPLVSDPVGLSALSLDVSKPNISEDTIKEFNEFLLTKS
TTEQPPSADRKSQIENLKNFSRDLSRSRPGSPLIGPNSPRPMSNLSSISL
SGALSPRTSASDIPVIRPIATAPGAATTTTTTTTTSTTTDDKEKEKETTT
DATAVSSSADTKKADPATKSETTTDKPADTVAKPISKLKLNPNAKEFTPV
SLNSNAPVFTPKNFIPATVAKQGLLGSGNIEFYEANSRANTYPNISINDL
YYESMKRRQQNPEQSNAPSTYWNESYGVRGSSQYGADDDYMPPAQYPPAM
RPPFIQMGVPAIIPTYYPPPVANVAPGAPGVPVPVKSMKPIYNPQPRGQP
YAPPPPLLQAPGAMGQPPPQYVFPPQFQYVPQVYPPPGGPHTMPPKRSYY
PGQNPSNGYQPIQPHGIMLPQNTSQPPSPQHQSPQIPSPTSPPHHSRMVP
SSPQMINPMYPYPMIQRYPPHGNDPNATYPPPYN
>PPL_09824 g9376, 
MGIPAFFRWMVDKYNGIIVPTKEPRHQDGSRVVCDNSEPNLNGEFDNLYL
DMNGIIHPCAHPEQGPKPRNTQDMMDSIMEYLDLVFAIVRPRNLLYMAID
GVAPRAKMNQQRSRRFRSALDARLTREKEAKELMENIANGKMSESEVEAL
QKAKDEKFHFDSNCITPGTMFMDLVALTLRSYVAEKVSTDPAWKNLKVII
SDASIPGEGEHKIMDYIRKQRAQKDYNPNQKNVIYGLDADLIMLALATHE
PNFEILREFVQTKGRKQQSTPKEMQGGDDGEEEKKDFLTKDYQLLSLNLF
RDYLDSELKCSPAFGFDIERVIDDFILICFFVGNDFLPHLPSLQINEGAI
DRIMKIYKDLLPTFDDYLTEDGEFSIDRVGKIFAKLAMVEEDIMMRRKSK
EEAMIRRKARMDGNFQQLQEDPQSLNLNSGTTKQHQEAAKNLLNEIFTET
DDTQRPNKKLRVEDGAANKSAAAKIREELLAQKKGTTSNKDAAKQLQQSI
VEQSTTAPADGKKKGEKRGAKVIEIMEVPVEEKKLPSGNKKRKHGEEEEE
EQSNSNGGEGQSLESSTFIDRMKDVKIGTKGWRERYYNHHFESEAKEDDN
VVLHVCQSYIDGLAWVLKYYYKGCASWGWYYPYHYAPFIIDISENIELIK
PATFDLGSPFRPFEQLMSVLPKESGQFVPKPYQKMMGIDDGSGDVSPIIH
FYPSNFMIDVQPSQPIWKGVCLLPFIDEKELLSSLKSLENKLTDEEKFRN
SQGAELLIVNKDLKLTNDENAADKDHSSIDSKVSPNFLGDISKLPERITK
KLPPMQSAIAYSYENPAYPEGYVFKSEMLPNAVKAPRTDITSLRFSIQNS
AANRMINHAVDSNQYKNKHFNRNQGYNNQNNNYNKNYNNQNNNYNNNNYN
NNNNNYNNQYQNNNNNYNNNNNNYSNQNYNNNYIQNNNPFNNNNSNYNQN
NNYNNNNNYNQNYNNNSNMNYNNQNMMNYNNNQNNFNSNYNQNNNSYNQN
NNYNQNNYSNQSVMSYNQQQQQQQWNQQQQQPQQNIMNNSNQQQQQFRQN
QNYNNNGQHQRRNNNNNNNNNGNNNQQPQRSKYNPFAKLKK
>PPL_01155 g991, 
MGITGLSAYLSEELGFQSSTISSNSNNNNNSNIVDENSNNNNINNGGKKF
NTNRANEPRKADHVFIDMNGIIHKQVRRNSNSELTVDRVKRDLIDTLKNI
MKGSGVFYHTKSIQFIFDGPGSRSKILLQRKRRSKKIEDLVDTKVNASLI
TPGTSFMGEMKQLLLDYSKKLLRESQNLNLKDIHVSGSDRWGEGEFKIFE
HINSMNWKENTNVSIFTCDSDTILYALLSDANIRIHDLYDPKSYKDISKL
KQELSLLVPQRDKKQVFVDFVLINLFRGNDLLPALQSFNFDSVWQAYVSS
PDKLGVYNLETSEINWELLLDLLSKNRIINSPQTKISIVGEFRTLLAVLA
RDLKREEKLDFQLSMLEVGEKNVVDIVGVFDGQTFKEERVKSDTQTKTRF
LKKFFDIDHPFWTKYKPLLTREQIDGLVRRMATSMANYVTANAEEPSIEQ
YVEGIVWQMKLMRGQCTNFQFHYPYFFGPRVDDIKPVAIPKDTGVSVPPL
LPLQFCLALTHAQAKNNVNQIFHPMFEELPHYSLLQHVSTDAWKHPEALD
QLNQSFKKYVDTSKLTEFENSQLSFHPTLHLSKHNNSIFLREEILNSKQY
TSPKHLETIPYKPKALNQNINNNNNFNSNNSYSSNYKSNYNINRVSKGAM
QYQSRFGIKDSTTTTTTNTTTTDSQVNQGVINAVRNYFTTNRTPTIQTTT
SLIPTIKYSSSIRSLLFKLIK
>PPL_10444 g9918, 
MAKNEDVNKEVEHINSNNSIINNNNNNSDNNNQQNNQNTLKISRLPPPLP
PTSNNNNSNSNSNNNNISSLNNVTITVQPPPQQTTTQQQQQTTTPLPPMS
QHQQFQHLDFYSDGESDRMTRDNNNSNASEHTKTRPRRPSSPLAFLRSLS
PKHKEKEKDKKKEKHLKSQMEQLSVSPPRGMGVHNYYNHESTPSTSPNIS
LSTTINNNNNNNNNSISLVTSFNNNNNLVTSTGNNNNNNLITSNRKFSGA
SLVDSGMSSGGSSLMSSNGDSRFNSLPHTVILRILNFLVQSRIDGEKSEI
KLEYRSPPMPHQKSAAGGQESSLSSSGQPLSNSLQSSINNNSGVKVNPNQ
DLQTMSLVCKYWAKEITPQVFHHFVVKSPKHLKSLIRLVSKGIIDGGRRF
NFYYVAMILDKSSSFQKFINILKHTMPDRLPDKFYKATSKPIMTNVFSKS
LFTSFFDNCSTMEYFRFYQRWVSPDNFEAIGHALRTNHSITHISFRNNNL
DDEFVVDIIQALHENNTIQILDFRLNKLGNQTAISLAGALLKNRSLTCVD
LFYNAIGPEGGVAIANSLRTNRTLRKLYLGWNHINGQTASILSESLKVNN
VIESIYLDRIDDHSGSLLAESLAVNTSVTELNLADCQLKQFTAKALGSAF
KVNKSLADLNFRCNQLGADLKDISQSLSVNHTLTRINLSDNRINDESGRL
LAESLKTNHSITSLSLSLNQLGNKFADEMGVALLENTTLKLLDLSNNQIE
FTGAQHIANALASNSTLKLLNLCQNSLSSKFGPLIAYSLTQNKSLTHLEL
AYVGIGSAGAVSLAKAVKDNIHLRKLNLSENQIGDDGALAFADAIKSNQF
LYVLDLSYNNFTYRVKEVFEKIQEQNNTLQFSISSVPLHWKFQL
>PPL_05499 rtc1, ortholog of RTC1 which catalyzes the conversion of 3'-phosphate to a 2'3'-cyclic phosphodiester at the end of RNA
MTKTKRNFKPHNKKTKVPQAPATSSTNSKDDSSLPEVEVNPSFVLDGSIL
EGGGQILRNSIALSSLLSKPVRIEKIRYNRDQPGLKAQHKAGVDLVSRMF
KAHTDGVKQGSTVLYYHPRISTTNIKDQSIEADTGTAGSITLLIQIALPC
LLFTPKSTKLDLGGGTNVDFSPAADYLMNVFFPIAKQFGINSNMEVLKRG
YYPRGGGKVSLITQPIKGTLNPISILKKGNLVKFTIRVFFTSTRISAEVG
DRMLNAARKMIKKDYKKVEIVEELVDTAKYTFGDGCCIFITAETDTGCLY
GGSANGAIGVPAEKVGEDAATSILNDLLHGGCMDEYLQDQLIIFMALAKG
TSQIKTGPISLHTETSIHFTSLLTGAKFQVKPAEDKQRGEDTFIITCEGV
GFENKSSETKSDEEITEQNGNDDNTSTTTTTTTSSSS